problem of imbalnce dataset

hello i am new in data mining and rapidminer, I have problem of imbalance data set, I wok with decision tree and naïve bayes and random foreset the accuracy of DT, NB is very good but it is not real my question is what is best operators that work with three techniques , my data set contain 1031 sample
hana mohamed
student
Best Answer
-
Hi @hanaabdalrahman,
as mentioned earlier: This operator is part of an extension and not of RM core. The extension can be found in our marketplace:
Best,
Martin
1
Answers
-
@hanaabdalrahman you will need to use the Sample operator and toggle on the 'balance data' option. Then enter the classes and # of samples for each class.
1 -
Hi Hana,
i recommend to use the SMOTE operator which is part of the operator toolbox extension.
Best,
Martin
2 -
thanks for replay...
but how i use it the class false (44) only and class true about (986)
0 -
thanks...
i work on version 8.0.001 these operator not found in it.. what is best one instead of it and how work?
0 -
Hi @hanaabdalrahman,
as mentioned earlier: This operator is part of an extension and not of RM core. The extension can be found in our marketplace:
Best,
Martin
1 -
thanks very match.. upsampling operator solve the problem
1