Hi
With RapidMiner is it possible to automatically collapse the classes in a learning set on a given number of classes by their cardinality so that variance? The goal is to improve the precision of methods such as SVM and KNN.
I have a learning set of 20.000 elements divided in more than 100 classes, with high variance in the number of elements and I need to reduce them to 20 classes.
For example:
Class A - 3 elements
Class B - 4 elements
Class C - 8 elements
It would be nice to have the opportunity to reduce to a given number of classes, i.e. 2 this way:
Class 1 - 7 elements (obtained by Class A and
Class 2 - 8 elements (obtained by Class C)
Please, help me!! I'm trying with operations research methods but have so less time...
Thank you!