Hi,
I have been experimenting with RapidMiner for a couple of weeks now and am very impressed by the possibilities it offers and by the very helpful team. Over time, a few ideas have come to mind. I'll post them as a short list and let you decide whether any of them are any good:
- More information when viewing a decision tree model: In addition to the graphical representation of the label distribution in each node or leaf, it would be nice if one could hover over a node/leaf and see the distribution in absolute numbers (how many training-set cases of each class fall into the current node/leaf).
- When doing parameter optimization, only the performance of the best combination is returned so far. It would be nice if one could also see how other combinations performed (e.g. the top n combinations, where n is a user-defined value). There may be combinations very close to the best one that have other advantages which make them more desirable.
- I would always like to see the final model at the end. Currently, this is not possible with all operators. For example, the Optimize Selection operator trains a model but does not let you see the final model at the end without adding another model training step that uses the selected attributes.
- Stacking using probabilities instead of, or in addition to, the final labels. See http://rapid-i.com/rapidforum/index.php/topic,2744.0.html
- A "stop subprocess" button that lets you exit an "infinite loop" without canceling the entire process. See the end of the first post in http://rapid-i.com/rapidforum/index.php/topic,2745.0.html
- Difficult to implement and not so important: Graphical representations of more models, e.g. a 2D representation of an SVM showing how the decision boundary separates the data. Something like this: http://kernelsvm.tripod.com/
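To make the parameter optimization idea a bit more concrete, here is a minimal sketch in plain Python (not RapidMiner code). The names top_n_combinations and toy_performance, the parameter names C and gamma, and the grid values are all made up for illustration. toy_performance merely stands in for "train a model and measure its performance".

```python
from itertools import product


def top_n_combinations(grid, evaluate, n=3):
    """Score every parameter combination and return the n best, best first."""
    names = sorted(grid)
    results = []
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        results.append((evaluate(params), params))
    # Sort by score only, highest first, so ties never compare the dicts.
    results.sort(key=lambda item: item[0], reverse=True)
    return results[:n]


def toy_performance(params):
    # Hypothetical stand-in for a real train-and-validate step.
    return 1.0 - abs(params["C"] - 10) / 100 - abs(params["gamma"] - 0.1)


# Hypothetical parameter grid; a real one would come from the user.
grid = {"C": [1, 10, 100], "gamma": [0.01, 0.1, 1.0]}

for score, params in top_n_combinations(grid, toy_performance, n=3):
    print(round(score, 3), params)
```

With a list like this, one could pick a runner-up that scores almost as well as the best combination but is, for example, cheaper to train.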
Thank you very much for considering these ideas. Best regards,
Hanspeter