Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Community Q&A
Student Dataset is giving different classification accuracy using cross validation on RapidMiner 9.6
VikasRattan
Student Dataset is giving different classification accuracy using cross validation on RapidMiner 9.6(educational version)
Find more posts tagged with
AI Studio
Accepted answers
varunm1
Try to run the attached process without changing multiple times as see the results. I enable random seed for SMOTE, Cross-Validation & random tree. You can import this process by going to File --> Import process. You need to set a random seed for all operators that have that option. A random seed will help generate the same data all the time and even in the random tree, it will do the same randomization. These are critical to producing reproducible results.
Let me know if you still have issues.
Student_Vikas.rmp
All comments
varunm1
Hello
@VikasRattan
Did you set "Random Seed" option in cross-validation? If not, your folds might be divided differently during different runs. Also which algorithm are you using inside cross-validation?
VikasRattan
Varun Ji, I observed it for Random forest, Random tree, Knn, Naive Bayes. I have set Random seed, which is 1922, and without setting random seed. In both cases, got different accuracy on different runs. Even though, i used startified sampling, shuffled sampling and linear sampling, i got different accuracy when executing at different point of time.
varunm1
Can you share your .rmp file? You can go to File --> Export Process and then attach that process here.
VikasRattan
Sure Sir
File is attached.
Student_Vikas.rmp
varunm1
Try to run the attached process without changing multiple times as see the results. I enable random seed for SMOTE, Cross-Validation & random tree. You can import this process by going to File --> Import process. You need to set a random seed for all operators that have that option. A random seed will help generate the same data all the time and even in the random tree, it will do the same randomization. These are critical to producing reproducible results.
Let me know if you still have issues.
Student_Vikas.rmp
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups