Student Dataset is giving different classification accuracy using cross validation on RapidMiner 9.6

VikasRattan
New Altair Community Member
Student Dataset is giving different classification accuracy using cross validation on RapidMiner 9.6(educational version)
Tagged:
0
Best Answer
-
Try to run the attached process without changing multiple times as see the results. I enable random seed for SMOTE, Cross-Validation & random tree. You can import this process by going to File --> Import process. You need to set a random seed for all operators that have that option. A random seed will help generate the same data all the time and even in the random tree, it will do the same randomization. These are critical to producing reproducible results.
Let me know if you still have issues.1
Answers
-
Hello @VikasRattan
Did you set "Random Seed" option in cross-validation? If not, your folds might be divided differently during different runs. Also which algorithm are you using inside cross-validation?
0 -
Varun Ji, I observed it for Random forest, Random tree, Knn, Naive Bayes. I have set Random seed, which is 1922, and without setting random seed. In both cases, got different accuracy on different runs. Even though, i used startified sampling, shuffled sampling and linear sampling, i got different accuracy when executing at different point of time.0
-
Can you share your .rmp file? You can go to File --> Export Process and then attach that process here.0
-
-
Try to run the attached process without changing multiple times as see the results. I enable random seed for SMOTE, Cross-Validation & random tree. You can import this process by going to File --> Import process. You need to set a random seed for all operators that have that option. A random seed will help generate the same data all the time and even in the random tree, it will do the same randomization. These are critical to producing reproducible results.
Let me know if you still have issues.1