Weka Random forest constantly better than Random Forest Rapidminer

Question

hi,

I teste W-RAndom Forest and Random Forest from Rapidminer on the same dataset, for W-RF, I got around 89%, whereas for Random Forest I got only 76%, why is that? I thought the Algorithm / Method is the same? Are the implementations so entirely different that I get such a performance discrepancy?

Fred12 · Answer

is there a solution found to that now?

I think it is rather the Random forest implementation from Rapidminer that causes the results rather than any parameter settings...

I mean its a quite big difference, someone should check that...

Fred12 · Answer

that might be the problem, I used gain ratio I will try out information gain

EDIT: with information gain I also got around 77%-... but my dataset is far harder than iris data...

Thomas_Ott · Answer

Are you comparing it with the same splitting criteron? This post says that W-RF uses information criteron to split: http://stackoverflow.com/questions/30150970/what-splitting-criterion-does-random-tree-in-weka-3-7-11-use-for-numerical-attri When I do that, the results of the attached Iris data set works the same. The model created in the Training step is applied to the current test set (10 %).
The performance is evaluated and sent to the operator results. The model created in the Training step is applied to the current test set (10 %).
The performance is evaluated and sent to the operator results.