Home
Discussions
Community Q&A
How do Random Forests in RapidMiner support missing values?
dudwell
Does a random forest predict a missing value or does it exclude it from the final prediction
Find more posts tagged with
AI Studio
Missing Values
Accepted answers
MartinLiebig
i think we go both ways and take the average of the prediction, but i would need to check
varunm1
Generally, random forest algorithms impute missing values by an average of proximity values or mode. But if you are selecting Criterion as gain_ratio it uses C 4.5 algorithm developed by Quinlan, in this it doesn't impute values but it will calculate an impurity score based on missing values and uses it if it encounters missing values in the test set. So looks like it's not removing samples with missing values or it depends on criterion we are selecting.
correct me this if there any misconception.
Thanks
All comments
MartinLiebig
i think we go both ways and take the average of the prediction, but i would need to check
varunm1
Generally, random forest algorithms impute missing values by an average of proximity values or mode. But if you are selecting Criterion as gain_ratio it uses C 4.5 algorithm developed by Quinlan, in this it doesn't impute values but it will calculate an impurity score based on missing values and uses it if it encounters missing values in the test set. So looks like it's not removing samples with missing values or it depends on criterion we are selecting.
correct me this if there any misconception.
Thanks
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)