🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

how many trees and datasets are used to optimize random forest?

User: "IqbalMalikAlfaruq"
New Altair Community Member
Updated by Jocelyn
I'm making predictions that produce fast, medium, and slow predictions. I used 100 trees and around 1000 data training. but always returns fast prediction.

Find more posts tagged with

Sort by:
1 - 1 of 11
    User: "BalazsBaranyRM"
    New Altair Community Member
    Accepted Answer
    Hi!

    Do I understand correctly that you're doing classification and your classes are fast, medium and slow?

    Sometimes datasets are not suitable for a particular machine learning algorithm, or its default parameters. Sometimes they are imbalanced and then the "best" approach for a machine learning algorithm is to predict the majority class.

    Take a look at your data. Is fast overrepresented by a large margin? If it is, can you downsample the class? 
    Do decision trees, naive bayes, k-NN give you the same result or are they better able to cope with the data? 

    There are videos in the RapidMiner Academy for topics like sampling and validation that could help you.

    Regards,
    Balázs