how to select important variables for my model

ed1305
ed1305 New Altair Community Member
edited November 2024 in Community Q&A
I have a large data set .I want to reduce the attribute count and keep only the important ones.So how do i do that in rapidminer using the whole data set as input
Tagged:

Best Answer

Answers

  • varunm1
    varunm1 New Altair Community Member
    edited March 2019
    Hi @ed1305

    You can use feature selection techniques like "Optimize selection" operator or any other relevant operator provided by RapidMiner.  You can find how to use them in a tutorial of that operator.

    Sometimes, I just run automodel to check which attributes are in red, yellow and green and select only attributes that are in green and yellow for model training.
  • Brian_Wells
    Brian_Wells New Altair Community Member
    If you haven't already seen Ingo's feature engineering webinar, I would highly recommend taking a few minutes to watch it (link below).  His explanation is extremely intuitive and easy to connect with and will leave you with a gut level (verses academic) understanding of how it works and its true power to transform a dataset so that it can be represented by a wider array of models.  This could drastically reduce training time, make implementation easier, and allow the model to be more easily explained to the traditional skeptics of more advanced "machine learning" techniques.