When AutoModel asks you to "Deselect" red or yellow attributes, is AutoModel doing feature selection

tonyboy9
tonyboy9 New Altair Community Member
edited November 5 in Community Q&A
Tagged:

Best Answers

  • RolandJones
    RolandJones
    Altair Employee
    Answer ✓
    Hi @tonyboy9,

    The colouring of different attributes is based off their measured quality, the results of which you can see on the right hand side columns. For example, the top 2 columns appear to be marked yellow given their zero correlation with your target. When AutoModel is asking you to deselect, it's asking you if you'd like to remove lower quality features based off the measures. Of course, there will be instances where you want to keep certain features anyway, and other times when it would be inappropriate to keep features labelled green/good. 

    Hope this helps,

    Best,
    Roland
  • RolandJones
    RolandJones
    Altair Employee
    Answer ✓
    Hi @tonyboy9,

    It comes down to how in depth you want to improve the input data to your model to improve predictive power. The AutoModel stage which you showed is a set of statistical measures where we grade each variable and then select or deselect based on quality - this is a first step of feature selection. What you'll find is that later on when you use AutoModel, you get the opportunity to do advanced Feature Selection and Feature generation. This would be termed as a topic "Feature Engineering". I'd recommend this video as a nice starting point on why we do this: https://academy.rapidminer.com/learn/video/feature-engineering-intro

    Within RapidMiner Studio, there's a number of different ways we can approach it, in order to produce the best outcome. 

    Best,
    Roland

Answers

  • RolandJones
    RolandJones
    Altair Employee
    Answer ✓
    Hi @tonyboy9,

    The colouring of different attributes is based off their measured quality, the results of which you can see on the right hand side columns. For example, the top 2 columns appear to be marked yellow given their zero correlation with your target. When AutoModel is asking you to deselect, it's asking you if you'd like to remove lower quality features based off the measures. Of course, there will be instances where you want to keep certain features anyway, and other times when it would be inappropriate to keep features labelled green/good. 

    Hope this helps,

    Best,
    Roland
  • tonyboy9
    tonyboy9 New Altair Community Member
    Okay, Roland, thanks for that. Perhaps I can take this one step further. What good are all the feature selection operators available to build a process? I tried to follow a response explaining the roles of these operators, which just reads overly complicated. Why not just rely on AutoModel decisions?
  • RolandJones
    RolandJones
    Altair Employee
    Answer ✓
    Hi @tonyboy9,

    It comes down to how in depth you want to improve the input data to your model to improve predictive power. The AutoModel stage which you showed is a set of statistical measures where we grade each variable and then select or deselect based on quality - this is a first step of feature selection. What you'll find is that later on when you use AutoModel, you get the opportunity to do advanced Feature Selection and Feature generation. This would be termed as a topic "Feature Engineering". I'd recommend this video as a nice starting point on why we do this: https://academy.rapidminer.com/learn/video/feature-engineering-intro

    Within RapidMiner Studio, there's a number of different ways we can approach it, in order to produce the best outcome. 

    Best,
    Roland