When AutoModel asks you to "Deselect" red or yellow attributes, is AutoModel doing feature selection
tonyboy9
New Altair Community Member
Best Answers
-
Hi @tonyboy9,
The colouring of different attributes is based off their measured quality, the results of which you can see on the right hand side columns. For example, the top 2 columns appear to be marked yellow given their zero correlation with your target. When AutoModel is asking you to deselect, it's asking you if you'd like to remove lower quality features based off the measures. Of course, there will be instances where you want to keep certain features anyway, and other times when it would be inappropriate to keep features labelled green/good.
Hope this helps,
Best,
Roland0 -
Hi @tonyboy9,
It comes down to how in depth you want to improve the input data to your model to improve predictive power. The AutoModel stage which you showed is a set of statistical measures where we grade each variable and then select or deselect based on quality - this is a first step of feature selection. What you'll find is that later on when you use AutoModel, you get the opportunity to do advanced Feature Selection and Feature generation. This would be termed as a topic "Feature Engineering". I'd recommend this video as a nice starting point on why we do this: https://academy.rapidminer.com/learn/video/feature-engineering-intro
Within RapidMiner Studio, there's a number of different ways we can approach it, in order to produce the best outcome.
Best,
Roland0
Answers
-
Hi @tonyboy9,
The colouring of different attributes is based off their measured quality, the results of which you can see on the right hand side columns. For example, the top 2 columns appear to be marked yellow given their zero correlation with your target. When AutoModel is asking you to deselect, it's asking you if you'd like to remove lower quality features based off the measures. Of course, there will be instances where you want to keep certain features anyway, and other times when it would be inappropriate to keep features labelled green/good.
Hope this helps,
Best,
Roland0 -
Okay, Roland, thanks for that. Perhaps I can take this one step further. What good are all the feature selection operators available to build a process? I tried to follow a response explaining the roles of these operators, which just reads overly complicated. Why not just rely on AutoModel decisions?0
-
Hi @tonyboy9,
It comes down to how in depth you want to improve the input data to your model to improve predictive power. The AutoModel stage which you showed is a set of statistical measures where we grade each variable and then select or deselect based on quality - this is a first step of feature selection. What you'll find is that later on when you use AutoModel, you get the opportunity to do advanced Feature Selection and Feature generation. This would be termed as a topic "Feature Engineering". I'd recommend this video as a nice starting point on why we do this: https://academy.rapidminer.com/learn/video/feature-engineering-intro
Within RapidMiner Studio, there's a number of different ways we can approach it, in order to produce the best outcome.
Best,
Roland0