Most of the modern ML algorithms implemented in RapidMiner include adjustments for perfect multi-collinearity if needed, so dummy coding is actually just fine. But the Nominal To Numerical operator supports the n-1 encoding approach as well, just select the "effect coding" option in the coding type parameter instead of dummy coding and then specify the omitted categories in the resulting "comparison groups" dialog box. This is tedious for a large number of attributes, though, so if you can use dummy coding, that is preferable.

View in context

Dummy Encoding in Rapidminer

Guys, please share your thoughts.

Regards,

Find more posts tagged with

Quick Links