X-means in RapidMiner Go

Chemical_eng
Chemical_eng New Altair Community Member
edited November 5 in Community Q&A
I have two questions about the X-means algorithm in RapidMiner Go:
1. How does it handle categorical variables? 
2. Does it perform automatic normalization of numerical variables or should I do this beforehand? 

Best Answer

  • alebo
    alebo New Altair Community Member
    edited February 2022 Answer ✓
    Dear @Chemical_eng,

    1. We use one-hot encoding on categorical columns, as long as the number of distinct values in the column is below 50. Any column with more distinct values is removed.
    2. Yes, numerical variables are normalized automatically, so you do not need to do this beforehand. The method we use is called Z-transformation. You can learn more about it here: link under Parameters / method.

    After these (and other) preparation steps, we run the X-means algorithm. The cluster result column (called Groups) is then appended back to the original data so the results are easier to interpret.

    Regards,
    Andras
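
For readers who want to reproduce the two preparation steps outside RapidMiner Go, here is a minimal sketch in plain Python. It is not RapidMiner Go's actual code. The helper name `prepare`, the input format (a list of dicts), the use of the population standard deviation, the decision to leave the one-hot columns as 0/1 without scaling, and the handling of a column with exactly 50 distinct values are all assumptions made for illustration. The clustering step itself is not shown.

```python
from statistics import mean, pstdev

# Threshold quoted in the answer; how exactly 50 distinct values is
# handled is an assumption (here: the column is dropped).
MAX_CATEGORIES = 50


def prepare(rows, max_categories=MAX_CATEGORIES):
    """Return (column_names, matrix) after one-hot encoding and z-scoring.

    rows: list of dicts mapping column name -> value. Columns whose values
    are all int/float are treated as numerical, everything else as
    categorical (assumed to hold strings). Hypothetical helper.
    """
    out_names, out_cols = [], []
    for col in rows[0].keys():
        values = [row[col] for row in rows]
        if all(isinstance(v, (int, float)) for v in values):
            # Z-transformation: subtract the mean, divide by the std dev.
            mu, sigma = mean(values), pstdev(values)
            # A constant column has sigma == 0; map it to 0.0 instead of dividing.
            out_names.append(col)
            out_cols.append([(v - mu) / sigma if sigma else 0.0 for v in values])
        else:
            categories = sorted(set(values))
            if len(categories) >= max_categories:
                continue  # too many distinct values: drop the column
            # One-hot encoding: one 0/1 column per distinct value.
            for cat in categories:
                out_names.append(f"{col}={cat}")
                out_cols.append([1.0 if v == cat else 0.0 for v in values])
    # Transpose the column lists back into one list per row.
    matrix = [list(r) for r in zip(*out_cols)]
    return out_names, matrix
```

Calling `prepare` on a small table returns the new column names and a numeric matrix that any clustering routine could take as input. The original, unmodified rows would be kept aside so that the cluster labels can be appended to them afterwards, as described in the answer.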

Answers

  • Chemical_eng
    Chemical_eng New Altair Community Member
    Thanks, that answers my question.