Loop Cluster - exclude examples

aarapidi
aarapidi New Altair Community Member
edited November 2024 in Community Q&A
Hi,
I have used Rapidminer to create many files with clusters. Therefore, my data file has many examples with a set of numeric attributes, a label, and a special attribute - cluster attribute. Next, I am trying to use (Loop Clusters) process to run classification models within validation process which is primarily working for the data files I have. However, some data files have clusters with one or 2 examples or observations; therefore an error is generated whenever the  loop reaches any of the clusters with small number of examples. My question is how can I exclude clusters with low number of examples in the Loop Clusters process.
Thanks

Answers

  • David_A
    David_A New Altair Community Member
    Hi,

    you can extract the size of your cluster subset with the Extract Macro operator and the option "number of examples", next you use a branch operator, set as condition that the number of examples has to be greater two, and perform your model building in the Then-branch. The Else-branch can return an empty example set or some dummy data, whatever fits best.

    Best,
    David
  • aarapidi
    aarapidi New Altair Community Member
    Thank you!

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.