"Clustering data"

mskh
mskh New Altair Community Member
edited November 5 in Community Q&A
Hi,
I calculate standard deviation and average of my data set. I want to cluster my data set into 3 clusters which cluster_0 consists of data between 0 to average, cluster_1 between average and average+standard deviation and cluster_3 consists of data between 2*standard deviation to maximum value. which clustering technique i should use?
Thanks

Answers

  • Telcontar120
    Telcontar120 New Altair Community Member
    If you have discrete cluster identities in mind, then this really isn't an application of clustering. Clustering techniques are generally non-deterministic and unsupervised ML algorithms.
    But you can easily code your "clusters" manually in RapidMiner using some if/then logic within Generate Attributes, or Discretize by User Specification.