"Clustering data"

mskh · March 2019

Hi,
I calculate standard deviation and average of my data set. I want to cluster my data set into 3 clusters which cluster_0 consists of data between 0 to average, cluster_1 between average and average+standard deviation and cluster_3 consists of data between 2*standard deviation to maximum value. which clustering technique i should use?
Thanks

Telcontar120 · March 2019

If you have discrete cluster identities in mind, then this really isn't an application of clustering. Clustering techniques are generally non-deterministic and unsupervised ML algorithms.
But you can easily code your "clusters" manually in RapidMiner using some if/then logic within Generate Attributes, or Discretize by User Specification.

"Clustering data"

Answers

Welcome!

Welcome!

Quick Links

Categories