Greetings!
I'm new to data mining, and I'm currently interested on learning kMeans... and I've got some questions for you guys.
My sample dataset consists of 49 records, each having 60 attributes/values.
I want to learn how the computation and assignment for the means/centroids is done.
I would also like to ask if my operators for this clustering algorithm are correct:
Root
|__AccessSampleSource (I chose this one because my database format is MS Access 2003)
|__MissingValueReplenishment (set to zero)
|__KMeans
For the visualization, I always choose Scatter Multiple, having the x-axis as the cluster, and some of the attributes (usually 15 attributes) as the y-cluster.
Am I doing it right?
I hope someone could enlighten me soon!
Thank you, and more power to RapidMiner!