· ·
IngoRM Posts: 1,750
RM
Founder
February 2013
Hi,
in the "Samples" repository delivered together with RapidMiner you
can find an example for creating the desired plot:
//Samples/processes/07_Clustering/09_KMeansWithPlot
It uses a parameter
iteration for the number of clusters (k) and a Log operator for collecting the
values for DB-Index (DB) and the average within cluster distance (W). The
process log can then be inspected as a table or immediately plotted. I
recommend the plot type "Scatter Multiple" with "k" on the
x-axis and both "DB" and "W" on the y-axis. In the settings
at the bottom you could even activate lines between the points simplifying the
detection of the elbow.
I leave it to
you to determine if 3, 4, or 5 clusters should be used in this case ;-)
Screen shot 1: I tried to build the process suggested by Ingo.
Can anyone out there help me with Ingo's suggestion.
Or provide another way to get to k.
Thank you for your time.
Tony

Screen shot 2, I tried to execute the process:
