Why does X-means clustering always give k-min as the ideal number of clusters?
Hello everybody,
I am learning about clustering in Rapidminer. K-means clustering works fine but you must know the number of clusters you want in advance. Therefore I tried X-means but that just always gives the minimumvalue of k as the ideal number of clusters. That can't be right.
As a simple test I entered the following 20 rows:
2,3 4
2 3
1,5 4
2 4
1,5 3,5
2 3,5
12 13
11 12,5
10 14
11 14
12 14
11,5 14,3
10 2
10,2 2,2
9,5 2,4
2 14
2,2 14,2
1,8 13,8
1,9 14,3
11,9 13,9
When you plot those points it is obvious that they form 4 seperate clusters. Then why does X-means not find 4 as the ideal number of clusters? If I put k_min equal to 2 it says 2, if I put k_min equal to 3 it says 3...