[SOLVED] Assigning "Topics" to Text Clusters
Hi All,
I recently used the k-Means operator in a process to cluster several thousand message board posts. Now that I have the clusters, I'd like to somehow "classify" them based on the type of content/keywords they contain.
Any recommendations on how best to do this? I read a paper on the ROCK algorithm, which somehow assigns topics to documents based on keyword frequency, but it doesn’t appear that we have this algorithm in RapidMiner.
Also, how do I know whether I am producing a reasonable number of clusters with k-Means, given my content?
Thanks!!!