"confusion matrix in rapidminer for clustering"

Question

Hi ... 
In rapidminer, how can I compute the confusion matrix for the "clustering results" (assuming the actual classes are provided with the data, in order to evaluate the performance of a clustering algorithm, say k-medoid ?
Thanks.

kypexin · Answer

Hi @SamiRami

It could be easier to help you if you could share here actual dataset on which you want to produce confusion matrix and evaluate performance metrics.

SamiRami · Answer

I am just star testing rapid miner...
Can you please provide me the processes needed in sequence along with its parameters setting.
Appreciate it ...

kypexin · Answer

Hi @SamiRami

I'd add one concern here, technically you can actually use PERFORMANCE (CLASSIFICATION) operator on an arbitrary dataset, you only need to be sure that there's an attribute of type 'label', which indicates actual class, and another attribute of type 'prediction', which indicates model predicted class. If you already have a dataset representing this, you can use SET ROLE operator to define label and prediction columns respectively.