RapidMiner's Auto Model Clustering for iris dataset


1. Introduction

Iris dataset is one of the most famous basic dataset for machine learning. This is a test of RapidMiner's Auto Model Clustering. The data includes 50 each for setosa, versicolor, and virginica, for a total of 150. 

https://www.embedded-robotics.com/iris-dataset-classification/

2. Clustering

Select Clusters after load data in RapidMiner's Auto Model

Select Input variables. 

Set 3 for Number of Clusters

The result shows proper classfication. 

Cluster Tree shows the main differences between the clusters. 

 

Centroid Chart shows the values for the cluster centroids in a parallel chart.

Scatter Plot displays a scatter plot in terms of the two most important Attributes.

 

3. Conclusion

RapidMiner Clusters worked well. The clusters almost coincides with the original classification.