Clustering by variable
Hi everyone!
I'm working on a group project with Rapidminer and my classmates and I are trying to divide our data into some clusters, but we don't know how to chose the variable to do the clustering since it seems like Rapidminer automatically uses the one of the first column of the dataset we use.
We wanted to define them by frequency but in the screenshots you can see the results we actually got.
Can anyone please help us sort out how to proceed if for instance we want to create these clusters by frequency?
I'm working on a group project with Rapidminer and my classmates and I are trying to divide our data into some clusters, but we don't know how to chose the variable to do the clustering since it seems like Rapidminer automatically uses the one of the first column of the dataset we use.
We wanted to define them by frequency but in the screenshots you can see the results we actually got.
Can anyone please help us sort out how to proceed if for instance we want to create these clusters by frequency?