"how to save the clustered result into two folders"

Question

I have a set of documents stored in a single folder. I run an unsupervised clustering algorithm, like K-means to construct two groups. Here is the workflow I created. Is there an approach that can separate the original folder into two folders based on the clustering result? In other words, I want to put the files belonging to cluster 1 into one folder and put the files belonging to cluster 2 into another folder.

MariusHelf · Answer

Hi,

if you don't have the Move File operator, please update RapidMiner to the latest version (5.2.008). You'll find an explanation of Loop Values in this thread.

Best, Marius

roya67 · Answer

Marius  wrote:
Hi,

first of all, filter the clustered dataset by the "cluster" attribute with Filter Examples. Then you can use "Loop Values" to loop over the "metadata_path" attribute. Loop Values creates an iteration macro which contains the current value, i.e. in this case the path of the document. You can use it as the "file" parameter of Move File. The choice of the second one is up to you and based on the cluster value.

Of course, instead of manually filtering each cluster value in the first step, you could use a second Loop Values to loop the cluster values.

Best,
Marius

Hi, could you please explain it more? I don't have move file. what is loop values? thanks

MariusHelf · Answer

Hi,

first of all, filter the clustered dataset by the "cluster" attribute with Filter Examples. Then you can use "Loop Values" to loop over the "metadata_path" attribute. Loop Values creates an iteration macro which contains the current value, i.e. in this case the path of the document. You can use it as the "file" parameter of Move File. The choice of the second one is up to you and based on the cluster value.

Of course, instead of manually filtering each cluster value in the first step, you could use a second Loop Values to loop the cluster values.

Best,
Marius