Find more posts tagged with
Sort by:
1 - 3 of
31
Hi again,
well, in case of text files I would go for the "Process Documents from Files" operator instead of the "Loop Files" as I have said before.
1. Install the Text Extension.
2. Create a process using the "Process Documents from Files" operator and configure it.
3. Add a simple tokenization inside of the operator (double click it to go inside).
4. Change vectorization mode from TFIDF to "Term Occurences".
5. Run it.
There are dozens of videos about this and other basic text mining tasks on our web site, on RapidMiner Resources, on YouTube, on...
Cheers,
Ingo
well, in case of text files I would go for the "Process Documents from Files" operator instead of the "Loop Files" as I have said before.
1. Install the Text Extension.
2. Create a process using the "Process Documents from Files" operator and configure it.
3. Add a simple tokenization inside of the operator (double click it to go inside).
4. Change vectorization mode from TFIDF to "Term Occurences".
5. Run it.
There are dozens of videos about this and other basic text mining tasks on our web site, on RapidMiner Resources, on YouTube, on...
Cheers,
Ingo
did you already found the operator "Loop Files"?
If the files are actually text data which should be transformed with the Text Extension, there is also an operator called "Process Documents from Files".
Cheers,
Ingo