How to add filename to the wordlist output?
sridhar
New Altair Community Member
Hi,
I am processing text files and want to add the text file name to the word list output.
I would like to see the output as follows:
TextFile_Name | Word     | Occurrences
---------------------------------------------------
R1.doc        | java     | 2
R1.doc        | oracle   | 3
R1.doc        | database | 1
R2.doc        | sql      | 1
Can you please suggest how to achieve this in RapidMiner?
Thanks a lot for your help!
Regards
Sridhar
Answers
Hello
The blog post at http://rapidminernotes.blogspot.co.uk/2013/04/counting-words-in-lots-of-documents.html has an example where the file name is used in a text processing context. You could use this as a starting point.
regards
Andrew
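For reference, the table shape requested in the question (one row per file and word, with its count) can be sketched outside RapidMiner in a few lines of Python. This is only an illustration of the target output, not a RapidMiner process and not the method from the linked blog post. The function name `word_rows`, the sample texts, and the simple regex tokenizer are all assumptions, and real `.doc` files would first need their text extracted.

```python
import re
from collections import Counter


def word_rows(documents):
    """Return (file_name, word, occurrences) rows, one per distinct word per file."""
    rows = []
    for file_name, text in documents.items():
        # Lowercase, then treat each run of letters/digits as a word.
        # This is a simple stand-in for a real tokenizer.
        counts = Counter(re.findall(r"[a-z0-9]+", text.lower()))
        # Sort words alphabetically so the output order is stable.
        for word, n in sorted(counts.items()):
            rows.append((file_name, word, n))
    return rows


# Hypothetical, already-extracted document texts keyed by file name.
docs = {
    "R1.doc": "java oracle oracle database java oracle",
    "R2.doc": "sql",
}

for file_name, word, n in word_rows(docs):
    print(f"{file_name} | {word} | {n}")
```

The key point is that the file name is carried along with each count, so the result is a long table keyed by (file, word) rather than a single word list aggregated over all documents.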