How to import and label multiple txt datas?
Kebab68
New Altair Community Member
Hello, I have 3 folders containing txt files. Each folder represents an attribute that I would like to use for naive bayes classification.(For example folder 1 represents spam, folder 2 ham, folder 3 Neither ham/spam)
How can I import the txt files,every txt file labeled correctly and apply naive bayes on it? I already installed the text processing extension. I can import every txt file from one folder(with "process documents from files") but they aren't labeled. Thank you
How can I import the txt files,every txt file labeled correctly and apply naive bayes on it? I already installed the text processing extension. I can import every txt file from one folder(with "process documents from files") but they aren't labeled. Thank you
Tagged:
0
Answers
-
Hello @Kebab68
There is a loop files operator that will enable you to read all the text files. I understood that you want to label all these files based on folder names. If that is correct, then you can look at the below thread where @KostasBonikos explained very well on extracting folder names and generating attributes with them
https://community.rapidminer.com/discussion/52499/loading-folder-names-for-text-processing
Hope this helps. @kayman might add more details.
Thanks1 -
Not much too add
In essence all is described in the mentioned thread, use the foldername as reference, generate an attribute based on this foldername and set it to label using the role operator.0