"Excel Text Processing-Frequency"
waynestate13
New Altair Community Member
Hello All,
I am new to RapidMiner, and after watching some YouTube tutorials and going through the built-in tutorial I have some questions about text processing.
I have an Excel file with about 300 comments (all comments on one line). I would like to process the data so that RapidMiner can ultimately come up with association rules. However, I am having trouble processing the data.
So far I am using Read Document (loading the data from a txt file), followed by Process Document, where the vector is built from term occurrences. Inside Process Document there is a tokenizer that splits on an expression (&). In the text file, I have added an & after each comment.
However, I cannot even get the data to read.
Thanks in advance for any feedback.
Answers
Hi Wayne,
I think the problem is the '&', but without seeing your setup it's difficult to say. In a regex, an ampersand may need to be escaped so that it is read literally (& -> \&) rather than in any special regex role. Just a thought.
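One quick way to check whether the delimiter pattern is the culprit is to try it outside RapidMiner. Below is a minimal sketch in Python, not a RapidMiner process: the sample text and the helper names `split_comments` and `term_occurrences` are made up for illustration, and Python's `re` module is only a stand-in for whatever regex engine the tokenizer uses. In Python, at least, the bare and the escaped pattern split the text the same way, so if both behave identically in your setup too, the problem is more likely in how the file is being read.

```python
import re
from collections import Counter

# Hypothetical sample standing in for the txt file: comments separated by '&'.
raw_text = "Great service and great staff & Slow delivery & Service was slow"

def split_comments(text, pattern):
    """Split the text on the delimiter pattern and drop empty pieces."""
    return [part.strip() for part in re.split(pattern, text) if part.strip()]

def term_occurrences(comment):
    """Count lowercase word tokens in a single comment."""
    return Counter(re.findall(r"[a-z']+", comment.lower()))

# Try both the bare and the escaped delimiter.
comments_plain = split_comments(raw_text, r"&")
comments_escaped = split_comments(raw_text, r"\&")

# One term-occurrence count per comment, similar in spirit to a term-occurrence vector.
for comment in comments_escaped:
    print(comment, dict(term_occurrences(comment)))
```

If the split gives one piece per comment here but not in RapidMiner, compare the exact expression entered in the tokenizer with the one used above.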
Hope that helps!