🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

"Excel Text Processing-Frequency"

User: "waynestate13"
New Altair Community Member
Updated by Jocelyn
Hello All,

I am new to RapidMiner and after watching some youtube tutorials and going through the tutorial I have some questions regarding text processing.

I have an excel file with about 300 comments (all comments in one line).  I would like to process the data to ultimately have Rapid Miner come up with association rules. However, I am having trouble to process the data.

So far I am using: Read Document (uploading data in txt file) then from there process document where the vector is set up based on term occurrences. Inside the Process Document there is a tokenizer that is tokenizing based on an expressin (&). In the text file, after each comment I have  included &.

However, I am not able to even get the data to read.

Thanks in advance for any feedback.

Find more posts tagged with

Sort by:
1 - 1 of 11
    User: "haddock"
    New Altair Community Member
    Hi Wayne,

    I think the problem is the '&', but without looking at  your setup it's difficult to say; in regex use ampersands need to be escaped in order to be seen 'literally' (& -> \&), and not in their regex role. Just a thought.

    Hope that helps!