Count wordlist occurrences from data

Question

Hi, I want to use rapidminer for sentiment analysis. Currently I am struggling with what I presume is a very simple question, however I am unable to solve it. I import data from a repository, one of the fields contains text. I also import multiple text files, using 'Process Documents From Files', with different sentiments like: positive and negative. As a result i want to have something like this:TextpostivenegativeThis is a bad text01This is a good text10 The occurrences of positive and negative words from every text entry from the repository. I currently use this but it does not work: Sorry for the newbie question. Thank you in advance for helping. Vincent

vincent · Answer

Sorry for my late reaction.

I think you understand what i would like to achieve however. I do not see how this is possible with the post you referred me to.

Do you have a more specific solution?

Vincent

MartinLiebig · Answer

so you want to count the number of occurences of the words in the dictionaries (Positive.txt,negative.txt) in your file?

If so, have a look here: http://rapid-i.com/rapidforum/index.php/topic,8638.msg29140.html There i do pretty similar stuff.

This seems somehow a thing some people try to do. I might write a tutorial for this.

vincent · Answer

No, maybe i was unclear i would like to have this:

Input files: 
RepositorytextThis is a good textThis is a bad text

The sentiment .txt files (loaded with 'Process documents from files):
Positive.txt
good
great 
awesome

Negative.txt
bad
sad

Output:TextPositiveNegativeThis is a good text10This is a bad text01

Hopefully I have clarified myself a bit more.

Thank you again for the help