I am trying to count the occurencies of words in documents from a given list of words (dictionary). With Filter Tokens (by content) I can only use one single word, otherwise I only get either all words or none. How can I get a vector matrix with the occurences/frequences only of the words from the dictionary?
You might be able to modify this for your needs
http://rapidminernotes.blogspot.co.uk/2013/04/finding-needles-in-text-haystacks.html
Regards
Andrew