Text filtering problem! Please help!

karhunen
karhunen New Altair Community Member
edited November 5 in Community Q&A
Hey community,

I'm new in working with rapidminer and I try to filter multiple words from different pdf-files.

First I tried to filter just one word after tokenizing the files with the "Filter Tokens (by content)" Module.
I used the condition "contains" and specified my "string". This actually works fine.
Now i want to filter multiple words but i just dont know how to do this.

Can you please help me? I would really appreciate it!

Background:
I'm trying to classify some documents by using a wordlist with positive and negative words.
Rapidminer should analyse the given pdf-files regarding the amount of positive and negative words.

Any ideas?
Tagged:

Answers