[SOLVED]The approach for filtering non-letter tokens
Answers
-
Hi,
first of all, you have to configure the Tokenize operator to use a splitting pattern appropriate to your problem. By default, it splits at "non-letters", you could change it to e.g. split by all space characters.
Then, to filter, you can use the Filter Tokens operator with a customized pattern.
If you have probems with the regular expressions, please post again.
Happy Mining!
~Marius0 -
Marius, Thanks.0