🎉Community Raffle - Win $25

An exclusive raffle opportunity for active members like you! Complete your profile, answer questions and get your first accepted badge to enter the raffle.
Join and Win

"Add spelling filter"

User: "yijun"
New Altair Community Member
Updated by Jocelyn
There is filter in text processing to remove dictionary words (stop words). Is there a filter to remove none-dictionary words?

One of the use is to filter words NOT in user-file. If the user-file is "linux.words", English dictionary, then this will remove none-English words. This is useful when we want to remove bad words from poorly scanned collection of OCR text files.

Find more posts tagged with

Sort by:
1 - 1 of 11
    User: "MariusHelf"
    New Altair Community Member
    Hi, this is currently not possible out of the box with the Text Processing operators. You can however transform the document into an example set and then use the standard RapidMiner operators to remove all words which are not contained in a dictionary.

    Best,
    Marius