[SOLVED] Need to extract tokens/words based on a wordlist

Vineet
Vineet New Altair Community Member
edited November 5 in Community Q&A
Hello,
I am using the Process Documents from Files operator.
Is there any way to extract only the tokens that match one of the words in a wordlist?

Answers

  • MariusHelf
    MariusHelf New Altair Community Member
    Hi, sorry, that is not easily possible. You can only remove words that are part of a list via Filter Stopwords (Dictionary), or use regular expressions to filter tokens with Filter Tokens (by Content).

    Best regards,
    Marius
  • Vineet
    Vineet New Altair Community Member
    Thanks a lot, Marius!
  • Vineet
    Vineet New Altair Community Member
    But is there any other way to do it at all?
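
The regular-expression route Marius mentions can be sketched roughly as follows: turn the wordlist into one alternation pattern and keep only the tokens that match it in full. This is a minimal Python sketch, not part of the original thread. The function names `wordlist_to_regex` and `keep_listed_tokens` are hypothetical, and the assumption is that the resulting pattern is pasted into Filter Tokens (by Content) with a condition that requires the whole token to match. Python's `re.escape` is used for illustration; for plain alphanumeric words the output should also be valid in the Java regex flavour RapidMiner uses, but that is worth checking for words with special characters.

```python
import re


def wordlist_to_regex(words):
    """Build one alternation pattern that matches any word in the list."""
    # Drop blank entries and duplicates, then put longer words first so the
    # pattern stays predictable when one word is a prefix of another.
    cleaned = sorted({w.strip() for w in words if w.strip()},
                     key=lambda w: (-len(w), w))
    if not cleaned:
        raise ValueError("wordlist is empty")
    # Escape regex metacharacters so each word is matched literally.
    return "(?:" + "|".join(re.escape(w) for w in cleaned) + ")"


def keep_listed_tokens(tokens, words):
    """Local check: keep only the tokens that fully match a listed word."""
    pattern = re.compile(wordlist_to_regex(words))
    return [t for t in tokens if pattern.fullmatch(t)]


if __name__ == "__main__":
    wordlist = ["apple", "banana"]
    print(wordlist_to_regex(wordlist))
    print(keep_listed_tokens(["apple", "pie", "banana", "apples"], wordlist))
```

With a very large wordlist the pattern can become long, so this is probably only practical for lists of moderate size.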