"Dictionary based analysis"
mlctu
Hi!
Is there a way to use RapidMiner to perform dictionary-based analysis of document collections?
In particular, I'm interested in term frequency and other statistics on term occurrences in the documents, where the terms of interest are provided by the user and are already grouped into one or more user-defined category lists (dictionaries).
Thanks for your help!
Giulio
land
Hi,
Yes, this is easily possible with the Text Processing Extension. You can use the dictionary-based filtering to remove all words that are not of interest.
Alternatively, you could first count all words and then post-process the resulting word list using the "WordList to Data" operator.
Greetings,
Sebastian
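
For readers who want to see the idea outside of RapidMiner, here is a minimal sketch in plain Python of the "count all words, then keep only the dictionary terms" approach described above. It is not RapidMiner code and does not use any RapidMiner operator; the function names, the sample documents, and the two category lists are all made up for illustration, and it assumes the dictionary terms are single lowercase words.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def dictionary_counts(documents, dictionaries):
    """Count, per document, how often each dictionary term occurs.

    Returns a nested dict: {doc_name: {category: {term: count}}}.
    """
    results = {}
    for name, text in documents.items():
        counts = Counter(tokenize(text))  # count every word first
        results[name] = {
            # then keep only the terms listed in each dictionary
            category: {term: counts[term] for term in terms}
            for category, terms in dictionaries.items()
        }
    return results


def category_totals(results):
    """Sum the term counts of each category, per document."""
    return {
        name: {cat: sum(term_counts.values()) for cat, term_counts in cats.items()}
        for name, cats in results.items()
    }


# Hypothetical sample data (assumption, not from the thread)
documents = {
    "doc1": "The engine failed. Engine repair cost was high.",
    "doc2": "Low cost and fast delivery.",
}
dictionaries = {
    "mechanical": ["engine", "repair"],
    "commercial": ["cost", "delivery"],
}

results = dictionary_counts(documents, dictionaries)
totals = category_totals(results)
print(results)
print(totals)
```

Multi-word terms, stemming, or normalising by document length would need extra steps that this sketch leaves out.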