Text Mining: Extracting Data from Text
Raphael2304
Dear all,
I have a small problem concerning text mining with RapidMiner. I have a bunch of press releases, all structured the same way. I want to extract the headline of each press release (1st line), the date it was published (2nd line), and the coloured parts of the release, as well as the whole paragraph in which each coloured part was found. All releases are in a single .rtf file and are separated by section breaks. Any idea how to do this as quickly as possible?
Thanks a lot in advance!
Best
Raphael
Accepted answers
kayman
Using a combination of split and some regex that looks at newlines should do the trick.
I've attached a very rough example that can get you started.
split_sentences.rmp
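For readers who cannot open the attached process, here is a minimal Python sketch of the same split-plus-regex idea. It is not a reproduction of split_sentences.rmp. It assumes the .rtf has already been converted to plain text, that section breaks ended up as form feed characters, that paragraphs are separated by blank lines, and that a hypothetical preprocessing step wrapped the coloured parts in `[[...]]`. The names `SECTION_BREAK`, `MARKED`, `parse_release`, and `parse_releases` are made up for illustration.

```python
import re

# Assumptions (not from the original thread): plain-text input, section
# breaks stored as form feeds, coloured spans wrapped in [[...]] by a
# hypothetical preprocessing step.
SECTION_BREAK = "\f"
MARKED = re.compile(r"\[\[(.+?)\]\]", re.DOTALL)


def parse_release(release):
    """Return headline, date, and the paragraphs that contain marked spans."""
    lines = [ln.strip() for ln in release.strip().splitlines()]
    headline = lines[0] if lines else ""          # 1st line
    date = lines[1] if len(lines) > 1 else ""     # 2nd line
    body = "\n".join(lines[2:])
    # Paragraphs are assumed to be separated by one or more blank lines.
    paragraphs = [p.strip() for p in re.split(r"\n\s*\n", body) if p.strip()]
    hits = []
    for para in paragraphs:
        spans = MARKED.findall(para)
        if spans:
            hits.append({"marked": spans, "paragraph": para})
    return {"headline": headline, "date": date, "hits": hits}


def parse_releases(text):
    """Split the whole document on section breaks and parse each release."""
    return [parse_release(r) for r in text.split(SECTION_BREAK) if r.strip()]
```

Calling `parse_releases(text)` on the converted file returns one dictionary per release. If your conversion marks section breaks or coloured text differently, only `SECTION_BREAK` and `MARKED` should need changing.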
Raphael2304
Hey @kayman,
thanks a lot for your answer. I will try your solution right now!