Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Community Q&A
"Finding Document Similarity Based Mostly on Keywords and Title"
joandcruz
Hi all,
Sorry for the long title, but I could not find an efficient one Smiley
I am new to RM, and I am finding document similarities via RM. My sources are the webpages, and I basically read them and compare them.
So far so good; but here is the problem:
I want to determine keywords and title for the documents, and I also want to assign weights to keywords. When I run the program, title and keywords seem to be '?'. So, is there a way to manually enter the keywords and title for now? For later stages: how can RM automatically get keywords from webpages?
Find more posts tagged with
AI Studio
Text Mining + NLP
Accepted answers
All comments
MartinLiebig
Hi,
if you just want to assign a score, based on the keywords you might want to have a look at this thread
http://rapid-i.com/rapidforum/index.php/topic,8638.0.html
If you want to find the words automatically, you can do standard text mining on them. The trick is, that you can cluster the documents. Afterwards you can use the cluster information as Label and do a feature selection on them. Thus you can get the important words per cluster.
Cheers,
Martin
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups