Text Classification
Nancy
New Altair Community Member
Hi,
I am working with a text document.It contains around 1000 small paragraphs.My objective is to group the words which frequently repeating in the paragraphs or sentences (need not continuous words but those words are in that paragraph) .Which operarator can I use to classify the document on the basis of group of words.
Thanks
Nancy
I am working with a text document.It contains around 1000 small paragraphs.My objective is to group the words which frequently repeating in the paragraphs or sentences (need not continuous words but those words are in that paragraph) .Which operarator can I use to classify the document on the basis of group of words.
Thanks
Nancy
Tagged:
0
Answers
-
Hi,
sorry, but what you are asking for seems not to be consistent. In the first sentence you are explaining, that you are going to group the words. In the question, you want to classify the documents. From this I cannot comprehend what your real objective is and how I could help you, reaching it.
Greetings,
Sebastian0 -
Hi Sebastain,
My objective is to classify the documents.But now I want to group the words.After that I will classify the entire documents on the basis of these words.So can you suggest any way to group the words.
Thanks
Nancy0 -
Hi,
if you want to select words, which will be suitable for later classification, you could use feature selection or simply weighting.
Otherwise, you would have to specify, would objective you have with grouping the words. Every combination is a group, so just building a grouping isn't much sensible.
Greetings,
Sebastian0