Text Mining: Document Clustering of JSON files
Hi everyone,
as I am really new at RapidMiner, I have big difficulties on clustering documents.
The idea is to import about 1350 documents (.txt) that are wiritten in JSON format, to convert them into a table (each row represents a document) and to run a document clustering incl. performance measurement. Btw. the content of the document is web content from diferent websites (in english and german).
Unfortunately I do not manage to import these files, so that RapidMiner recognizes them as JSON.
Is there anyone who could help me with that? I would really appreciate any help!
If needed I could send some documents.
Thx a lot!!