Calculate number of unique words in text and number of repeating paragraphs
Find more posts tagged with
Sort by:
1 - 5 of
51
Hi ln777,
you are always allowed to ask questions - that's what we are here for . The only question is if we can answer them.
i would create a similarity/synonym dictionary. I would go for worldist to data, take the sentences as an input for a 2nd process documents, tokenize on words and calculate a cross distance on the result. There i would go for a high cosine similarity to define a "synomym". This dictionary can then be used to replace texts in the original document.
~Martin
Sort by:
1 - 1 of
11
Hi,
you can simply use a Process Documents operator with binary occurences and use Generate Aggregation afterwards to get the sum of a row.
~Martin
Hi,
you can simply use a Process Documents operator with binary occurences and use Generate Aggregation afterwards to get the sum of a row.
~Martin