Hello,
I'm trying to create a process which consist on comparing 2 pdf that are subtly different.
I process my documents (tokenize, filter stopwords, generate n grams...) from two differents files and merge it into one common example set with the operator "Append" and use the operator "Remove duplicates" to see differences in the pdf. Please find attached my process, I have 2 questions :
1) Is it possible to convert my example set result into a wordlist to have a table by row rather than column ?
2) It seems that something went wrong because there are words which are in the 2 files which appears in the output, while it should show words that are in a specific document and whiich is absent in the other one, and so on
Thanks !
Sabine