similarity

Bashar
Bashar New Altair Community Member
edited November 5 in Community Q&A
How are you .. How can I use data to similarity to calculate the similarity of a document with all the lines in a database and choose the most similar
Thank you

Best Answer

  • BalazsBarany
    BalazsBarany New Altair Community Member
    Answer ✓
    Hi!

    You would use these operators:
    - Read Database for getting the data
    - Process Documents from Data (from the Text Processing extension) to create a document vector
    - A second Read Database (or different data source) for the data to compare
    - Process Documents from Data, with the wor (Wordlist) output from the first one connected to the input of this one. This makes sure that the tables have the same structure
    - Cross Distances

    Then you would select the documents with the smallest distance (= largest similarity). 

    Regards,
    Balázs

Answers

  • BalazsBarany
    BalazsBarany New Altair Community Member
    Answer ✓
    Hi!

    You would use these operators:
    - Read Database for getting the data
    - Process Documents from Data (from the Text Processing extension) to create a document vector
    - A second Read Database (or different data source) for the data to compare
    - Process Documents from Data, with the wor (Wordlist) output from the first one connected to the input of this one. This makes sure that the tables have the same structure
    - Cross Distances

    Then you would select the documents with the smallest distance (= largest similarity). 

    Regards,
    Balázs