nav[aria-label="Primary Navigation"] { padding: 0; & ul { list-style: none; width: 100%; display: flex; flex-direction: row; justify-content: start; align-items: start; gap: 30px; padding: 0; & li { margin: 0; } & ul li { list-style: none; } } }

Siemens Community Catalyst Program

The Siemens Community Catalyst program was co-created with our community to acknowledge technology leaders who consistently contribute to the Siemens Community. Nominations are accepted on a rolling basis.

Nominate Now

⚠️Please Note

Technical discussions have been migrated to the Siemens Support Center as Knowledge Base (KB) articles; please note that this content is no longer maintained and may be outdated, so for the latest information, log in to the Siemens Support Center, search online, or contact our support team.

Search for Content in Siemens Support Center

How Data to Similarity operator works on Large Dataset

statspro

Dear Community Members,

I wanted to know how Data to Similarity operator works on large dataset. As per my understanding this operator works in a permutation & combination manner (i.e. nC2 ways). If we have only 50 text then it will check the combination with 49 text and it will gives us the similarity results in the result window (First, Second & Similarity) but if we have large datasets (i.e. 100000 text) then how it works. is there any other specific filter I need to use for checking the text similarity for large dataset ?

Can anyone help me on this.

Thanks,
Arun