ID1 ID2 SIM
2 1 0.5
1 2 0.5
if([FIRST_ID]>[SECOND_ID], concat(str([FIRST_ID]),"_",str([SECOND_ID])), concat(str([SECOND_ID]),"_",str([FIRST_ID])))
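The expression above builds a key that is the same for (1, 2) and (2, 1) by always putting the larger ID first, so a later Remove Duplicates on that key keeps one row per pair. As a rough illustration of the same logic outside RapidMiner, here is a minimal Python sketch; the function names `pair_key` and `remove_duplicate_pairs` are made up for this example and are not part of any RapidMiner API.

```python
def pair_key(first_id, second_id):
    """Order-independent key: larger ID first, joined by an underscore."""
    if first_id > second_id:
        return f"{first_id}_{second_id}"
    return f"{second_id}_{first_id}"


def remove_duplicate_pairs(rows):
    """Keep the first row seen for each unordered (ID1, ID2) pair."""
    seen = set()
    unique = []
    for first_id, second_id, sim in rows:
        key = pair_key(first_id, second_id)
        if key not in seen:
            seen.add(key)
            unique.append((first_id, second_id, sim))
    return unique


# The two mirrored rows from the example table above.
rows = [(2, 1, 0.5), (1, 2, 0.5)]
unique_rows = remove_duplicate_pairs(rows)
print(unique_rows)
```

Which of the two mirrored rows survives depends only on input order; the similarity value is the same in both, so nothing is lost.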
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<process version="7.0.001">
  <context>
    <input/>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" compatibility="7.0.001" expanded="true" name="Process">
    <process expanded="true">
      <operator activated="true" class="retrieve" compatibility="7.0.001" expanded="true" height="68" name="Retrieve Golf" width="90" x="112" y="34">
        <parameter key="repository_entry" value="//Samples/data/Golf"/>
      </operator>
      <operator activated="true" class="data_to_similarity" compatibility="7.0.001" expanded="true" height="82" name="Data to Similarity" width="90" x="246" y="34"/>
      <operator activated="true" class="similarity_to_data" compatibility="7.0.001" expanded="true" height="82" name="Similarity to Data" width="90" x="380" y="34"/>
      <operator activated="true" class="generate_attributes" compatibility="7.0.001" expanded="true" height="82" name="Generate Attributes" width="90" x="514" y="34">
        <list key="function_descriptions">
          <parameter key="IdToRemoveDuplicates" value="if([FIRST_ID]>[SECOND_ID], concat(str([FIRST_ID]),&quot;_&quot;,str([SECOND_ID])), concat(str([SECOND_ID]),&quot;_&quot;,str([FIRST_ID])))"/>
        </list>
        <description align="center" color="transparent" colored="false" width="126">Create an ID to remove the stuff</description>
      </operator>
      <operator activated="true" class="remove_duplicates" compatibility="7.0.001" expanded="true" height="82" name="Remove Duplicates" width="90" x="648" y="34">
        <parameter key="attribute_filter_type" value="single"/>
        <parameter key="attribute" value="IdToRemoveDuplicates"/>
      </operator>
      <connect from_op="Retrieve Golf" from_port="output" to_op="Data to Similarity" to_port="example set"/>
      <connect from_op="Data to Similarity" from_port="similarity" to_op="Similarity to Data" to_port="similarity"/>
      <connect from_op="Data to Similarity" from_port="example set" to_op="Similarity to Data" to_port="exampleSet"/>
      <connect from_op="Similarity to Data" from_port="exampleSet" to_op="Generate Attributes" to_port="example set input"/>
      <connect from_op="Generate Attributes" from_port="example set output" to_op="Remove Duplicates" to_port="example set input"/>
      <connect from_op="Remove Duplicates" from_port="example set output" to_port="result 1"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
    </process>
  </operator>
</process>
And how do I get a count of identical rows, grouped by the text field? For the set below I want counts like:
ABC is a good text -----3
XYZ is great -----------2
FIRST SECOND SIMILARITY textfield
1 2 1 ABC is a good text
3 8 1 ABC is a good text
4 9 1 ABC is a good text
12 32 1 XYZ is great
31 77 1 XYZ is great
Can't you use an Aggregate operator for this?
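To make the suggestion concrete: an aggregation that groups by the text field and counts the rows in each group gives exactly the numbers asked for. The following is only a Python sketch of that group-and-count logic on the sample rows from the question, not RapidMiner code; `count_by_text` is a hypothetical helper name.

```python
from collections import Counter

# Sample rows from the question: (FIRST, SECOND, SIMILARITY, textfield)
rows = [
    (1, 2, 1, "ABC is a good text"),
    (3, 8, 1, "ABC is a good text"),
    (4, 9, 1, "ABC is a good text"),
    (12, 32, 1, "XYZ is great"),
    (31, 77, 1, "XYZ is great"),
]


def count_by_text(rows):
    """Group rows by the text field and count the rows in each group."""
    return Counter(text for _first, _second, _sim, text in rows)


counts = count_by_text(rows)
for text, n in counts.items():
    print(text, n)
```

In RapidMiner terms this corresponds to grouping by textfield and using a count aggregation on any attribute.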
Thanks Thomas. Results achieved.