A program to recognize and reward our most engaged community members
Marius wrote:If I understand you correctly, your final data set should look like this:ID, T1, T2Apples, 3, 1Oranges, 2, 5Bananas, 2, 3Kiwis, 0, 7That can be done with the Join operator (mode: outer join), followed by a replace missing values. Probably you have to rename the attributes of one of the examplesets beforehand.Please let me know if I understood you correctly and if this works. If not, please define "merge the example sets" and give an example of the desired output.Best regards,Marius
<?xml version="1.0" encoding="UTF-8" standalone="no"?><process version="5.3.006"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="5.3.006" expanded="true" name="Process"> <process expanded="true"> <operator activated="true" class="generate_data" compatibility="5.3.006" expanded="true" height="60" name="Generate Data" width="90" x="112" y="30"> <parameter key="number_examples" value="10"/> <parameter key="number_of_attributes" value="1"/> </operator> <operator activated="true" class="generate_id" compatibility="5.3.006" expanded="true" height="76" name="Generate ID" width="90" x="313" y="30"/> <operator activated="true" class="generate_data" compatibility="5.3.006" expanded="true" height="60" name="Generate Data (2)" width="90" x="112" y="120"> <parameter key="number_examples" value="10"/> <parameter key="number_of_attributes" value="1"/> </operator> <operator activated="true" class="generate_id" compatibility="5.3.006" expanded="true" height="76" name="Generate ID (2)" width="90" x="313" y="120"> <parameter key="offset" value="5"/> </operator> <operator activated="true" class="join" compatibility="5.3.006" expanded="true" height="76" name="Join" width="90" x="447" y="75"> <parameter key="remove_double_attributes" value="false"/> <parameter key="join_type" value="outer"/> <list key="key_attributes"/> </operator> <connect from_op="Generate Data" from_port="output" to_op="Generate ID" to_port="example set input"/> <connect from_op="Generate ID" from_port="example set output" to_op="Join" to_port="left"/> <connect from_op="Generate Data (2)" from_port="output" to_op="Generate ID (2)" to_port="example set input"/> <connect from_op="Generate ID (2)" from_port="example set output" to_op="Join" to_port="right"/> <connect from_op="Join" from_port="join" to_port="result 1"/> <portSpacing port="source_input 1" spacing="0"/> <portSpacing port="sink_result 1" spacing="0"/> <portSpacing port="sink_result 2" spacing="0"/> </process> </operator></process>
Marius wrote:Ask less, experiment more - RapidMiner is a more powerful wizard than you might think But seriously, just give things a try - in this case, the join operator does the job quite well, as you can also see in the attached process. Please note that I deactivated the option remove_duplicate_attributes.Best regards,Marius <?xml version="1.0" encoding="UTF-8" standalone="no"?><process version="5.3.006"> <context> <input/> <output/> <macros/> </context> <operator activated="true" class="process" compatibility="5.3.006" expanded="true" name="Process"> <process expanded="true"> <operator activated="true" class="generate_data" compatibility="5.3.006" expanded="true" height="60" name="Generate Data" width="90" x="112" y="30"> <parameter key="number_examples" value="10"/> <parameter key="number_of_attributes" value="1"/> </operator> <operator activated="true" class="generate_id" compatibility="5.3.006" expanded="true" height="76" name="Generate ID" width="90" x="313" y="30"/> <operator activated="true" class="generate_data" compatibility="5.3.006" expanded="true" height="60" name="Generate Data (2)" width="90" x="112" y="120"> <parameter key="number_examples" value="10"/> <parameter key="number_of_attributes" value="1"/> </operator> <operator activated="true" class="generate_id" compatibility="5.3.006" expanded="true" height="76" name="Generate ID (2)" width="90" x="313" y="120"> <parameter key="offset" value="5"/> </operator> <operator activated="true" class="join" compatibility="5.3.006" expanded="true" height="76" name="Join" width="90" x="447" y="75"> <parameter key="remove_double_attributes" value="false"/> <parameter key="join_type" value="outer"/> <list key="key_attributes"/> </operator> <connect from_op="Generate Data" from_port="output" to_op="Generate ID" to_port="example set input"/> <connect from_op="Generate ID" from_port="example set output" to_op="Join" to_port="left"/> <connect from_op="Generate Data (2)" from_port="output" to_op="Generate ID (2)" to_port="example set input"/> <connect from_op="Generate ID (2)" from_port="example set output" to_op="Join" to_port="right"/> <connect from_op="Join" from_port="join" to_port="result 1"/> <portSpacing port="source_input 1" spacing="0"/> <portSpacing port="sink_result 1" spacing="0"/> <portSpacing port="sink_result 2" spacing="0"/> </process> </operator></process>