Supplied test set...
yogafire
New Altair Community Member
Hello, I am dimas from Indonesia.
I wonder how to test my model based on supplied test set at RapidMiner. I usually do such activity on WEKA. ::)
My purpose is at least to find out performance of my model when it's applied on testing set. such that, I can choose the best model having best accuracy on training set and test set. ;D
I apreciate all replies from you all. if you're so kind to share the XML process, of course it will be great!
Thank You
I wonder how to test my model based on supplied test set at RapidMiner. I usually do such activity on WEKA. ::)
My purpose is at least to find out performance of my model when it's applied on testing set. such that, I can choose the best model having best accuracy on training set and test set. ;D
I apreciate all replies from you all. if you're so kind to share the XML process, of course it will be great!
Thank You
Tagged:
0
Answers
-
Hi,
here's a RapidMiner 5.0 process which gives you an impression how easy it is to accomplish such tasks. On might say, RapidMiner is designed for exactly this<?xml version="1.0" encoding="UTF-8" standalone="no"?>
Greetings,
<process version="5.0">
<context>
<input>
<location/>
</input>
<output>
<location/>
<location/>
</output>
<macros/>
</context>
<operator activated="true" class="process" expanded="true" name="Process">
<process expanded="true" height="395" width="790">
<operator activated="true" class="generate_data" expanded="true" height="60" name="Generate Train" width="90" x="45" y="30">
<parameter key="target_function" value="sum classification"/>
</operator>
<operator activated="true" class="linear_regression" expanded="true" height="76" name="Learn" width="90" x="179" y="30"/>
<operator activated="true" class="generate_data" expanded="true" height="60" name="Generate Test" width="90" x="45" y="165">
<parameter key="target_function" value="sum classification"/>
</operator>
<operator activated="true" class="apply_model" expanded="true" height="76" name="Apply Model" width="90" x="313" y="165">
<list key="application_parameters"/>
</operator>
<operator activated="true" class="performance_classification" expanded="true" height="76" name="Performance" width="90" x="447" y="165">
<parameter key="accuracy" value="true"/>
<list key="class_weights"/>
</operator>
<connect from_op="Generate Train" from_port="output" to_op="Learn" to_port="training set"/>
<connect from_op="Learn" from_port="model" to_op="Apply Model" to_port="model"/>
<connect from_op="Generate Test" from_port="output" to_op="Apply Model" to_port="unlabelled data"/>
<connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
<connect from_op="Performance" from_port="performance" to_port="result 1"/>
<portSpacing port="source_input 1" spacing="0"/>
<portSpacing port="sink_result 1" spacing="126"/>
<portSpacing port="sink_result 2" spacing="18"/>
</process>
</operator>
</process>
Sebastian0 -
OK, Thanks Before.
I'll try it.
I'll keep you informed of the result0