[SOLVED] Same workflows differ in run time

fras
fras New Altair Community Member
edited November 5 in Community Q&A
Hi,
I’m playing around with a performance work flow just mentioned in the seminar last week
(thank again for this nice seminar).
I drastically reduced this work flow cutting “Loop Parameters” and “X-Validation” so I left
with “Retrieve” and “Decision Tree”. But running this one it never returns within minutes.
BUT if I create this work flow from scratch it finishes in one second and switches to the result
perspective. I attached these two work flows for further investigations.
Thanks in advance,
Frank

[fras-desktop] Sandbox > cat WF_NotWorks

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<process version="5.2.006">
 <context>
   <input/>
   <output/>
   <macros/>
 </context>
 <operator activated="true" breakpoints="after" class="process" compatibility="5.2.006" expanded="true" name="Process">
   <parameter key="logverbosity" value="init"/>
   <parameter key="random_seed" value="2001"/>
   <parameter key="send_mail" value="never"/>
   <parameter key="notification_email" value=""/>
   <parameter key="process_duration_for_mail" value="30"/>
   <parameter key="encoding" value="SYSTEM"/>
   <process expanded="true" height="550" width="752">
     <operator activated="true" class="retrieve" compatibility="5.2.006" expanded="true" height="60" name="Retrieve" width="90" x="45" y="75">
       <parameter key="repository_entry" value="//Samples/data/Sonar"/>
     </operator>
     <operator activated="true" class="decision_tree" compatibility="5.2.006" expanded="true" height="76" name="Decision Tree" width="90" x="313" y="75">
       <parameter key="criterion" value="gain_ratio"/>
       <parameter key="minimal_size_for_split" value="4"/>
       <parameter key="minimal_leaf_size" value="2"/>
       <parameter key="minimal_gain" value="0.1"/>
       <parameter key="maximal_depth" value="20"/>
       <parameter key="confidence" value="0.25"/>
       <parameter key="number_of_prepruning_alternatives" value="3"/>
       <parameter key="no_pre_pruning" value="false"/>
       <parameter key="no_pruning" value="false"/>
     </operator>
     <connect from_op="Retrieve" from_port="output" to_op="Decision Tree" to_port="training set"/>
     <connect from_op="Decision Tree" from_port="model" to_port="result 1"/>
     <portSpacing port="source_input 1" spacing="0"/>
     <portSpacing port="sink_result 1" spacing="0"/>
     <portSpacing port="sink_result 2" spacing="0"/>
   </process>
 </operator>
</process>

[fras-desktop] Sandbox > cat WF_works

<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<process version="5.2.006">
 <context>
   <input/>
   <output/>
   <macros/>
 </context>
 <operator activated="true" class="process" compatibility="5.2.006" expanded="true" name="Process">
   <parameter key="logverbosity" value="init"/>
   <parameter key="random_seed" value="2001"/>
   <parameter key="send_mail" value="never"/>
   <parameter key="notification_email" value=""/>
   <parameter key="process_duration_for_mail" value="30"/>
   <parameter key="encoding" value="SYSTEM"/>
   <process expanded="true" height="397" width="749">
     <operator activated="true" class="retrieve" compatibility="5.2.006" expanded="true" height="60" name="Retrieve" width="90" x="28" y="66">
       <parameter key="repository_entry" value="//Samples/data/Sonar"/>
     </operator>
     <operator activated="true" class="decision_tree" compatibility="5.2.006" expanded="true" height="76" name="Decision Tree" width="90" x="246" y="165">
       <parameter key="criterion" value="gain_ratio"/>
       <parameter key="minimal_size_for_split" value="4"/>
       <parameter key="minimal_leaf_size" value="2"/>
       <parameter key="minimal_gain" value="0.1"/>
       <parameter key="maximal_depth" value="20"/>
       <parameter key="confidence" value="0.25"/>
       <parameter key="number_of_prepruning_alternatives" value="3"/>
       <parameter key="no_pre_pruning" value="false"/>
       <parameter key="no_pruning" value="false"/>
     </operator>
     <connect from_op="Retrieve" from_port="output" to_op="Decision Tree" to_port="training set"/>
     <connect from_op="Decision Tree" from_port="model" to_port="result 1"/>
     <connect from_op="Decision Tree" from_port="exampleSet" to_port="result 2"/>
     <portSpacing port="source_input 1" spacing="0"/>
     <portSpacing port="sink_result 1" spacing="0"/>
     <portSpacing port="sink_result 2" spacing="0"/>
     <portSpacing port="sink_result 3" spacing="0"/>
   </process>
 </operator>
</process>
Tagged:

Answers

  • MariusHelf
    MariusHelf New Altair Community Member
    Hi Frank,

    nice to hear the you are actually continuing on experimenting with RapidMiner. This is a problem which also occured during the seminars, and we have an item on our TODO list to fix it.
    You added (probably by accident) a breakpoint to the root operator. You'll easily see it, if you open the Tree view (e.g. via View -> Show View -> Tree). To remove it, click somewhere onto an empty space in your process, then right-click and remove the breakpoint.

    Best, Marius