Im new in Rapidminer, can someone help me with this Kaggle challenge ? only got 69% right

Xandesaraiva_
Xandesaraiva_ New Altair Community Member
edited November 5 in Community Q&A


<context>
    <input>
      <location>//Local Repository/data/1º DESAFIO</location>
    </input>
    <output/>
    <macros/>
  </context>
  <operator activated="true" class="process" compatibility="9.3.001" expanded="true" name="Process">
    <parameter key="logverbosity" value="init"/>
    <parameter key="random_seed" value="2001"/>
    <parameter key="send_mail" value="never"/>
    <parameter key="notification_email" value=""/>
    <parameter key="process_duration_for_mail" value="30"/>
    <parameter key="encoding" value="SYSTEM"/>
    <process expanded="true">
      <operator activated="true" class="retrieve" compatibility="9.3.001" expanded="true" height="68" name="Retrieve 1º DESAFIO" width="90" x="45" y="34">
        <parameter key="repository_entry" value="//Local Repository/data/1º DESAFIO"/>
      </operator>
      <operator activated="true" class="naive_bayes" compatibility="9.3.001" expanded="true" height="82" name="Naive Bayes" width="90" x="179" y="34">
        <parameter key="laplace_correction" value="true"/>
      </operator>
      <operator activated="true" class="apply_model" compatibility="9.3.001" expanded="true" height="82" name="Apply Model" width="90" x="313" y="34">
        <list key="application_parameters"/>
        <parameter key="create_view" value="false"/>
      </operator>
      <operator activated="true" class="performance" compatibility="9.3.001" expanded="true" height="82" name="Performance" width="90" x="447" y="34">
        <parameter key="use_example_weights" value="true"/>
      </operator>
      <connect from_op="Retrieve 1º DESAFIO" from_port="output" to_op="Naive Bayes" to_port="training set"/>
      <connect from_op="Naive Bayes" from_port="model" to_op="Apply Model" to_port="model"/>
      <connect from_op="Naive Bayes" from_port="exampleSet" to_op="Apply Model" to_port="unlabelled data"/>
      <connect from_op="Apply Model" from_port="labelled data" to_op="Performance" to_port="labelled data"/>
      <connect from_op="Performance" from_port="performance" to_port="result 2"/>
      <connect from_op="Performance" from_port="example set" to_port="result 1"/>
      <portSpacing port="source_input 1" spacing="0"/>
      <portSpacing port="sink_result 1" spacing="0"/>
      <portSpacing port="sink_result 2" spacing="0"/>
      <portSpacing port="sink_result 3" spacing="0"/>
    </process>
  </operator>
</process>



Answers

  • Xandesaraiva_
    Xandesaraiva_ New Altair Community Member
    the goal is to find out (starting by the number 892)  if the person survived or not. databases (train)
  • sgenzer
    sgenzer
    Altair Employee
    hi @Xandesaraiva_ great to have you here! Maybe can you please give us a bit more info (Which Kaggle challenge? There are thousands). And the XML you posted is cut off. Could you please try editing your message or posting a new reply with a clean copy?

    Scott
  • varunm1
    varunm1 New Altair Community Member
    Hello @Xandesaraiva_

    From your .rmp file, I can say that you are training and testing on the same dataset. If you are interested in learning good validation methods or how the Rapidminer process work, you can go to the rapidminer academy that consists of tutorial videos (easy to understand). 

    https://academy.rapidminer.com/

    If you need more information or any help with the processes we are happy to help as well.

    Thanks