Importing CSV

Question

I already have my data split into test.csv and Train.csv. What should I do? How do I import the two CSV files, and how do I apply the model to both files?
Normally we import a single CSV/Excel file and apply X-Validation to it, which splits the input into two parts: train and test.

varunm1 · Answer

As @hughesfleming68 mentioned, you can do this by using cross-validation for training and then connecting the trained model to the test data. An example with the Titanic data set is below. If your test data set also has a label attribute, you can connect a Performance operator to Apply Model, which gives you the test performance. To run the XML code below, copy it from here, open a new process in RapidMiner, open the XML panel (View --> Show Panel --> XML), paste the code into the XML window, and press the green tick mark. This displays the process, which you can then run.

hughesfleming68 · Answer

Jamia, build your model with cross-validation using just your train.csv. As a final step, import your test.csv and apply your model to the test data. This keeps your test data out of sample.
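
For readers who think in code, the same workflow can be sketched outside RapidMiner. The following is a minimal Python illustration, not a RapidMiner process: the nearest-centroid classifier is only a stand-in for whatever learner you use, the column name `label` and the inline CSV strings are made-up placeholders for your Train.csv and test.csv, and all helper names are hypothetical.

```python
import csv
import io
from collections import defaultdict

# Placeholder data standing in for Train.csv and test.csv (assumed layout:
# numeric feature columns plus a "label" column). With real files, pass
# open("Train.csv", newline="") instead of io.StringIO(...).
TRAIN_CSV = """x1,x2,label
1.0,1.1,a
5.0,5.2,b
0.9,1.3,a
5.1,4.8,b
1.2,0.8,a
4.9,5.1,b
1.1,1.0,a
5.3,5.0,b
"""
TEST_CSV = """x1,x2,label
1.0,0.9,a
5.0,5.0,b
"""

def read_rows(source, label="label"):
    """Read a CSV into (features, labels); non-label columns are parsed as floats."""
    X, y = [], []
    for row in csv.DictReader(source):
        y.append(row[label])
        X.append([float(v) for k, v in row.items() if k != label])
    return X, y

def fit(X, y):
    """Stand-in learner: store the mean feature vector (centroid) of each class."""
    groups = defaultdict(list)
    for xi, yi in zip(X, y):
        groups[yi].append(xi)
    return {c: [sum(col) / len(rows) for col in zip(*rows)]
            for c, rows in groups.items()}

def predict(model, X):
    """Assign each row to the class with the nearest centroid."""
    def dist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [min(model, key=lambda c: dist(model[c], xi)) for xi in X]

def accuracy(y_true, y_pred):
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def cross_validate(X, y, k=4):
    """k-fold cross-validation on the training data only (folds by row index, no shuffling)."""
    scores = []
    for fold in range(k):
        train_idx = [i for i in range(len(X)) if i % k != fold]
        val_idx = [i for i in range(len(X)) if i % k == fold]
        model = fit([X[i] for i in train_idx], [y[i] for i in train_idx])
        preds = predict(model, [X[i] for i in val_idx])
        scores.append(accuracy([y[i] for i in val_idx], preds))
    return sum(scores) / k

# Step 1: estimate performance with cross-validation on the training file only.
X_train, y_train = read_rows(io.StringIO(TRAIN_CSV))
cv_score = cross_validate(X_train, y_train, k=4)

# Step 2: train the final model on all training rows, then apply it to the test file.
final_model = fit(X_train, y_train)
X_test, y_test = read_rows(io.StringIO(TEST_CSV))
test_pred = predict(final_model, X_test)

# Step 3: only possible because this test file also contains labels.
test_acc = accuracy(y_test, test_pred)

print("cross-validation accuracy:", cv_score)
print("test predictions:", test_pred)
print("test accuracy:", test_acc)
```

The point of the sketch is the ordering: the test rows are never read until the model has been built and validated on the training rows alone.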

sgenzer · Answer

Hi @Jamia, all of these questions are fundamental to the software and can be answered by going through the "Getting Started" course in RapidMiner Academy. I would highly recommend taking some time with the course. It will give you a foundation to do a lot more.

Scott