"Training with multiple CSVs"
Hi all!
Very sorry if this is a head slappingly basic question. I have tried to find an answer in the manual but I probably just don't know what I'm looking for!
I am using data from a series of races. I think I need to train my model with multiple races before I can try to predict a winner. But how do I set up my data so that RapidMiner knows that each race needs to be analysed as one event with one winner rather than a series of unrelated records containing winners and losers - should I use one CSV with a label or ID for each row that belongs to the same race, or should I have separate CSVs for each race - and if so how do I use multiple CSVs as input?
Very sorry if this is a head slappingly basic question. I have tried to find an answer in the manual but I probably just don't know what I'm looking for!
I am using data from a series of races. I think I need to train my model with multiple races before I can try to predict a winner. But how do I set up my data so that RapidMiner knows that each race needs to be analysed as one event with one winner rather than a series of unrelated records containing winners and losers - should I use one CSV with a label or ID for each row that belongs to the same race, or should I have separate CSVs for each race - and if so how do I use multiple CSVs as input?