Hello.
I'm just getting started with RapidMiner and would like some help with my first project.
I've managed to import my data into RM and built a predictive process on some training data, so I can now predict my unlabeled data with the trained model.
So far I've trained my model on the raw training data, but I have realized that I need to cleanse the data first. Among rows that are otherwise identical, I only want my model to be trained on the row with the lowest value in column B.
Here is my setup:
Column A - Labeled data
Column B - Data where I only want the lowest value
Column C - Data
Example:
First row
Column A: 100
Column B: 20
Column C: 50
Second row
Column A: 100
Column B: 19
Column C: 50
Since columns A and C match in both rows, I only want my model to be trained on the second row, because the first row has a higher value in column B.
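To make the rule concrete, here is a minimal sketch of the filtering I have in mind, written in plain Python rather than RapidMiner. The function name keep_lowest_b and the dict-based rows are only made up for illustration, and I'm assuming that when two rows tie on column B it doesn't matter which one is kept (this sketch keeps the first one it sees).

```python
def keep_lowest_b(rows):
    """For each (A, C) combination, keep only the row with the smallest B."""
    best = {}
    for row in rows:
        key = (row["A"], row["C"])  # rows "match" when both A and C are equal
        # Keep this row if the group is new or its B is strictly lower.
        if key not in best or row["B"] < best[key]["B"]:
            best[key] = row
    return list(best.values())


rows = [
    {"A": 100, "B": 20, "C": 50},  # first row from the example
    {"A": 100, "B": 19, "C": 50},  # second row from the example
]

filtered = keep_lowest_b(rows)
print(filtered)  # [{'A': 100, 'B': 19, 'C': 50}]
```

So what I'm looking for is the RapidMiner equivalent of this group-by-A-and-C, keep-minimum-B step, applied before the training data reaches the model.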
Is this achievable in RapidMiner?
I hope my explanation is not too confusing.