[Solved] UNBALANCED DATA - Newbie Question
Hello All,
I am new to this forum. I have read through previous posts, but I still don't understand the basic steps needed to set up a process to balance data.
I have a label with the following split: 97% = Y, 3% = N. In the past I have used WEKA's "resample" filter, which does what I would like to do in RapidMiner: essentially, it expands the under-represented value to match the over-represented one. My question is, which operator(s) should I use, and with which settings?
Sorry for the rookie question,
Paul
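For reference, the balancing described above amounts to random oversampling with replacement: rows of the rarer label are duplicated until each label has as many rows as the most common one. The snippet below is only a rough sketch of that idea in plain Python, not a RapidMiner process and not WEKA's actual implementation. The function name `oversample_to_balance`, the `label_of` parameter, and the toy 97/3 data are assumptions made up for illustration.

```python
import random
from collections import Counter


def oversample_to_balance(rows, label_of, seed=0):
    """Duplicate rows of the rarer labels (sampling with replacement)
    until every label has as many rows as the most common label.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)

    # Group the rows by their label value.
    groups = {}
    for row in rows:
        groups.setdefault(label_of(row), []).append(row)

    # Every label is grown to the size of the largest group.
    target = max(len(members) for members in groups.values())

    balanced = []
    for members in groups.values():
        balanced.extend(members)
        # Draw the missing rows at random, with replacement.
        balanced.extend(rng.choices(members, k=target - len(members)))

    rng.shuffle(balanced)
    return balanced


# Toy data mirroring the 97% Y / 3% N split from the question.
rows = [("Y", i) for i in range(97)] + [("N", i) for i in range(3)]
balanced = oversample_to_balance(rows, label_of=lambda r: r[0])
counts = Counter(label for label, _ in balanced)
print(counts)
```

If this is done before model validation, the duplicated rows can end up in both the training and the test data, so the balancing is usually applied to the training portion only.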