[Solved] UNBALANCED DATA - Newbie Question

User: "dynera"
New Altair Community Member
Updated by Jocelyn
Hello All,

I am new to this forum. I have read through previous posts, but I still don't understand the basic steps needed to set up a process that balances data.

I have a label with the following split: 97% = Y, 3% = N. In the past I have used WEKA's "resample" filter, which does what I would like to do in RapidMiner: essentially, it expands the under-represented value so that it matches the over-represented value. My question is, which operator(s) should I use, and with which settings?
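To make the goal concrete, the kind of balancing described above (random oversampling of the minority class until it matches the majority class) can be sketched in plain Python. This is only an illustration under assumptions: the function name `oversample_minority`, the `label` key, and the toy 97/3 data set are hypothetical, and the code is not WEKA's or RapidMiner's actual implementation.

```python
import random
from collections import Counter


def oversample_minority(rows, label_key="label", seed=0):
    """Duplicate randomly chosen rows of each smaller class until
    every class has as many rows as the largest class.

    Hypothetical helper for illustration only.
    """
    rng = random.Random(seed)

    # Group the rows by their label value.
    by_label = {}
    for row in rows:
        by_label.setdefault(row[label_key], []).append(row)

    # Every class is grown to the size of the largest class.
    target = max(len(group) for group in by_label.values())

    balanced = []
    for group in by_label.values():
        balanced.extend(group)
        # Sample with replacement to fill the gap (k is 0 for the largest class).
        balanced.extend(rng.choices(group, k=target - len(group)))

    rng.shuffle(balanced)
    return balanced


# Toy data mirroring the 97% / 3% split in the question.
rows = [{"id": i, "label": "Y"} for i in range(97)]
rows += [{"id": 97 + i, "label": "N"} for i in range(3)]

balanced = oversample_minority(rows)
counts = Counter(row["label"] for row in balanced)
print(sorted(counts.items()))  # [('N', 97), ('Y', 97)]
```

Because the minority rows are exact duplicates, this kind of resampling should normally be applied to the training data only, not to the data used for testing.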

Sorry for the rookie question,

Paul
