Sample (Stratified)
tobyb
New Altair Community Member
It would be nice if the current operator was modified so that the proportion of classes can be specified. I have worked with several datasets where one class is underrepresented. As a result the model classifies everything as the class that is overrepresented. The modified operator I am proposing would be able to capture a number of cases by class. This way you could control how many cases were sampled from each class. This type of preprocesing in my experience leads to meaningful models.
Tagged:
0
Answers
-
I am replying to my own post. I am new to this forum and I posted this request before I looked thru the other posts. I realized after the fact that Axel has proposed something similar in the Balancing Operator post. I apologize for duplicating this request.0
-
Hi,
at least you showed the urgency of this proposal
Greetings,
Sebastian0