Specifying Prior Probabilities
tobyb
New Altair Community Member
Is there a way to specify prior probabilities in Rapid Miner? For example let's say I have a dataset that has 80% of one class and 20% of another class. A subset is created that has 50% of both classes. I would like to be able to specify that the prior probabilities were 80% and 20%.
Tagged:
0
Answers
-
Hi there,
You could do this by filtering and counting using data macros, but a quick and sneaky fix sometimes has its place, like this...<operator name="Root" class="Process" expanded="yes">
Good weekend to all!
<operator name="ExampleSetGenerator" class="ExampleSetGenerator">
<parameter key="target_function" value="simple non linear classification"/>
</operator>
<operator name="EqualLabelWeighting" class="EqualLabelWeighting">
</operator>
</operator>
0 -
I'm probably missing something obvious, but it seems like this is backwards. The original question was about data with a true (prior) probability of 80/20, but with the minority label oversampled such that the training data was 50/50. Wouldn't EqualLabelWeighting be more like taking an 80/20 sample to a 50/50 prior?haddock wrote:
Hi there,
You could do this by filtering and counting using data macros, but a quick and sneaky fix sometimes has its place, like this...
<process omitted>
Keith
0 -
Hi Keith,
Have you not heard? Backwards is the new forwards! Perhaps I should have been more explicit; we can use the fact that we know the number of classes and the 'equal weight' number to keep track of the original distribution. In the binominal case we simply divide 0.5 by the weight to produce the count, like this...<operator name="Root" class="Process" expanded="yes">
<operator name="ExampleSetGenerator" class="ExampleSetGenerator">
<parameter key="target_function" value="simple non linear classification"/>
</operator>
<operator name="EqualLabelWeighting" class="EqualLabelWeighting">
</operator>
<operator name="AttributeConstruction" class="AttributeConstruction">
<list key="function_descriptions">
<parameter key="Count" value="0.5/weight"/>
</list>
</operator>
</operator>0