Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Community Q&A
What ate the definition of profit and gain?
koichi
In the result of Auto Modeling, there are Profits and Gain. What are the definition of those?
Find more posts tagged with
AI Studio
Accepted answers
BalazsBaranyRM
Hi Koichi,
you're right, this part of AutoModel isn't documented very well.
The cost matrix is applied like this: In the validation part, we know the actual outcome and the predicted value. These are compared and the appropriate cell of the matrix is taken into account. For example, 1 for True Not Worthy and Predicted Not Worthy. These are summarized. If you have negative values in the matrix, you could even get a negative outcome.
"Not having a model" is the situation before applying data mining. E. g. a company just executing all orders, some of them might be fraudulent. This is the baseline. (You wouldn't randomly guess in this situation and cancel one half of all orders.)
Regards,
Balázs
All comments
BalazsBaranyRM
Hi!
You can enter a cost matrix when executing Auto Model.
Quite often, the (financial or other) consequences of correct and wrong classifications are different. E. g. you miss a fraud case - you lose 1000 $; you falsely flag a good case as fraud - somebody has to look at the data (costing you e. g. 2 $).
The false positives and negatives and also the correct predictions are weighted with this cost matrix.
The gain is compared to the baseline of not having a model.
Best regards,
Balázs
koichi
Thank you for your answer.
OK. Here I can enter Cost Matrix, right? I'm just started to
learn RapidMiner. It's little bit confusing because in Machine Learning usually cost imply cost of "cost function”. And there is no information about this cost Matrix in the left panel. And also I could not find any information about this in RapidMiner documentation and tutorial videos.
>The false positives and negatives and also the correct predictions are weighted with this cost matrix.
Does this mean Confusion matrix multiplied by cost Matrix in each cells?
>The gain is compared to the baseline of not having a model.
The baseline is "Profits for Best Option"?
How RapidMiner calculate "Profits for Best Option"? This may be the baseline of not having a model? For example, Random guessing?
Best Regards,
Koichi
BalazsBaranyRM
Hi Koichi,
you're right, this part of AutoModel isn't documented very well.
The cost matrix is applied like this: In the validation part, we know the actual outcome and the predicted value. These are compared and the appropriate cell of the matrix is taken into account. For example, 1 for True Not Worthy and Predicted Not Worthy. These are summarized. If you have negative values in the matrix, you could even get a negative outcome.
"Not having a model" is the situation before applying data mining. E. g. a company just executing all orders, some of them might be fraudulent. This is the baseline. (You wouldn't randomly guess in this situation and cancel one half of all orders.)
Regards,
Balázs
koichi
Thanks a lot. Balázs
Now, I understood.
Those illustration should be included in the documentation.
Regards,
Koichi
Quick Links
All Categories
Recent Discussions
Activity
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups