Community & Support
Learn
Marketplace
Discussions
Categories
Discussions
General
Platform
Academic
Partner
Regional
User Groups
Documentation
Events
Altair Exchange
Share or Download Projects
Resources
News & Instructions
Programs
YouTube
Employee Resources
This tab can be seen by employees only. Please do not share these resources externally.
Groups
Join a User Group
Support
Altair RISE
A program to recognize and reward our most engaged community members
Nominate Yourself Now!
Home
Discussions
Altair RapidMiner
Classification by Regression Operator vs. Polynomial By Binomial Classification
mbuko
Hi,
I am analyzing some given text mining processes which use the SVM (
mySVM
) classification. Because there are several possible classes a
multi class
approach is needed. I also want an multi class output in order to get for each class a confidence value, but only one label / prediction.From my point of view only
binomial classification
can be used regarding SVM. In order to enable the multi class feature these two operators can be used to wrap around the SVM operator: Classification by Regression Operator or Polynomial By Binomial Classification.
For each of them I have trained a model and get different kinds of
confidence values
.
For
Classification by Regression Operator
: confidence value element of (-∞, 1]. This seems to be the signed distance to the hyperplane. Is this correct? Why are there no values higher than 1? (1 would mean that it is on the edge of the margin. Might it depend on the kernel function?)
For
Polynomial By Binomial Classification
: confidence value element of [0,1]. Is this any kind of
probability
? Definition?
For my purpose I need confidence values which are
quantitatively comparable
. But from my point of view the signed distances of binomial classification models are not comparable? A probability would be very helpful. I have read that Platt-Scaling and Isotonic Regression are suitable methods to achieve this - unfortunately I have not understand these methods yet (can I apply them after the training only based on the confidence values?).
So my final question is, how do these two operators handle the
training and the training data
, and also what is the meaning/definition of the confidence values? Are there any
references
or official information to the definitions? The RapidMiner documentation does not give any hint regarding these issues.
Thank you!
Best regards
mBuko
Find more posts tagged with
AI Studio
Classification
Regression
Comments
There are no comments yet
Quick Links
All Categories
Recent Discussions
Activity
My Discussions
Unanswered
日本語 (Japanese)
한국어(Korean)
Groups