Hi,
I've some questions about the values stored in a PerformanceVector
since I couldn't find any info in the documentation.
This one is taken from the sample 11_LeaveOneOut:
PerformanceVector [
-----accuracy: 77.50% +/- 14.58% (mikro: 77.50%)
ConfusionMatrix:
True: bad good
bad: 13 8
good: 1 18
-----precision: 96.00% +/- 8.00% (mikro: 94.74%) (positive class: good)
ConfusionMatrix:
True: bad good
bad: 13 8
good: 1 18
-----recall: 70.00% +/- 21.91% (mikro: 69.23%) (positive class: good)
ConfusionMatrix:
True: bad good
bad: 13 8
good: 1 18
-----AUC: 0.760 +/- 0.200 (mikro: 0.760) (positive class: good)
]
The confusion matrix by itself is clear. Also the computation of
the "class recall" and "class prediction" (not shown above, but
in RapidMiner) is obvious. My questions are:
1) What do the +/- values behind accuracy, precision and recall
mean? Like the 14.58% in
accuracy: 77.50% +/- 14.58%
Is this the confidence interval, i.e. in which interval the real value
with respect to the estimated value can lie? If so, what is the
confidence level used here? Does this have something to do
with the average and standard deviation?
2) How do you calculate the total precision (96%) and recall (70%)?
I didn't get it.

3) What is the meaning of the mikro values for the two parameters
from 2) ?
Cheers,
benjamin