Hi,
In the Performance (Classification) operator, correlation is defined as the correlation between the label and the prediction, but how is this calculated? With which information criterion? I don't understand the results...
I think it's the standard Pearson correlation, computed using the integer mapping of the classes.
~Martin
Hi,
Just looked up the Javadoc and here is what it says: "Computes the empirical correlation coefficient 'r' between label and prediction. For P=prediction, L=label, V=Variance, Cov=Covariance we calculate r by:
Cov(L,P) / sqrt(V(L)*V(P))"
So it is the Pearson correlation between the label and the prediction. Judging from a quick check of the source code itself this description is accurate.
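For illustration only, here is a minimal Python sketch of that formula applied to nominal classes. It is not the operator's actual code: the function name `pearson_r`, the class names, and the `class_index` mapping are all made up for the example, and I am assuming, as Martin suggested, that each class is first replaced by an integer index.

```python
import math

def pearson_r(labels, predictions):
    """Empirical correlation r = Cov(L,P) / sqrt(V(L) * V(P))."""
    n = len(labels)
    mean_l = sum(labels) / n
    mean_p = sum(predictions) / n
    cov = sum((l - mean_l) * (p - mean_p)
              for l, p in zip(labels, predictions)) / n
    var_l = sum((l - mean_l) ** 2 for l in labels) / n
    var_p = sum((p - mean_p) ** 2 for p in predictions) / n
    # Note: this divides by zero if either column is constant.
    return cov / math.sqrt(var_l * var_p)

# Hypothetical integer mapping of the classes (an assumption for this example).
class_index = {"low": 0, "medium": 1, "high": 2}

labels = ["low", "medium", "high", "high", "low"]
predictions = ["low", "high", "high", "medium", "low"]

L = [class_index[c] for c in labels]
P = [class_index[c] for c in predictions]

r = pearson_r(L, P)
print(r)  # roughly 0.75 for this toy data
```

If the mapping really works this way, the value depends on which integer each class happens to get, so for nominal classes without a natural order the number can be hard to interpret. That might be why the results look confusing.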
Cheers,
Ingo