Leave One Out results in AUC of 0.5

Question

My target label is binominal, number of examples is 553. Running supervised classification deep learning with cross validation:* 10-fold  results in AUC = 0.846 and Accuracy = 76%
* Leave 3 Out (180 fold) results in AUC = 0.5 and Accuracy = 42%

How should I interpret this? Which config should I trust?

rfuentealba · Accepted Answer

Hello @erik_van_ingen,

It may sound a bit mind-boggling, but I wouldn't trust any of these. Why? Because you are using supervised Deep Learning and your number of examples is not big enough to justify it.

First of all, I would check if the classes are balanced enough to provide meaningful training and repeat the results.

Now, this raises another question: what kind of sampling are you using for your cross-validation? Try using stratified sampling and check how it performs. If you are using linear sampling, for example, and you have your data ordered by your label or target variable, leaving one out will probably not work well. Do this before beginning to work with SMOTE sampling or your sampling technique du jour.

Hope this helps,

Rodrigo.

varunm1 · Accepted Answer

Hello @erik_van_ingen

Would like to point two things. First, as the data set is small the higher number of folds give lesser test data which means you will have a lot of variance in your results. This is due to the inability of the test set to capture all the underlying distributions in data. I recommend going with a 3 or 5 fold for this dataset.

Second, I am not sure if you applied any feature selection techniques (forward, backward etc.) on your data, you can do that and see attributes that are helpful in predicting. This might improve your performances and reduce computational complexity as well.

Thanks for your understanding.