My target label is binary (binominal) and I have 553 examples. I am running supervised deep learning classification and evaluating it with two cross-validation schemes:
- 10-fold results in AUC = 0.846 and Accuracy = 76%
- Leave-3-out (180 folds) results in AUC = 0.5 and Accuracy = 42%
How should I interpret this? Which config should I trust?
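
To make the comparison concrete, here is a minimal sketch of how the two schemes could be scored in the same way. It assumes scikit-learn, synthetic data from `make_classification`, and a `LogisticRegression` stand-in for the deep model; none of these come from my actual setup. It pools the out-of-fold predictions before computing AUC, because a 3-example test fold can contain only one class, in which case a per-fold AUC is undefined.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, roc_auc_score
from sklearn.model_selection import KFold, StratifiedKFold, cross_val_predict

# Synthetic stand-in for the real data (assumption): 553 examples, binary label.
X, y = make_classification(
    n_samples=553, n_features=20, n_informative=5,
    n_clusters_per_class=1, random_state=0,
)

def pooled_scores(cv):
    """Collect out-of-fold probabilities, then score them once over all examples."""
    model = LogisticRegression(max_iter=1000)  # stand-in for the deep model
    proba = cross_val_predict(model, X, y, cv=cv, method="predict_proba")[:, 1]
    auc = roc_auc_score(y, proba)
    acc = accuracy_score(y, (proba >= 0.5).astype(int))
    return auc, acc

# Scheme 1: 10-fold, stratified so each fold keeps both classes.
cv_10 = StratifiedKFold(n_splits=10, shuffle=True, random_state=0)
auc_10, acc_10 = pooled_scores(cv_10)

# Scheme 2: roughly leave-3-out, i.e. 553 // 3 = 184 folds of 3 or 4 examples.
cv_l3 = KFold(n_splits=553 // 3, shuffle=True, random_state=0)
auc_l3, acc_l3 = pooled_scores(cv_l3)

# Count tiny test folds that contain a single class (no per-fold AUC possible).
single_class_folds = sum(
    len(np.unique(y[test_idx])) < 2 for _, test_idx in cv_l3.split(X)
)

print(f"10-fold:     AUC={auc_10:.3f}  accuracy={acc_10:.3f}")
print(f"leave-3-out: AUC={auc_l3:.3f}  accuracy={acc_l3:.3f}")
print(f"single-class test folds in leave-3-out: {single_class_folds}")
```

If the pooled scores of the two schemes agree on the real data, the gap I reported would point at how the per-fold metrics were aggregated rather than at the model itself.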