"[SOLVED] Question: Cross-Validation"
Hello everyone,
while learning RapidMiner, I came across X-Validation (which is a useful thing!), but how does it exactly work?
Let's assume, we've got a data set of 100 examples and want to build a decision tree and the number of validation is 10.
There are (at least) 2 possibilities:
a) The output model is the decision tree based on the 100 examples, but the performance is always trained with 90 examples and tested with 10 examples (so the tree might always be different than the actual output tree!)
b) The output model is the decision tree based on the 100 examples and the performance is tested with 10 * 10 examples on the output tree.
After reading the description of X-Validation, I think a) is correct, but b) makes more sense, since the decision tree in a) might always be different than the actual output tree.
Which alternative is correct and if it is a) am I right that the tree might always be different?
Cheers Q-Dog