Cross validation

Papad
New Altair Community Member
Hello,
Can anybody help me with this problem?
In the first picture I have this:

Here I measure the performance on the same data, and the accuracy is 87.44%.
When I run the same procedure inside cross-validation, like this:

(inside cross validation)

the accuracy I get is 82.11%.
It is the same procedure, just inside a Cross Validation operator.
Why is there a difference between the two cases?
What I understand is that in the second case my model is trained and the performance is then measured on the testing partition, so the result is more realistic.
So more training doesn't always mean greater accuracy?
I hope my question is clear.
Thanks in advance.
Best Answers
The first picture measures how well you describe your training data. The second one measures how well you predict unknown (out-of-sample) data. You almost always want the second.
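The gap between the two numbers can be reproduced outside RapidMiner. Here is a minimal, hypothetical sketch (not the original process from the pictures) using a tiny 1-nearest-neighbour classifier on noisy data: scored on its own training data it is trivially 100% accurate, while 10-fold cross-validation, which always tests on points the model never saw, reports a lower and more honest number.

```python
# Hypothetical illustration (not the thread's RapidMiner process):
# score a 1-nearest-neighbour classifier on its own training data
# vs. with 10-fold cross-validation.
import random

def predict_1nn(train, point):
    """Return the label of the training point closest to `point`."""
    nearest = min(train, key=lambda t: (t[0] - point) ** 2)
    return nearest[1]

random.seed(0)
# Noisy 1-D data: label is 1 when x > 0.5, but 20% of labels are flipped.
data = []
for _ in range(200):
    x = random.random()
    label = int(x > 0.5)
    if random.random() < 0.2:
        label = 1 - label
    data.append((x, label))

# "First picture": test on the training data itself.
resub_acc = sum(predict_1nn(data, x) == y for x, y in data) / len(data)

# "Second picture": 10-fold cross-validation, testing on held-out folds.
k = 10
correct = 0
for i in range(k):
    test = data[i::k]                                  # every k-th row is the test fold
    train = [d for j, d in enumerate(data) if j % k != i]
    correct += sum(predict_1nn(train, x) == y for x, y in test)
cv_acc = correct / len(data)

print(f"Accuracy on training data: {resub_acc:.1%}")   # 100%: each point is its own neighbour
print(f"Cross-validated accuracy:  {cv_acc:.1%}")      # lower: measured on unseen points
```

The training-data score is perfect only because the model memorised the noise; the cross-validated score is the one that says something about future data.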
Hello @Papad
As Martin said, in the first case you are training and testing the model on the same data, which is not a valid way to evaluate it. In the second case you are cross-validating the model: it is trained on one part of the data and tested on another part that it has never seen. This is the best method to validate your model.
To understand cross-validation, here is an excellent post from Scott.
https://community.rapidminer.com/discussion/54621/cross-validation-and-its-outputs-in-rm-studio
Thanks
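To make "training on one part and testing on a part the model never saw" concrete, here is a small sketch of what a 10-fold cross-validation operator does internally (a simplified assumption of the mechanics, not RapidMiner's actual implementation): the data is split into 10 folds, and each fold serves as the test set exactly once while the other 9 are used for training.

```python
# Hypothetical sketch of 10-fold cross-validation mechanics:
# each row is tested exactly once, by a model trained without it.
data = list(range(20))  # stand-in for 20 example rows
k = 10

folds = [data[i::k] for i in range(k)]  # 10 disjoint folds of 2 rows each
for i, test in enumerate(folds):
    train = [x for f in folds if f is not test for x in f]
    # here you would train the model on `train` and score it on `test`
    print(f"fold {i}: train on {len(train)} rows, test on {len(test)} rows")

# Every row appears in exactly one test fold.
tested = sorted(x for f in folds for x in f)
assert tested == data
```

The reported cross-validation accuracy is the average of the 10 per-fold accuracies, so every example contributes to the estimate once as unseen test data.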
Answers
What I can't fully understand is this: in the cross-validation case we have one set of data, and we already know the results, so how does this tell us anything about unknown data?