Hi,
I have two data sets: the first contains measured values, and the second contains values calculated with the Generate Attributes operator. I would like to compare these data sets statistically. I would prefer leave-one-out cross-validation [https://en.wikipedia.org/wiki/Cross-validation_(statistics)], but the Cross Validation operator in RapidMiner works differently (it splits the data set into training and validation sets).
Any suggestions?
Data:
3 independent variables (> 200 examples)
1 dependent variable
1 predicted variable
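For reference, leave-one-out cross-validation only applies when a model is fitted to the data: each example is held out once, the model is fitted on the remaining examples, and the held-out example is then predicted. A minimal Python sketch of the idea follows, assuming for brevity a least-squares line on a single predictor (the data above have three) and made-up values. The helper names `fit_line` and `loocv_predictions` are hypothetical, not RapidMiner operators.

```python
import math

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def loocv_predictions(xs, ys):
    """Predict each example from a model fitted on all the other examples."""
    preds = []
    for i in range(len(xs)):
        # Hold out example i and fit on the rest
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        slope, intercept = fit_line(train_x, train_y)
        preds.append(slope * xs[i] + intercept)
    return preds

# Made-up illustrative values (exactly linear), not the data from this thread
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 6.0, 8.0, 10.0]

held_out = loocv_predictions(xs, ys)
loocv_rmse = math.sqrt(
    sum((p - y) ** 2 for p, y in zip(held_out, ys)) / len(ys)
)
print(loocv_rmse)
```

If the predicted column comes from a fixed formula rather than a fitted model, there is nothing to refit, and a direct comparison of the measured and predicted columns may be all that is needed.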
Hi @hapaydin,
Can you share your dataset(s), please?
Regards,
Lionel
csv and jpg of my data
In RapidMiner you have direct access to the statistics of Runoff(M) and Runoff(Pre) in the Statistics panel of the Results view, where you can compare them.
Alternatively, you can use the Charts panel to plot your two data sets, for example as histograms.
If that is not enough, can you clarify what you want?
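Beyond per-column statistics, a paired comparison of measured and predicted values is often summarized with error metrics. A minimal Python sketch, assuming the two columns have been exported as equally long lists; the function name `comparison_metrics` and the sample values are made up for illustration and are not taken from the attached data:

```python
import math

def comparison_metrics(measured, predicted):
    """Return paired error metrics for two equally long sequences."""
    if len(measured) != len(predicted) or not measured:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(measured)
    errors = [p - m for m, p in zip(measured, predicted)]
    bias = sum(errors) / n                      # mean error
    mae = sum(abs(e) for e in errors) / n       # mean absolute error
    ss_res = sum(e * e for e in errors)
    rmse = math.sqrt(ss_res / n)                # root mean squared error
    mean_m = sum(measured) / n
    ss_tot = sum((m - mean_m) ** 2 for m in measured)
    # Coefficient of determination relative to the mean of the measured values
    r2 = 1.0 - ss_res / ss_tot if ss_tot > 0 else float("nan")
    return {"bias": bias, "mae": mae, "rmse": rmse, "r2": r2}

# Made-up illustrative values, not the Runoff data from this thread
measured = [1.0, 2.0, 3.0, 4.0]
predicted = [1.5, 2.0, 2.5, 4.0]

metrics = comparison_metrics(measured, predicted)
print(metrics)
```

Which metric matters most depends on the application; RMSE penalizes large errors more heavily than MAE, and the bias shows whether the predictions are systematically too high or too low.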
Hi again @hapaydin,
To go further and complete my last reply, you can explore the different operators
of the Statistics Extension (available to download and install from the Marketplace).
I hope this helps.
Thank you for your kind interest. I will examine the Statistics Extension.