Background information Sample datasets in RapidMiner Studio
dannyV
New Altair Community Member
Best Answers
-
hi @dannyV now that is a good question! I am always amazed on how few people ask about these kind of things.
Some of the sample data sets in RapidMiner come from the UCI (University of California Urvine) Machine Learning Repository. Iris is a good example: https://archive.ics.uci.edu/ml/datasets/Iris. Others, like Titanic, have been used in the field of data science so long that honestly I have no idea where the original source is (just tried googling for 5 min and kept being sent to Kaggle).
As @IngoRM was the one who likely inserted these back in the day, I'm tagging him for some insight here.
Scott
1 -
Yip, Scott covered this already. Most of those should be UCI data sets. If you are not finding the corresponding data set there, please ask for the specific one. I may remember the source :-)
5
Answers
-
hi @dannyV now that is a good question! I am always amazed on how few people ask about these kind of things.
Some of the sample data sets in RapidMiner come from the UCI (University of California Urvine) Machine Learning Repository. Iris is a good example: https://archive.ics.uci.edu/ml/datasets/Iris. Others, like Titanic, have been used in the field of data science so long that honestly I have no idea where the original source is (just tried googling for 5 min and kept being sent to Kaggle).
As @IngoRM was the one who likely inserted these back in the day, I'm tagging him for some insight here.
Scott
1 -
Yip, Scott covered this already. Most of those should be UCI data sets. If you are not finding the corresponding data set there, please ask for the specific one. I may remember the source :-)
5 -
Alright, thank you for the response.
I was looking for the SONAR dataset and I might have found it on UCI data sets..
Thank you!
Regards,
Danny1