Background information Sample datasets in RapidMiner Studio

dannyV
dannyV New Altair Community Member
edited November 5 in Community Q&A
Hi All,

Where can I find the background information on the available datasets in the Sample folder in RapidMiner Studio?
I'd like to know what the data is about before using the data.

Thank you.
Regards,
Danny

Best Answers

  • sgenzer
    sgenzer
    Altair Employee
    Answer ✓
    hi @dannyV now that is a good question! I am always amazed on how few people ask about these kind of things.
    Some of the sample data sets in RapidMiner come from the UCI (University of California Urvine) Machine Learning Repository. Iris is a good example: https://archive.ics.uci.edu/ml/datasets/Iris. Others, like Titanic, have been used in the field of data science so long that honestly I have no idea where the original source is (just tried googling for 5 min and kept being sent to Kaggle).
    As @IngoRM was the one who likely inserted these back in the day, I'm tagging him for some insight here.
    Scott

Answers

  • sgenzer
    sgenzer
    Altair Employee
    Answer ✓
    hi @dannyV now that is a good question! I am always amazed on how few people ask about these kind of things.
    Some of the sample data sets in RapidMiner come from the UCI (University of California Urvine) Machine Learning Repository. Iris is a good example: https://archive.ics.uci.edu/ml/datasets/Iris. Others, like Titanic, have been used in the field of data science so long that honestly I have no idea where the original source is (just tried googling for 5 min and kept being sent to Kaggle).
    As @IngoRM was the one who likely inserted these back in the day, I'm tagging him for some insight here.
    Scott

  • dannyV
    dannyV New Altair Community Member
    Alright, thank you for the response.
    I was looking for the SONAR dataset and I might have found it on UCI data sets..

    Thank you!

    Regards,
    Danny