Partial retrieve of example set

kypexin
kypexin New Altair Community Member
edited November 2024 in Community Q&A

Hi rapidminers, 

 

Small question, by any chance is there a possibility of 'partial retrieve' of an example set in RM? For example, if I have example set of 1,000,000 rows and I want to perform some kind of 'filter examples' without first retrieving the whole example set from a repository?

 

Thanks.  

Tagged:

Best Answer

  • sgenzer
    sgenzer
    Altair Employee
    Answer ✓

    Hello @kypexin - so that's an interesting question.  Not from a locally-stored repository from what I know.  In general when I have a dataset that large and wish to pull pieces of it for analysis in RapidMiner, I will put it in SQL database and then use a query to pull what I need.

     

    Scott

     

Answers

  • sgenzer
    sgenzer
    Altair Employee
    Answer ✓

    Hello @kypexin - so that's an interesting question.  Not from a locally-stored repository from what I know.  In general when I have a dataset that large and wish to pull pieces of it for analysis in RapidMiner, I will put it in SQL database and then use a query to pull what I need.

     

    Scott

     

  • kypexin
    kypexin New Altair Community Member

    Thanks @sgenzer, basically that was my question whether RM can incorporate some 'best prtactices' from SQL in data handling like I have described. Obviously SQL is suitable for that :)

  • sgenzer
    sgenzer
    Altair Employee

    yes I would say so.  Any dataset that has trouble being loaded locally should probably just be queried from a database.


    Scott