Can RapidMiner Studio also be used as an ETL tool to create a new database on a local server?

FMartin New Altair Community Member
edited November 5 in Community Q&A
Hello,

I would like to acquire a RapidMiner Studio license first, before investing in RM Server, but I have one question.
Can we prepare data with RM Studio and create/write a new database on a local server (SQL Server)?
The new database should be available to RM Studio but also to any other software.
I suppose this is possible, but I want to be sure.

Thanks

Best Answer

  • BalazsBarany
    BalazsBarany New Altair Community Member
    Answer ✓
    Hi @FMartin,

    RapidMiner is capable of creating and maintaining databases. I've been using it for these purposes for many years.
    Here is an overview of the operators for database access.

    You can use the operator Execute SQL for doing database-specific things like creating databases, users, schemas, tables etc.

    Write Database appends data to an existing database table, overwrites existing data in the table if you want, and can even create the appropriate tables if they don't exist yet.
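
As a rough illustration of the append / overwrite / create-if-missing behaviour described above, here is a minimal Python sketch using the standard-library `sqlite3` module as a stand-in for SQL Server. This is not RapidMiner's implementation or API; the `write_table` helper, its `mode` values, and the `customers` table are hypothetical names chosen for the example.

```python
import sqlite3


def write_table(conn, table, rows, mode="append"):
    """Write (id, name) rows to a table, creating it if it is missing.

    Hypothetical helper: mode="append" keeps existing rows,
    mode="overwrite" clears the table before inserting.
    """
    if mode not in ("append", "overwrite"):
        raise ValueError(f"unknown mode: {mode}")
    # The table name is interpolated directly, so only pass trusted identifiers.
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} (id INTEGER, name TEXT)")
    if mode == "overwrite":
        conn.execute(f"DELETE FROM {table}")
    conn.executemany(f"INSERT INTO {table} (id, name) VALUES (?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")

# First write creates the table, second write appends to it.
write_table(conn, "customers", [(1, "Ada"), (2, "Grace")])
write_table(conn, "customers", [(3, "Linus")])
count_after_append = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]

# Overwrite replaces everything that was in the table.
write_table(conn, "customers", [(9, "Edsger")], mode="overwrite")
rows_after_overwrite = conn.execute("SELECT id, name FROM customers").fetchall()
```

Because the result is an ordinary database table, any other software that can connect to the database can read it as well.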

    Update Database works on an existing table. You define a combination of fields (one or more) that are unique in the table. If a matching record is already there, Update Database changes it (update). If not, it gets inserted.
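
The update-or-insert logic described above can be sketched in plain SQL terms as follows. This is an illustrative Python example with `sqlite3` standing in for SQL Server, not what RapidMiner generates internally; the `upsert` helper and the `customers` table with `id` as the unique key are assumptions made for the example.

```python
import sqlite3


def upsert(conn, row_id, name):
    """Update the row whose unique key matches; insert it if none matched."""
    cur = conn.execute(
        "UPDATE customers SET name = ? WHERE id = ?", (name, row_id)
    )
    if cur.rowcount == 0:
        # No existing record with this key, so add a new one.
        conn.execute(
            "INSERT INTO customers (id, name) VALUES (?, ?)", (row_id, name)
        )
    conn.commit()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")

upsert(conn, 1, "Ada")           # inserted
upsert(conn, 2, "Grace")         # inserted
upsert(conn, 1, "Ada Lovelace")  # key 1 exists, so the row is updated

result = conn.execute("SELECT id, name FROM customers ORDER BY id").fetchall()
```

After the three calls the table holds two rows, with the name for key 1 replaced by the later value.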

    Read Database can be used for reading entire tables/views, or executing complex SQL queries that return a result.

    There's also the In-Database Processing extension that can put complex data processing operations (join, filter, sort, transform) into the database by generating the SQL statements behind the scenes. You just work with RapidMiner operators and don't need to know SQL.

    There's one limitation, though. RapidMiner works by loading all the data into memory. If you load very large tables in full and only then filter and process the result, instead of doing this work in the database, performance will be poor. This is where In-Database Processing helps.
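
To make the difference concrete, here is a small Python sketch (again with `sqlite3` as a stand-in database, and a hypothetical `orders` table) contrasting the two approaches: pulling every row into memory and filtering afterwards, versus letting the database apply the filter so only matching rows are transferred. It only illustrates the general idea and does not reproduce the SQL that the In-Database Processing extension generates.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders (id, amount) VALUES (?, ?)",
    [(i, float(i)) for i in range(1, 1001)],
)

# Approach 1: load the whole table, then filter in memory.
all_rows = conn.execute("SELECT id, amount FROM orders").fetchall()
large_in_memory = [row for row in all_rows if row[1] > 990]

# Approach 2: push the filter into the database; only matches are returned.
large_in_db = conn.execute(
    "SELECT id, amount FROM orders WHERE amount > ? ORDER BY id", (990,)
).fetchall()
```

Both approaches give the same rows, but the first one transfers and holds the entire table in memory, which is what becomes expensive with very large tables.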

    Regards,
    Balázs

Answers

  • FMartin
    FMartin New Altair Community Member
    Thanks for your help