How to store data only in RapidMiner Server Repository

gracekim
gracekim New Altair Community Member
edited November 5 in Community Q&A

Hello, I am gracekim.

I want to analyze security-sensitive data that contains personal information.

I want this data to be stored only on the RapidMiner Server.

Is there a way to do this?

Answers

  • varunm1
    varunm1 New Altair Community Member
    edited August 2019
    Hello @gracekim

    As far as I understand, the data is stored on a local disk, usually the C drive on a Windows system, even when you store it on a server.

    For example, C:\Users\***\rapidminer-server\rapidminer-server-home\data

    Are you looking for a way to store data in a local server repository, or are you asking about storing it in some cloud? I just want to confirm.
  • gracekim
    gracekim New Altair Community Member

    Hello @varunm1

    First of all, I'm sorry that my explanation was unclear.

    RapidMiner Server was installed on the server computer and RapidMiner Studio was installed on the personal computer.

    I run my analysis process while connected to the server repository, using localhost:port.

    Is there a separate process for storing data in local server storage?

    My thinking is that if several people use data from a database hosted on the server computer, the data should be stored only in the server repository so that security can be managed.

    What can I do to prevent analysts from storing data on their personal computers?

  • kypexin
    kypexin New Altair Community Member
    Hi @gracekim
    "What can I do to prevent analysts from storing data on their personal computers?"
    Technically, nothing. Once data is used within a RapidMiner process, it can be stored either on the remote server or in a local repository, and you cannot restrict saving the data locally.
  • gracekim
    gracekim New Altair Community Member

    @kypexin

    Thank you so much for your answer.

    My company recently raised these questions because of concerns about personal information security. I'll try to solve it another way.

  • Marco_Boeck
    Marco_Boeck New Altair Community Member
    Hi,

    In these cases, I strongly suggest keeping the data in a database that client machines cannot access because of network security settings. You can build a process to work on the data, though it will be a bit annoying because you have no metadata to help pre-fill operator parameters (you'd need to know the column names yourself and enter them manually).
    Your processes can then run via RM Server and work on the data, as the JobContainer machines would be set up with network access to the DB, without the data ever reaching the local computers. That's pretty much the only way.
    I mean, just looking at the data would allow users to copy and paste it somewhere, so you have to strictly keep it out of their hands at all times if that is needed.

    Regards,
    Marco
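
To make the network rule described above a bit more concrete, here is a minimal Python sketch of the allowlist logic: only machines in the JobContainer subnet may reach the database, while analyst workstations may not. The subnet and the IP addresses are hypothetical placeholders, and the function name is made up for illustration. Real enforcement would live in the firewall or the database host configuration, not in a script like this.

```python
import ipaddress

# Hypothetical subnet for the JobContainer machines; replace with your own layout.
JOB_CONTAINER_SUBNETS = [ipaddress.ip_network("10.0.5.0/24")]


def may_reach_database(client_ip: str) -> bool:
    """Return True only if client_ip belongs to one of the JobContainer subnets."""
    address = ipaddress.ip_address(client_ip)
    return any(address in subnet for subnet in JOB_CONTAINER_SUBNETS)


if __name__ == "__main__":
    # One hypothetical JobContainer address and one hypothetical analyst workstation.
    for ip in ("10.0.5.17", "192.168.1.42"):
        print(ip, "allowed" if may_reach_database(ip) else "blocked")
```

A check like this can be handy for reviewing a planned firewall rule set before it is applied, for example by running it over the list of Studio client addresses and confirming that none of them would be allowed.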