"Struggling importing CSV from R Studio to Rapidminer"

matteomartignon
matteomartignon New Altair Community Member
edited November 5 in Community Q&A

Hello guys,

I have a lot of problems importing my CSV from R Studio into Rapidminer.

The firt 50 rows are completly gone..

 

Here the comand in R Studio I used to save my dataframe:

write.table(musei, file = "musei_18_F.csv", quote = TRUE, sep = ",", row.names = FALSE)

 

//SCREENSHOTS

Schermata 2017-07-26 alle 11.26.59.pngMy CSV from R StudioSchermata 2017-07-26 alle 11.26.43.pngThe same CSV imported in Rapidminer

Tagged:

Answers

  • Thomas_Ott
    Thomas_Ott New Altair Community Member

    You have 31 warnings when importing the CSV file, look at the lower right of the 2nd image. Click on that and see what it says.

  • matteomartignon
    matteomartignon New Altair Community Member

    Got it

    Is there a way to fix this problem in R?

    Thanks on advanceSchermata 2017-07-26 alle 12.23.36.pngWarnings

  • Telcontar120
    Telcontar120 New Altair Community Member

    In the last screenshot it looks like your separator might be a ";" instead of a "," which isn't what you are telling RapidMiner, which would definitely cause problems.  If you post a sample data file in addition to the screenshots it would be easier to troubleshoot.

     

  • matteomartignon
    matteomartignon New Altair Community Member

    Sure, this is my CSV.

    It is a data frame I extracted with RFacebook. The function was "search.pages".

    Thanks in advance,

    matteo

  • Telcontar120
    Telcontar120 New Altair Community Member

    Looking at the raw csv it is evident that there are many line break characters in some of the records and that is what is causing the parsing problem with the import of certain records in RapidMiner.  It's theoretically possible to clean this up in RapidMiner but it is probably easier to do it in R or Python using some scripting to remove the extra characters before importing.

     

  • SGolbert
    SGolbert New Altair Community Member

    I've also seen your CSV. As far as I can tell, there is a problem writing the CSV. The problem is entirely in the R part, so there is no much help to give from our part.

     

    I can suggest to save the CSV and then read it again from R with read.table() or similar. When you manage to correctly retrieve the file, it will be ready to be read by RapidMiner.

     

    Or you can put the code that generates the dataset inside the R scripting operator in RapidMiner, then you won't need to save a CSV file.