"Google CSV export format issue - AdWords, WebmasterTools, Analytics..."

Antal_Sofalvy
Antal_Sofalvy New Altair Community Member
edited November 2024 in Community Q&A

Hello,

Recently I've realized some issues with Google generated CSV files. When I CSV Read them into Rapidminer (v7.1) the result is strange - I attached a file for

The same happens in case of any CSV files exported from Google web related tools, like AdWords, Keyword Planner, WebmasterTools, etc.

The CSVs are usually tab separated and this is win1250 (I think) - but charset does not an issue

Please help me what I am overlooking?

 

Thanks,

Antal

 

PS Read File turnaround works but VERY time consuming...

 

Tagged:

Welcome!

It looks like you're new here. Sign in or register to get started.

Best Answer

Answers

  • bhupendra_patil
    bhupendra_patil New Altair Community Member
    Answer ✓

    it looks like some sort of encoding issue,

     

    in your import wizard try one of the UTF -encoding like in the screen shot below

    2016-08-09 16_26_51-Settings.png

     

     

  • Antal_Sofalvy
    Antal_Sofalvy New Altair Community Member

    Thanks it looks OK - so encoding setting is the solution.

    The issue is, that NOT every CSV is encoded accordingly - so I have to find out one by one... but it is my challenge.

     

     

    Thanks,

    Antal

     

     

  • Antal_Sofalvy
    Antal_Sofalvy New Altair Community Member

    Thank you!

    So it is encoding - hopefully all Google CSVs are UTF16...

    Regards,

    Antal

     

    BTW if you want to set tab / tabulator for coloumn separator type \t

    I have not found it in the documentation, hopefully it is useful

Welcome!

It looks like you're new here. Sign in or register to get started.

Welcome!

It looks like you're new here. Sign in or register to get started.