Replacing special characters when text mining

robin
robin New Altair Community Member
edited November 5 in Community Q&A
Hi

I am attempting to replace special characters inside of a dataset but not having much luck. 

André Sho
émaker
Adri
é Spéllman

It seems as though RapidMiner is unable to understand those characters and is simply marking them as 
�. Non of the replace operators can seem to fix this issue. 

Should I be doing this inside of a generate attributes operator to account for this or is there a better way?


Best Answers

Answers

  • BalazsBaranyRM
    BalazsBaranyRM New Altair Community Member
    Hi!

    Where is your dataset coming from? Operators like Read CSV that import external data have an "encoding" parameter. If you set the encoding correctly for your data import, you'll see all special characters.

    Regards,
    Balázs
  • rfuentealba
    rfuentealba New Altair Community Member
    Another thing: -I is a capital "i".