problem with Arabic diacritics

yusuf_nazari
yusuf_nazari New Altair Community Member
edited November 2024 in Community Q&A

Hi

there is a problem with "Arabic diacritics".

Rapidminer do not recogniz it as i know.

for example no result for a text like this :

أنا أدرُسُ في فَرع اللُّغة العَربيَّة

but if you emit the diacritics reach the result.

also Rapidminer not recognise أن إن as a two diffrent words.

 

please help me solve this problems.

thanks

Tagged:

Answers

  • sgenzer
    sgenzer
    Altair Employee

    hello @yusuf_nazari - I do not speak Arabic so unfortunately I can provide little help. However I do know that very often this is an encoding issue (UTF-8 vs ISO-8859-1, etc...). I would strongly suggest that you "play around" with your system encoding to see if this will improve things.


    Scott