sentiment extraction for non-English
wclaster
New Altair Community Member
Hello. Are there sentiment analysis operators or tools for working with Japanese? How about Chinese? And how about other Asian languages? I saw the Sentiment Extract operator. It seems to have German and French versions for Vader. Thank you!
Tagged:
0
Best Answer
-
1
Answers
-
Hello Wclaster,Here you have two links with resourses for that.
Problem Mandarin Text mining - HanMiner — RapidMiner Community
GitHub - joeyhaohao/rapidminer-HanMiner: A Rapidminer extension for easy Chinese language processing and text miningkind regards
0 -
Thank you! I will leave this question open because I am really looking for Japanese.0
-
Hi @wclaster ,if you have chinese or japanase dictionaries i can add them . Not a big thing. The bigger one would be tokenization in those languages.Best,Martin0
-
@ceaperez by the way, if you have a good Spanish dictionary I am happy to add this as well . I didn't find anything in a quick search for one. ideally i want to cover the big languages with a dictionary each.
0 -
Hello mschmitz, thank you. Yes, I think tokenization would be quite a challenge. MeCab is an open-source text segmentation library for use with text written in the Japanese language but I don't know how this would all fit together.
From Wikipedia
Besides segmenting the text, MeCab also lists the part of speech of the word, and, if applicable and in the dictionary, its pronunciation.
MeCab - Wikipedia
Would this be simple?0 -
1
-
Bit late to the party but we had some decent results using Ginza together with Spacy, using the python extension in some of our rapidminer workflows.0