Windows and UTF-8

eisioriginal
eisioriginal New Altair Community Member
edited November 5 in Altair RapidMiner
Hello,

i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.

When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.

I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?

Thank you

Andreas
Tagged: