Windows and UTF-8
eisioriginal
New Altair Community Member
Hello,
i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.
When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.
I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?
Thank you
Andreas
i currently try to use Rapidminer to crawl some chinese content. I use the crawl web operator and store the crawled pages to my file system. I also use a content filter within my process.
When i set some chinese words within the content filter those characters are ??? when i reload the process within rapid miner. I also have wrong characters in the resulting crawled pages in my folder because the files are stored in ANSI Format.
I already tried the encoding option of rpid miner with no success. How can i run RapidMiner on windows in a way that its storing utf-8 files and process files?
Thank you
Andreas
Tagged:
0