Text Processing

Rhmanig
Rhmanig New Altair Community Member
edited November 2024 in Community Q&A
Hi

I am using Process Document to tokenize text (plus transform case, filter stop words and generate n-grams). I wonder why RapidMiner does not make a use of free memory and CPU and the process takes such a log time.

The current data size is 1059MB and the process is running for almost 5 days :/ The system has four cores and 29GB RAM. on average it uses %46 of CPU and right now it uses 75% of memory (the memory usage is going up slowly).

Please explain if you know why.

Thanks

Tagged:

Answers

  • MartinLiebig
    MartinLiebig
    Altair Employee
    Hi!

    Could you provide me with the process itself? Are there any Loops inside?

    Cheers,

    Martin