Operator Crawl: Process failed

domo New Altair Community Member
edited November 5 in Community Q&A
Hi,
I have installed the latest version, 10.1.001, and I have a problem with the Crawl operator.
The process fails with the error message below.
I have checked my Java installation, and the version is 1.8.0_361.

  • Exception: java.lang.NoClassDefFoundError
  • Message: org/apache/tika/parser/html/HtmlParser
  • Stack trace:
  • edu.uci.ics.crawler4j.parser.TikaHtmlParser.(TikaHtmlParser.java:34)
  • edu.uci.ics.crawler4j.parser.Parser.(Parser.java:42)
  • edu.uci.ics.crawler4j.crawler.CrawlController.(CrawlController.java:85)
  • com.rapidminer.operator.web.crawler.CrawlerOperator.doWork(CrawlerOperator.java:269)
  • com.rapidminer.operator.Operator.execute(Operator.java:1024)
  • com.rapidminer.operator.execution.SimpleUnitExecutor.execute(SimpleUnitExecutor.java:77)
  • com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:804)
  • com.rapidminer.operator.ExecutionUnit$2.run(ExecutionUnit.java:799)
  • java.base/java.security.AccessController.doPrivileged(Native Method)
  • com.rapidminer.operator.ExecutionUnit.execute(ExecutionUnit.java:799)
  • com.rapidminer.operator.OperatorChain.doWork(OperatorChain.java:423)
  • com.rapidminer.operator.Operator.execute(Operator.java:1024)
  • com.rapidminer.Process.executeRoot(Process.java:1476)
  • com.rapidminer.Process.lambda$executeRootInPool$5(Process.java:1452)
  • com.rapidminer.studio.concurrency.internal.AbstractConcurrencyContext$AdaptedCallable.exec(AbstractConcurrencyContext.java:362)
  • java.base/java.util.concurrent.ForkJoinTask.doExec(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool$WorkQueue.topLevelExec(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool.scan(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinPool.runWorker(Unknown Source)
  • java.base/java.util.concurrent.ForkJoinWorkerThread.run(Unknown Source)
Could you help me to solve the problem?
Thanks

Answers

  • ruhaila New Altair Community Member
    Could you share the process file (.rmp) so we can try to reproduce the error?
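
    A NoClassDefFoundError means the JVM could not load a class at run time that the calling code was compiled against, here org.apache.tika.parser.html.HtmlParser. As a rough way to check whether that class is visible on a given classpath, something like the sketch below could help. The names ClassCheck and isLoadable are made up for illustration, and the check only reflects the classpath the program is started with, which is not necessarily the one the Crawl operator sees inside the application.

```java
public class ClassCheck {

    /**
     * Returns true if the named class can be located by the class loader
     * that loaded ClassCheck, false otherwise.
     */
    static boolean isLoadable(String className) {
        try {
            // initialize = false: only locate and load the class,
            // do not run its static initializers.
            Class.forName(className, false, ClassCheck.class.getClassLoader());
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Class is missing, or one of its dependencies could not be linked.
            return false;
        }
    }

    public static void main(String[] args) {
        // Class name taken from the error message in the question.
        String name = "org.apache.tika.parser.html.HtmlParser";
        System.out.println(name + " loadable: " + isLoadable(name));
    }
}
```

    To test a particular set of jars, put them on the classpath when starting the program (java -cp ...). If the result is false, the Tika parser classes are probably not in those jars.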