Read PDF Error BouncyCastleProvider

johan_CG
johan_CG New Altair Community Member
edited November 5 in Community Q&A
Hi everybody,

I've got an unknow error when I run my RapidMiner process on some folder.
To make a long story short, the goal of my process can be describe as following:
 Starting points:
  • Excel file .xls with "Open File"
  • the path of a directory to scan with "Set macro"
Outputs of the process
  • a PDF report
  • several ExampleSet
The process counts words, keywords, calculates ratios ...
It works very well on some folders and on others I've got the following pop-up message:

Process failed
org/bouncycastle/jce/provider/BouncyCastleProvider
Reason: org.bouncycastle.jce.provider.BouncyCastleProvider
The process works on HTML and PDF files but in all folder there are also xls, csv, doc.

Can somebody help me please?
Thanks in advance
Johan

P.S.: I can't put the process because it's too long.

Answers

  • johan_CG
    johan_CG New Altair Community Member
    Hi Everybody,

    I've got some new information about my problem.
    The error occurs when I try to analyze PDF which is "protected".

    Can somebody help me to solve this problem?

    Thanks in advance
    Johan
  • Marco_Boeck
    Marco_Boeck New Altair Community Member
    Hi,

    unfortunately protected PDFs are currently not supported by the Text Extension, so you will have to make sure there are no protected PDFs in a folder you're trying to process.
    Thank you for reporting it however, I created an internal ticket for this issue.

    Regards,
    Marco
  • johan_CG
    johan_CG New Altair Community Member
    Hi Marco,

    Thank you for your reply.
    I found a small freeware to convert PDFs to Text and runs on protected also.
    http://www.colorpilot.com/extract-pdf-text.html
    Moreover this software is open source, maybe it can be usefull for a futur release of Text Processing Extension  ;)

    Regards,
    Johan