How to do OCR with images

Question

Hello all,

I want to do OCR on images in order to get text.

I already installed the (image Miner extension, Text processing extension, Aylien Text Analysis extension, and Feature Selection extension)

after I use the "Open Color Image" or "Open Gray-scale Image" operators, which operator should I use to recognize and extract the text and features from image file?

Any help will be appreciated.

Thanks,

lj · Answer

Hi Thomas,

Wow, that sounds good. I'm excited.Thank You a lot.

I will leave a feedback after having tried it out.

Best whishes.

Lukas

Thomas_Ott · Answer

There is a new extension on the Marketplace that RM just developed called PDF Table Extraction. Maybe that can help?

https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_pdf_table_extraction

lj · Answer

Hi,

I somehow didn't notice Your reply. Sorry. And then, as always the time.

I'll take a look on Tesseract.

Thank You.

Beste Whishes

Lukas