Combining extraction methods for PDFs
I'm using Data Prep Studio to parse the data in a PDF. The PDF contains multiple tables as well as some ad-hoc text in various places. The PDF table extractor works perfectly, but I'm unable to grab anything other than tables with this method. Is it possible to combine techniques? A sample file is attached. Any help would be appreciated. I'm new to Monarch and still evaluating the free trial. For example, how would I grab "Ohio Capital Partners" from the top of the page in addition to all the other tables? I'd also like to capture the date from the top (July 31, 2020) and the name of the fund, (Ohio Capital Partners Onshore LP)