I need help trapping data with inconsistent placement in text
I’m not very familiar with this version of Monarch, we’ve had a much older version for many years, so this is a great improvement.
My issue is that the position of the header information in my data changes based on whether there are more or less than 3 columns per page, and also based on the length of the provider name and location, which are centered (name) and right justified (location).
The data is in printed schedules. A schedule can be anywhere from 1 - 7 columns wide. All of the schedules arrive in the same file and can’t be broken out by number of columns. I’m able to pull the detail lines fine, and can get the first 2 fields from the header rows, but then I have problems with the header.
In the attached example the first schedule has only 2 columns. Everything on the first header line to the left of the words “Daily Schedule” is left justified, to the right it is right justified. In the second header line, the provider number and name are centered with the location number and name right justified.
The second schedule (in the same file) has 3 columns. You can see the header is much wider.
I considered running the process to grab the data twice using 2 different templates, but I can’t figure out how to tell it to scrape based on 2 or 3 columns. I've attached screenshots so I could obfuscate certain information. I could attach the actual file after redacting if that would be helpful.
I appreciate any help or advice you have. Thanks!
Answers
-
Hi Bernadette! Looks like you need a multi column region to extract this. I will email you so we can work with this via a screenshare.0
-
Hello Baba,
I just replied via email. Thank you!
Bernadette
0