I need help trapping data with inconsistent placement in text

Bernadette_21682
Bernadette_21682 New Altair Community Member
edited June 2021 in Community Q&A

 

 

I’m not very familiar with this version of Monarch, we’ve had a much older version for many years, so this is a great improvement.

My issue is that the position of the header information in my data changes based on whether there are more or less than 3 columns per page, and also based on the length of the provider name and location, which are centered (name) and right justified (location).

The data is in printed schedules. A schedule can be anywhere from 1 - 7 columns wide. All of the schedules arrive in the same file and can’t be broken out by number of columns. I’m able to pull the detail lines fine, and can get the first 2 fields from the header rows, but then I have problems with the header.  

In the attached example the first schedule has only 2 columns. Everything on the first header line to the left of the words “Daily Schedule” is left justified, to the right it is right justified. In the second header line, the provider number and name are centered with the location number and name right justified.

The second schedule (in the same file) has 3 columns. You can see the header is much wider.

I considered running the process to grab the data twice using 2 different templates, but I can’t figure out how to tell it to scrape based on 2 or 3 columns. I've attached screenshots so I could obfuscate certain information. I could attach the actual file after redacting if that would be helpful.

I appreciate any help or advice you have. Thanks!

Answers

  • Baba_Majekodunmi_703
    Baba_Majekodunmi_703
    Altair Employee
    edited June 2021
    Hi Bernadette! Looks like you need a multi column region to extract this. I will email you so we can work with this via a screenshare.
  • Bernadette_21682
    Bernadette_21682 New Altair Community Member
    edited June 2021

    Hello Baba,

     

    I just replied via email. Thank you!

    Bernadette