Altair RISE

A program to recognize and reward our most engaged community members
Nominate Yourself Now!

Replace Token Solution for abbreviates that are part of other words

User: "TRSisme05"
New Altair Community Member
Updated by Jocelyn
I am working on a text analysis. There are abbreviations in my original text, such as cust or cust. for customer. I can put the replace token operator before the tokenize operator and enter multiple replacements such as replace cust space with customer and cust. with customer, but I am curious if there is a way to do it after the tokenization because it has "grouped" the cust abbreviations together. I did try placing the replace operator after tokenization but it replaced all occurrences of cust with customer, including the full word customer. Any thoughts/ideas?  thank you for your help.

 

Find more posts tagged with

Sort by:
1 - 1 of 11
    User: "kayman"
    New Altair Community Member
    Accepted Answer
    @lionelderkrikor probably forgot the dot. There are 2 ways to deal with this, either with + or *

    (cust).+$ means you need to have cust followed by at least one character
    (cust).*$ means you need to have cust and optional additional characters. 

    So the last one is probably safer to use