Regular Expressions (Regexes) may be used to manipulate captured data in EzeScan. For example - formatting dates; removing unwanted characters; replacing data with something else.

Regexes have some common anhors, character classes, quantifiers etc. Here are a few:

  • * identifies that there can be none, one or more of the preceding item
    e.g. 0*1 can be used to find 01 or 0001 or 1.
  • + identifies that there must be one or more of the preceding item
    e.g. 0+1 can be used to find 01 or 0001 but not 1.
  • ^ is for the start of the search text, e.g. ^0* would find leading 0's
    This can be used as a replace regex to strip the leading zeros from 000012 to leave 12.
  • . is used to match any character
    e.g ^.*[0-9]$ would find any numeric value or any value that ends with a numeric.
  • $ is for the end of the search text
    e.g. -$ would remove a trailing dash, e.g. 1234- would become 1234.
  • \b is used to identify the start and end of a word
    e.g. \b\d{2}\b would find 12 in 12 Years.
  • | is the OR separator this means we can look for multiple words
    e.g a 2 digit word or a 3 digit word would be \b\d{2}\b|\b\d{3}\b
  • \is an escape character. This is in case you may need to remove a character that is used in regex codes
    e.g. to remove || at the end of a value you can't do ||$ you need to do \|\|$.

These are a few examples of Regexes commonly used within EzeScan:

TIP: The REGEX's in this section may be copied and pasted straight into the EzeScan REGEX editor. Drag your mouse across the values from the first " to the last "
for example "^0*(\d*)-(\d*)( - .*)","$1$2$3" The blue text represent the values to be replaced and the red text represents what the text is being replaced with the blue text.

  • Replace the 3rd and 6th character with a /
    This is handy if an OCR has picked up the / in a date as a 1 e.g. 12112112 will become 12/12/12
    Regex = "^(..).(..).","$1/$2/"
  • Keep the first value where it is delimited with two pipes (|)
    e.g. PO1234||0||1 will become P01234
    Regex = "^([^\|]+).*$","$1"
  • This will add a .00 if there is no decimal value. If there is a decimal value it will leave as is
    e.g. 100 will become 100.00 but 100.45 would remain as 100.45
    Regex = "(?!\s-?\d+\.\d+|^-?\d+\.\d+)(\s-?\d+\b|^-?\d+\b)","$1.00"
  • Clear out the whole value if it ends with a \
    e.g. "ezescan\" will become blank (null) whereas "ezescan" will remain as "ezescan"
    Regex = "^.*\\+$",""
  • Set a value to error if a numeric value has a minus
    e.g. 1 or 1.00 will be OK whereas -1 or -1.00 will change the value to error
    Regex = "-[0-9]+\.?[0-9]*","error"
  • Remove the first two characters out of a value
    e.g. "BA123456" will become "123456"
    Regex="^..",""
    Tip - add more dots to remove more characters
  • Convert a HP RM/TRIM KFI browse value to just output the first name and last name
    e.g. "Citizen, John (Mr) -lu 1660" will become "Jon Citizen"
    Regex = "-(lu [0-9]+)",""," *([(][^()]*[)])","","([^,]+),([^,]+)","$2$1","^ *",""
  • To remove the word VIC and any words after it
    e.g. "1 Smith St VIC" will become "1 Smith St"
    Regex = "^(.*)\sVIC\s.*$","$1"n
    Note - Change the value VIC to NSW or QLD etc for other states

If you are new to using Regexes we suggest you take a look at these resources on the internet.

1. http://www.cheatography.com/davechild/cheat-sheets/regular-expressions/

Regular Expressions Cheat Sheet by Dave Child

A quick reference guide for regular expressions (regex), including symbols, ranges, grouping, assertions and some sample patterns to get you started.

2. http://www.regexbuddy.com/

Learn, Create, Understand, Test, Use and Save Regular Expressions with RegexBuddy

You could buy a copy of RegexBuddy from the programs Authors.

Install and license it onto your PC where EzeScan  is installed.

Then start EzeScan, edit a KFI field and use one of our Regex Edit buttons to startup our 'Edit Regex' form.

For example the window below appears when working with a KFI Discovery field.

Then press the 'Editor' button (if it's grayed out tick the 'Use' button below it to enable it).

Ezescan will launch RegexBuddy if you have it installed on your PC.

Any regex entered into RegexBuddy will be automatically transferred to the corresponding regex field on the EzeScan Edit Regex form when you close RegexBuddy.

 

Direct link to FAQ