|
Pattern
|
Description
|
|
{alpha}
|
Set of all alphabets [A-Za-z]
|
|
{alpha_numeric}
|
Set of all alphabets and numbers [A-Za-z0-9]
|
|
{lower}
|
Set of all lower characters [a-z]
|
|
{upper}
|
Set of all upper characters [A-Z]
|
|
{digit}
|
Set of all integers [0-9]
|
|
{number}
|
Set of all integers and decimal numbers
|
|
{special_character}
|
Set of all special characters [e.g. -/,*&^%#@! etc.]
|
|
{white_space}
|
A white space character [' ']
|
|
{any}
|
Set of all characters
|
|
{other}
|
Set of all non-ASCII characters
|
|
{'constant'}
|
Matches the given text enclosed inside single quotes or double quotes
|
|
{word}
|
Set of all alphabets, numbers and underscore i.e. {alpha_numeric} (including underscore)
|
|
{username}
|
Characters prefixed with @
|
|
{hashtag}
|
Characters prefixed with #
|
|
Pattern
|
Description
|
|
{start}
|
Start of the line
|
|
{end}
|
End of the line
|
|
Input data |
DataPrep supports pattern matching. |
|
Pattern |
{start}{alpha}+ |
|
Matched data |
DataPrep |
|
Input data |
+1 340 |
|
Pattern |
{digit}{3}{end} |
|
Matched data |
340 |
|
Input data |
1234-1234-1234-1234 |
|
Pattern |
{start}{digit}{4}{special_character}{digit}{4}{special_character}{digit}{4}{special_character}{digit}{4}{end} |
|
Matched data |
1234-1234-1234-1234 |
Note: You can include the {start} and {end} tokens to match the data only when the pattern is found across the entire cell value.