|
Pattern |
Description |
|
{alpha} |
Set of all alphabets [A-Za-z] |
|
{alpha_numeric} |
Set of all alphabets and numbers [A-Za-z0-9] |
|
{lower} |
Set of all lower characters [a-z] |
|
{upper} |
Set of all upper characters [A-Z] |
|
{digit} |
Set of all integers [0-9] |
|
{number} |
Set of all integers and decimal numbers |
|
{special_character} |
Set of all special characters [e.g. -/,*&^%#@! etc.] |
|
{white_space} |
A white space character [' '] |
|
{any} |
Set of all characters |
|
{other} |
Set of all non-ASCII characters |
|
{'constant'} |
Matches the given text enclosed inside single quotes or double quotes |
|
{word} |
Set of all alphabets, numbers and underscore i.e. {alpha_numeric} (including underscore) |
|
{username} |
Characters prefixed with @ |
|
{hashtag} |
Characters prefixed with # |
|
Pattern |
Description |
|
{start} |
Start of the line |
|
{end} |
End of the line |
Input data |
DataPrep supports pattern matching. |
Pattern |
{start}{alpha}+ |
Matched data |
DataPrep |
Input data |
+1 340 |
Pattern |
{digit}{3}{end} |
Matched data |
340 |
Input data |
1234-1234-1234-1234 |
Pattern |
{start}{digit}{4}{special_character}{digit}{4}{special_character}{digit}{4}{special_character}{digit}{4}{end} |
Matched data |
1234-1234-1234-1234 |
Note: You can include the {start} and {end} tokens to match the data only when the pattern is found across the entire cell value.