MapPattern

This Script is part of the Filters And Transformers Pack.

Supported versions

Supported Cortex XSOAR versions: 6.5.0 and later.

This transformer takes in a value and transforms it based on multiple condition expressions (wildcard, regex, etc.) defined in a JSON dictionary structure. Each key:value pair of the JSON dictionary should be:

"condition expression": "desired outcome"

For example:

{
".*match 1.*": "Dest Val1",
".*match 2.*": "Dest Val2",
".*match 3(.*)": "\\1",
"*match 4*": {
"algorithm": "wildcard",
"output": "Dest Val4"
}
}

The transformer returns the outcome of the matching pattern, chosen according to the priority. When nothing matches, or when the input value is structured (a dict or list), it simply returns the input value.
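
For illustration only, the following minimal Python sketch shows how such a mapping behaves conceptually, assuming the regex algorithm and first_match priority (the function name and the simplifications are not the script's actual implementation):

```python
import re

def map_pattern_sketch(value, mappings):
    """Hypothetical helper: return the outcome of the first regex that fully
    matches `value`, expanding backreferences such as "\\1" in the outcome."""
    if isinstance(value, (dict, list)):   # structured input is returned unchanged
        return value
    for pattern, outcome in mappings.items():
        m = re.fullmatch(pattern, str(value))
        if m:
            return m.expand(outcome) if isinstance(outcome, str) else outcome
    return value                          # unmatched: return the input value

mappings = {
    ".*match 1.*": "Dest Val1",
    ".*match 3(.*)": "\\1",
}
print(map_pattern_sketch("xx match 3 yy", mappings))  # " yy"
```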

Script Data


| Name | Description |
| --- | --- |
| Script Type | python3 |
| Tags | transformer, string |

Inputs


| Argument Name | Description |
| --- | --- |
| value | The value to modify. |
| mappings | A JSON dictionary, or a list of dictionaries, whose key:value pairs represent "Condition": "Outcome". |
| algorithm | The default algorithm for pattern matching. Available algorithms: literal, wildcard, regex, regmatch, and dt. |
| caseless | Set to true for caseless (case-insensitive) comparison, false otherwise. |
| priority | Which matched value to return. Available options: first_match (default) and last_match. |
| context | The context: enter . (a single dot) in `From previous tasks` to enable extraction of the context data. |
| flags | Comma-separated flags for pattern matching in regex: dotall (s), multiline (m), ignorecase (i), and unicode (u) are supported. These apply to all the algorithms. |
| compare_fields | Set to true to pattern-match each field of a structured value, false otherwise. |
| wildcards | A list of special patterns that match any value regardless of the algorithm. |
| default_value | The value to return when none of the patterns is satisfied. |

Outputs


There are no outputs for this script.


Syntax for mappings

mappings ::= pattern-mapping | field-mapping
# `field-mapping` must be used when `compare_fields` is set to true; otherwise `pattern-mapping` is used.
pattern-mapping ::= list-pattern-mapping | base-pattern-mapping
list-pattern-mapping ::= List[base-pattern-mapping]
base-pattern-mapping ::= Dict[pattern, repl]
field-mapping ::= Dict[field-name, pattern-mapping]
field-name ::= str
pattern ::= str # The pattern string, matched against the value according to the given algorithm.
repl ::= output-str | config
output-str ::= str # The data that replaces the value.
# - Backslash group substitution in the template string is available with `regex`.
# - DT syntax (${}) is available when `context` is enabled.
# - In DT syntax, `${.}` refers to the value given in the inputs, and `${.<name>}` refers to the property located at the relative path to it.
#   `${..}` refers to the value being evaluated, and `${..<name>}` refers to the property located at the relative path to it.
output-any ::= output-str | Any # The data that replaces the value.
# `null` is a special value that stands for the input value given to this transformer.
algorithm ::= "literal" | "wildcard" | "regex" | "regmatch" | "dt"
comp-fields ::= List[field] | comma-separated-fields
comma-separated-fields ::= str # Comma-separated field names.
config ::= Dict[str, Any]
The structure is:
{
"algorithm": algorithm, # (Optional) The algorithm to pattern matching.
"output": output-any, # (Optional) The data to replace to the value by the pattern.
"exclude": pattern | List[pattern], # (Optional) Patterns to exclude in the pattern matching.
"ignore_syntax": bool # (Optional) Set to true if you want to ignore syntax errors to the pattern.
"next": mappings # (Optional) Subsequent conditions to do the pattern matching with the value taken from the output.
}

Pattern Matching

When you choose dt as the algorithm, the value generated by a DT expression is treated as unmatched when it evaluates to false in a Python boolean context; otherwise it is treated as matched. In Python, null (None), boolean False, the integer 0, an empty string, an empty list, and an empty dict are considered false.
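
For reference, a quick check of the Python values listed above (each of these, if produced by a dt expression, would count as unmatched):

```python
# Values treated as false in a Python boolean context, hence "unmatched" for dt.
falsy_results = [None, False, 0, "", [], {}]
print(all(not bool(v) for v in falsy_results))  # True
```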


Examples


Transform a severity name to the corresponding number.

algorithm: regmatch

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"Unknown": 0,
"Informational|Info": 0.5,
"Low": 1,
"Medium": 2,
"High": 3,
"Critical": 4
}
| Input | Output |
| --- | --- |
| High | 3 |
| Informational | 0.5 |
| Info | 0.5 |
| Abc | Abc |
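
A hedged Python sketch of what this lookup does, assuming regmatch behaves like re.match and caseless maps to re.IGNORECASE (illustration only, not the script's code):

```python
import re

severity_map = {
    "Unknown": 0,
    "Informational|Info": 0.5,
    "Low": 1,
    "Medium": 2,
    "High": 3,
    "Critical": 4,
}

def to_severity(text):
    # First pattern that matches wins (first_match); otherwise return the input.
    for pattern, score in severity_map.items():
        if re.match(pattern, text, re.IGNORECASE):
            return score
    return text

print(to_severity("High"))  # 3
print(to_severity("Abc"))   # Abc
```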

Normalize a human-readable phrase to a canonical name.

algorithm: wildcard

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"*Low*": "low",
"*Medium*": "medium",
"*High*": "high",
"*": "unknown"
}
| Input | Output |
| --- | --- |
| 1 - Low | low |
| Medium | medium |
| high (3) | high |
| infomation | unknown |
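
A minimal sketch of the same lookup, assuming wildcard patterns follow shell-style globbing (fnmatch) and emulating caseless by lowercasing both sides; the function is illustrative, not the script's implementation:

```python
from fnmatch import fnmatch

wildcard_map = {
    "*Low*": "low",
    "*Medium*": "medium",
    "*High*": "high",
    "*": "unknown",
}

def normalize_severity(text):
    # First matching wildcard wins; lowercase both sides to emulate caseless.
    for pattern, outcome in wildcard_map.items():
        if fnmatch(text.lower(), pattern.lower()):
            return outcome
    return text

print(normalize_severity("1 - Low"))   # low
print(normalize_severity("high (3)"))  # high
```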

Remove all leading "Re:" or "Fw:" prefixes from an email subject.

algorithm: regex

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"( *(Re: *|Fw: *)*)(.*)": "\\3"
}
| Input | Output |
| --- | --- |
| Re: Re: Fw: Hello! | Hello! |
| Hello! | Hello! |
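
The \\3 outcome is a backreference into the groups captured by the pattern; a small sketch of that substitution step, assuming full-match regex semantics:

```python
import re

pattern, repl = "( *(Re: *|Fw: *)*)(.*)", "\\3"
m = re.fullmatch(pattern, "Re: Re: Fw: Hello!", re.IGNORECASE)
if m:
    print(m.expand(repl))  # Hello!
```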

Extract the user name field from a text in an Active Directory user account format.

algorithm: regex

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"([^@]+)@.+": "\\1",
"[^\\\\]+\\\\(.+)": "\\1",
"[a-zA-Z_]([0-9a-zA-Z\\.-_]*)": null,
".*": "<unknown>"
}
| Input | Output |
| --- | --- |
| username@domain | username |
| domain\username | username |
| username | username |
| 012abc$ | <unknown> |

Extract the user name field from a quoted text in an Active Directory user account format.

algorithm: regex

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"\"(.*)\"": {
"output": "\\1",
"next": {
"([^@]+)@.+": "\\1",
"[^\\\\]+\\\\(.+)": "\\1",
"[a-zA-Z_]([0-9a-zA-Z\\.-_]*)": "\\0",
".*": "<unknown>"
}
},
"([^@]+)@.+": "\\1",
"[^\\\\]+\\\\(.+)": "\\1",
"[a-zA-Z_]([0-9a-zA-Z\\.-_]*)": null,
".*": "<unknown>"
}
| Input | Output |
| --- | --- |
| "username@domain" | username |
| username@domain | username |
| "domain\username" | username |
| domain\username | username |
| "username" | username |
| username | username |
| 012abc$ | <unknown> |
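
The next key re-runs the matching against the value produced by output, which is how the quotes are stripped before the inner patterns are applied. A simplified sketch of that chaining under assumed full-match, first_match semantics (plain strings and config dicts only; null handling omitted):

```python
import re

def apply_mapping(value, mappings):
    # One matching pass; a config dict may chain a second pass via "next".
    for pattern, repl in mappings.items():
        m = re.fullmatch(pattern, value, re.IGNORECASE)
        if not m:
            continue
        if isinstance(repl, dict):                           # config form
            out = m.expand(repl["output"])
            return apply_mapping(out, repl["next"]) if "next" in repl else out
        return m.expand(repl)                                # plain output string
    return value

inner = {"([^@]+)@.+": "\\1", ".*": "<unknown>"}
outer = {'"(.*)"': {"output": "\\1", "next": inner}}
print(apply_mapping('"username@domain"', {**outer, **inner}))  # username
print(apply_mapping("username@domain", {**outer, **inner}))    # username
```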

Extract the first and last name from an email address in the firstname.lastname@domain format, except for some particular domains that use lastname.firstname@domain instead.

algorithm: regex

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

[
{
"([^.]+)\\.([^@]+)@.+": {
"exclude": ".*@example2.com",
"output": "\\1 \\2"
}
},
{
"([^.]+)\\.([^@]+)@.+": "\\2 \\1",
"([^@]+)@.+": "\\1"
}
]
| Input | Output |
| --- | --- |
| john.doe@example1.com | john doe |
| doe.john@example2.com | john doe |
| username@example1.com | username |
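
The exclude key drops a candidate match when the value also matches one of the excluded patterns; a rough sketch of just that check (illustrative names, full-match semantics assumed):

```python
import re

def matches(value, pattern, exclude=None):
    # A pattern counts as matched only if the value does NOT also match
    # any of the exclude patterns.
    if not re.fullmatch(pattern, value, re.IGNORECASE):
        return False
    excludes = [exclude] if isinstance(exclude, str) else (exclude or [])
    return not any(re.fullmatch(x, value, re.IGNORECASE) for x in excludes)

print(matches("john.doe@example1.com", r"([^.]+)\.([^@]+)@.+", ".*@example2.com"))  # True
print(matches("doe.john@example2.com", r"([^.]+)\.([^@]+)@.+", ".*@example2.com"))  # False
```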

Normalize a date/time text to YYYY-MM-DD HH:mm:ss TZ.

algorithm: regex

caseless: true

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"(\\d{4})-(\\d{2})-(\\d{2})T(\\d{2}):(\\d{2}):(\\d{2})(\\.\\d+)?Z": "\\1-\\2-\\3 \\4:\\5:\\6 GMT",
"[^,]+, (\\d{1,2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": {
"output": "\\2",
"next": {
"Jan": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-01-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-01-\\1 \\4:\\5:\\6 \\7"
}
},
"Feb": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-02-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-02-\\1 \\4:\\5:\\6 \\7"
}
},
"Mar": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-03-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-03-\\1 \\4:\\5:\\6 \\7"
}
},
"Apr": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-04-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-04-\\1 \\4:\\5:\\6 \\7"
}
},
"May": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-05-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-05-\\1 \\4:\\5:\\6 \\7"
}
},
"Jun": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-06-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-06-\\1 \\4:\\5:\\6 \\7"
}
},
"Jul": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-07-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-07-\\1 \\4:\\5:\\6 \\7"
}
},
"Aug": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-08-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-08-\\1 \\4:\\5:\\6 \\7"
}
},
"Sep": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-09-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-09-\\1 \\4:\\5:\\6 \\7"
}
},
"Oct": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-10-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-10-\\1 \\4:\\5:\\6 \\7"
}
},
"Nov": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-11-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-11-\\1 \\4:\\5:\\6 \\7"
}
},
"Dec": {
"output": null,
"next": {
"[^,]+, (\\d) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-12-0\\1 \\4:\\5:\\6 \\7",
"[^,]+, (\\d{2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": "\\3-12-\\1 \\4:\\5:\\6 \\7"
}
}
}
}
}
| Input | Output |
| --- | --- |
| 2021-01-02T01:23:45.010Z | 2021-01-02 01:23:45 GMT |
| 2021-01-02T01:23:45Z | 2021-01-02 01:23:45 GMT |
| Tue, 3 Jun 2008 11:05:30 GMT | 2008-06-03 11:05:30 GMT |

Normalize a date/time text to YYYY-MM-DD HH:mm:ss TZ.

algorithm: regex

caseless: true

priority: first_match

context: . [From previous tasks]

flags:

compare_fields:

wildcards:

mappings:

{
"(\\d{4})-(\\d{2})-(\\d{2})T(\\d{2}):(\\d{2}):(\\d{2})(\\.\\d+)?Z": "\\1-\\2-\\3 \\4:\\5:\\6 GMT",
"[^,]+, (\\d{1,2}) ([^ ]+) (\\d{4}) (\\d{2}):(\\d{2}):(\\d{2}) ([^ ]+)": {
"output": {
"year": "\\3",
"month": "${.={Jan:'01', Feb: '02', Mar:'03', Apr:'04', May:'05', Jun:'06', Jul:'07', Aug:'08', Sep:'09', Oct:'10', Nov:'11', Dec:'12'}['\\2']}",
"day": "${.=('0'+'\\1').slice(-2)}",
"hour": "\\4",
"minute": "\\5",
"second": "\\6",
"tz": "\\7"
},
"next": {
"..month=val > 0": {
"algorithm": "dt",
"output": "${..year}-${..month}-${..day} ${..hour}:${..minute}:${..second} ${..tz}"
}
}
}
}
| Input | Output |
| --- | --- |
| 2021-01-02T01:23:45.010Z | 2021-01-02 01:23:45 GMT |
| 2021-01-02T01:23:45Z | 2021-01-02 01:23:45 GMT |
| Tue, 3 Jun 2008 11:05:30 GMT | 2008-06-03 11:05:30 GMT |

Pattern matching for different fields of the input.

algorithm: wildcard

caseless: true

priority: first_match

context:

flags:

compare_fields: true

wildcards:

mappings:

{
"IP": {
"127.*": "localhost"
},
"Host": {
"localhost": "localhost",
"*.local": "localhost",
"*": "other"
}
}
| Input | Output |
| --- | --- |
| {"IP": "127.0.0.1"} | "localhost" |
| {"Host": "localhost"} | "localhost" |
| {"Host": "paloaltonetworks.local"} | "localhost" |
| {"IP": "192.168.1.1"} | "other" |

Build a text from the value field according to the score field.

algorithm: regex

caseless: true

priority: first_match

context: . [From previous tasks]

flags:

compare_fields: true

wildcards: *

mappings:

{
"score": {
"1": "low - ${.value}",
"2": "medium - ${.value}",
"3": "high - ${.value}",
"*": "unknown - ${.value}"
}
}
| Input | Output |
| --- | --- |
| {"score": 1, "value": "192.168.1.1"} | "low - 192.168.1.1" |
| {"score": 4, "value": "192.168.1.1"} | "unknown - 192.168.1.1" |

Build a text from the value field according to the score field.

algorithm: dt

caseless:

priority: first_match

context: . [From previous tasks]

flags:

compare_fields: true

wildcards: *

mappings:

{
"score": {
"...=val < 30": "low - ${.value}",
"...=val < 50": "medium - ${.value}",
"...=val >= 50": "high - ${.value}",
"*": "unknown - ${.value}"
}
}
| Input | Output |
| --- | --- |
| {"score": 10, "value": "192.168.1.1"} | "low - 192.168.1.1" |
| {"score": 40, "value": "192.168.1.1"} | "medium - 192.168.1.1" |
| {"score": 70, "value": "192.168.1.1"} | "high - 192.168.1.1" |
| {"score": "x", "value": "192.168.1.1"} | "unknown - 192.168.1.1" |

Make a phrase based on the values of score and type.

algorithm: dt

caseless:

priority: first_match

context: . [From previous tasks]

flags:

compare_fields: true

wildcards: *

mappings:

{
"score": {
"...=val < 30": {
"next": {
"type": {
"IP": {
"algorithm": "literal",
"output": "benign IP"
},
"*": "low"
}
}
},
"...=val < 50": {
"next": {
"type": {
"IP": {
"algorithm": "literal",
"output": "suspicious IP"
},
"*": "medium"
}
}
},
"...=val >= 50": {
"next": {
"type": {
"IP": {
"algorithm": "literal",
"output": "malicious IP"
},
"*": "high"
}
}
},
"*": "unknown - ${.value}"
}
}
| Input | Output |
| --- | --- |
| {"score": 70, "value": "192.168.1.1", "type": "IP"} | "malicious IP" |
| {"score": 10, "value": "paloaltonetworks.com", "type": "domain"} | "low" |
| {"score": "x", "value": "192.168.1.1"} | "unknown - 192.168.1.1" |

Check if the date is a leap day.

algorithm: regex

caseless:

priority: first_match

context:

flags:

compare_fields:

wildcards:

mappings:

{
"(Jan|Mar|May|Jul|Aug|Oct|Dec) (\\d\\d?), \\d{4}": {
"output": "\\2",
"next": {
"...=val <= 31": {
"algorithm": "dt",
"output": false
}
}
},
"(Apr|Jun|Sep|Nov) (\\d\\d?), \\d{4}": {
"output": "\\2",
"next": {
"...=val <= 30": {
"algorithm": "dt",
"output": false
}
}
},
"Feb (\\d\\d?), (\\d{4})": {
"output": {
"day": "\\1",
"year": "\\2"
},
"next": {
"...=val.day <= 28": {
"algorithm": "dt",
"output": false
},
"...=val.day == 29 && (val.year % 4) == 0 && !((val.year % 100) == 0 && (val.year % 400) != 0)": {
"algorithm": "dt",
"output": true
}
}
}
}
| Input | Output |
| --- | --- |
| Jun 6, 2021 | false |
| Feb 29, 2000 | true |
| Feb 29, 2004 | true |
| Feb 29, 2001 | Feb 29, 2001 |
| Jun 32, 2021 | Jun 32, 2021 |
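
The final dt condition encodes the Gregorian leap-year rule; restated in plain Python for clarity (the function name is illustrative):

```python
def is_leap_day(day, year):
    # Feb 29 exists only in years divisible by 4, excluding century years
    # that are not also divisible by 400.
    return day == 29 and year % 4 == 0 and not (year % 100 == 0 and year % 400 != 0)

print(is_leap_day(29, 2000))  # True
print(is_leap_day(29, 2004))  # True
print(is_leap_day(29, 2001))  # False
```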