ScanContent 2.3.0

BUNDLE

org.apache.nifi | nifi-standard-nar

DESCRIPTION

Scans the content of FlowFiles for terms that are found in a user-supplied dictionary. If a term is matched, the UTF-8 encoded version of the term will be added to the FlowFile using the ‘matching.term’ attribute

TAGS

aho-corasick, byte sequence, content, dictionary, find, scan, search

INPUT REQUIREMENT

REQUIRED

Supports Sensitive Dynamic Properties

false

PROPERTIES

Property

Description

Dictionary Encoding

Indicates how the dictionary is encoded. If ‘text’, dictionary terms are new-line delimited and UTF-8 encoded; if ‘binary’, dictionary terms are denoted by a 4-byte integer indicating the term length followed by the term itself

Dictionary File

The filename of the terms dictionary

RELATIONSHIPS

NAME

DESCRIPTION

unmatched

FlowFiles that do not match any term in the dictionary are routed to this relationship

matched

FlowFiles that match at least one term in the dictionary are routed to this relationship

WRITES ATTRIBUTES

NAME

DESCRIPTION

matching.term

The term that caused the Processor to route the FlowFile to the ‘matched’ relationship; if FlowFile is routed to the ‘unmatched’ relationship, this attribute is not added

Language: English