Operator ExtractDomain
This operator extracts the domain+tld portion of a fqdn in a tuple, and sets a pre-existing field in the tuple to that value.
Summary
- Ports
- This operator has 2 input ports and 1 output port.
- Windowing
- This operator does not accept any windowing configurations.
- Parameters
- This operator supports 3 parameters.
Required: inputFQDNAttr, outputDomainAttr
Optional: blankOnInvalidTLD
- Metrics
- This operator does not report any metrics.
Properties
- Implementation
- C++
- Threading
- Never - Operator never provides a single threaded execution context.
- Ports (0)
-
Ingests tuples containing FQDNs in the inputFQDNAttr field, extracts the domain, and sets it into the outputDomainAttr field before sending the tuple on.
- Properties
-
- Optional: false
- ControlPort: false
- TupleMutationAllowed: true
- WindowingMode: NonWindowed
- WindowPunctuationInputMode: Oblivious
- Ports (1)
-
Control port that takes in tuples containing TLDs for use in later extractions. This control port can be used to dynamically update the list of TLDs used for extraction. Each time a tuple is received containing a TLD it is saved in a temporary TLD list that is applied after a window punctuation is received on this port. This input port expects a tuple containing a single attribute of type rstring which is a TLD name.
- Properties
-
- Optional: false
- ControlPort: true
- TupleMutationAllowed: false
- WindowingMode: NonWindowed
- WindowPunctuationInputMode: Oblivious
- Assignments
- This operator does not allow assignments to output attributes.
- Ports (0)
-
Submits each input tuple after updating the field indicated by the outputDomainAttr parameter with the domain+TLD extracted from the FQDN field (indicated by the inputFQDNAttr parameter).
- Properties
-
- Optional: false
- TupleMutationAllowed: true
- WindowPunctuationOutputMode: Preserving
Required: inputFQDNAttr, outputDomainAttr
Optional: blankOnInvalidTLD
- blankOnInvalidTLD
-
By default, if the FQDN doesn't match any TLD, the outputDomainAttr field is filled in with the entire incoming FQDN. If blankOnInvalidTLD is set, when no valid TLD is found, the outputDomainAttr field is left blank.
- Properties
-
- Type: boolean
- Cardinality: 1
- Optional: true
- ExpressionMode: Constant
- PortScope: 0
- inputFQDNAttr
-
Specifies the input attribute containing the FQDN that the extraction will be performed on. The supported data type for this attribute is rstring.
- Properties
-
- Type: rstring
- Cardinality: 1
- Optional: false
- ExpressionMode: Expression
- outputDomainAttr
-
Specifies the output attribute to write the extracted domain+TLD data to. Existing data will be overwritten. If the FQDN is malformed, or doesn't match a known TLD, this field will be empty. The supported data type for this attribute is rstring.
- Properties
-
- Type: rstring
- Cardinality: 1
- Optional: false
- ExpressionMode: Attribute