Skip to content

Stop Word

Stop Word (stop-word)

Remove or mask common stop words from textual content.

Transform json

Minimal example

actions:
- stop-word: {}
JSON
{
"actions": [
{
"stop-word": {}
}
]
}

Contents

Fields

FieldTypeRequiredDescription
description GeneralstringDescribe this step.
condition Generallua-expression (string)Only run this action if the condition is met.
Examples: 2 * count()
input-field Inputfield (string)Field containing the input text.
Examples: data_field
tokens-field Inputfield (string)Field containing an existing array of tokens to filter.
Examples: data_field
output-field Outputfield (string)Field to write the filtered output to (defaults to input when unset).
Examples: data_field
language BehaviourstringNamed language profile for the built-in stop word lists.
custom Behaviourstring[]Custom stop words to merge with the language profile.
case-sensitive Behaviourboolean (bool)Treat comparisons as case sensitive.
preserve-phrases Behaviourboolean (bool)Preserve multi-word phrases when present in the input.
metrics-field Outputfield (string)Optional field to capture statistics about removals.
Examples: data_field

General

Show fields
FieldTypeRequiredDescription
descriptionstringDescribe this step.
conditionlua-expression (string)Only run this action if the condition is met.
Examples: 2 * count()

Input

Show fields
FieldTypeRequiredDescription
input-fieldfield (string)Field containing the input text.
Examples: data_field
tokens-fieldfield (string)Field containing an existing array of tokens to filter.
Examples: data_field

Output

Show fields
FieldTypeRequiredDescription
output-fieldfield (string)Field to write the filtered output to (defaults to input when unset).
Examples: data_field
metrics-fieldfield (string)Optional field to capture statistics about removals.
Examples: data_field

Behaviour

Show fields
FieldTypeRequiredDescription
languagestringNamed language profile for the built-in stop word lists.
customstring[]Custom stop words to merge with the language profile.
case-sensitiveboolean (bool)Treat comparisons as case sensitive.
preserve-phrasesboolean (bool)Preserve multi-word phrases when present in the input.