Docx to Text (docx-to-text)
Extract content from Microsoft Word documents with optional image emission.
Transform binary json
Minimal example
JSON
Contents
Fields
| Field | Type | Required | Description |
|---|
description General | string | | Short summary shown next to the action in the editor. |
condition General | lua-expression (string) | | Conditional expression that must evaluate truthy for the action to run. Examples: 2 * count() |
mode Output | string | | Output mode: markdown |
split Output | string | | Splitting strategy: none |
parser Parser | string | | Parser backend: ooxml (quick-xml). |
include-images Advanced | boolean (bool) | | Emit embedded images as separate binary events alongside extracted text. |
preserve-styles Advanced | boolean (bool) | | Preserve inline style markers (bold/italic/etc.) in markdown output. |
emit-document-events Advanced | boolean (bool) | | Emit a synthetic document-level event with metadata alongside page/paragraph events. |
General
Show fields
| Field | Type | Required | Description |
|---|
description | string | | Short summary shown next to the action in the editor. |
condition | lua-expression (string) | | Conditional expression that must evaluate truthy for the action to run. Examples: 2 * count() |
Output
Show fields
| Field | Type | Required | Description |
|---|
mode | string | | Output mode: markdown |
split | string | | Splitting strategy: none |
Parser
Show fields
| Field | Type | Required | Description |
|---|
parser | string | | Parser backend: ooxml (quick-xml). |
Advanced
Show fields
| Field | Type | Required | Description |
|---|
include-images | boolean (bool) | | Emit embedded images as separate binary events alongside extracted text. |
preserve-styles | boolean (bool) | | Preserve inline style markers (bold/italic/etc.) in markdown output. |
emit-document-events | boolean (bool) | | Emit a synthetic document-level event with metadata alongside page/paragraph events. |