Docx to Text (`docx-to-text`)

Extracts content from Microsoft Word documents, optionally emitting embedded images as separate events.

Transform binary json

YAML

actions:
  - docx-to-text: {}

JSON

{
  "actions": [
    {
      "docx-to-text": {}
    }
  ]
}

Field	Group	Type	Description
`description`	General	`string`	Short summary shown next to the action in the editor.
`condition`	General	`lua-expression` (`string`)	Conditional expression that must evaluate truthy for the action to run. Examples: `2 * count()`
`mode`	Output	`string`	Output mode: markdown
`split`	Output	`string`	Splitting strategy: none
`parser`	Parser	`string`	Parser backend: ooxml (quick-xml).
`include-images`	Advanced	`boolean` (`bool`)	Emit embedded images as separate binary events alongside extracted text.
`preserve-styles`	Advanced	`boolean` (`bool`)	Preserve inline style markers (bold/italic/etc.) in markdown output.
`emit-document-events`	Advanced	`boolean` (`bool`)	Emit a synthetic document-level event with metadata alongside page/paragraph events.
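
As an illustration of how the fields in the table might be combined, here is a sketch of a fuller configuration. It assumes the fields nest directly under the `docx-to-text` key (in place of the empty `{}` in the basic example) and that `ooxml` is the literal value accepted by `parser`; neither is confirmed by this page. The `description` text and the boolean values are made up for the example, and `mode` and `split` use the only values listed here.

```yaml
actions:
  - docx-to-text:
      # General (hypothetical summary text)
      description: Extract contract text and embedded images
      # Output: the only values listed in the field table
      mode: markdown
      split: none
      # Parser: assumed literal for the ooxml (quick-xml) backend
      parser: ooxml
      # Advanced: illustrative choices, not documented defaults
      include-images: true
      preserve-styles: true
      emit-document-events: false
```

With `include-images` enabled, the field description suggests that downstream actions would receive binary image events in addition to the extracted text, so any later step that expects only text may need a `condition` to filter them.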
