Overview
DOCX loader extracts content from Microsoft Word documents (.docx). Supports extraction of text, tables, headers, and footers with flexible formatting options. Loader Class:DOCXLoader
Config Class: DOCXLoaderConfig
Install
Install the DOCX loader optional dependency group:
Examples
Parameters
| Parameter | Type | Description | Default | Source |
|---|---|---|---|---|
encoding | str | None | File encoding (auto-detected if None) | None | Base |
error_handling | "ignore" | "warn" | "raise" | How to handle loading errors | ”warn” | Base |
include_metadata | bool | Whether to include file metadata | True | Base |
custom_metadata | dict | Additional metadata to include | Base | |
max_file_size | int | None | Maximum file size in bytes | None | Base |
skip_empty_content | bool | Skip documents with empty content | True | Base |
include_tables | bool | Include table content | True | Specific |
include_headers | bool | Include header content | True | Specific |
include_footers | bool | Include footer content | True | Specific |
table_format | "text" | "markdown" | "html" | How to format tables | ”text” | Specific |

