Run any Skill in Manus with one click

docx

Use this skill whenever the user wants to create, edit, or extract content from Word documents (.docx files). Includes creating new documents, editing existing ones, extracting text, working with tables, images, tracked changes, headers/footers, and format conversion.

Run Skill in Manus

Overview

Install command

npx skills add https://github.com/icub3d/dotfiles --skill docx

Copy and paste this command into Claude Code to install the skill

Source

icub3d/dotfiles

Stars0

Forks0

UpdatedJune 3, 2026 at 22:01

SKILL.md

readonly

name	docx
description	Use this skill whenever the user wants to create, edit, or extract content from Word documents (.docx files). Includes creating new documents, editing existing ones, extracting text, working with tables, images, tracked changes, headers/footers, and format conversion.

DOCX Processing

Strategy by Task

Task	Approach
Create new document	JavaScript `docx` npm package
Edit existing document	Unpack XML → edit → repack
Extract text	`pandoc` or unpack
Convert to other formats	`pandoc`

Text Extraction

# Best option — pandoc converts to plain text or markdown
pandoc input.docx -t plain -o output.txt
pandoc input.docx -t markdown -o output.md

# Python alternative
uv run --with python-docx - <<'EOF'
from docx import Document
doc = Document("input.docx")
for para in doc.paragraphs:
    print(para.text)
EOF

Creating New Documents (docx npm)

Preferred for new documents — richer API than python-docx.

npm install docx

const { Document, Packer, Paragraph, TextRun, Table, TableRow, TableCell, WidthType } = require("docx");
const fs = require("fs");

const doc = new Document({
    sections: [{
        properties: {
            page: {
                size: { width: 12240, height: 15840 },  // US Letter in DXA (1440 DXA = 1 inch)
            },
        },
        children: [
            new Paragraph({
                children: [new TextRun({ text: "Hello World", bold: true })],
            }),
        ],
    }],
});

Packer.toBuffer(doc).then(buffer => fs.writeFileSync("output.docx", buffer));

Critical Rules for docx-js

Page size: Default is A4; use { width: 12240, height: 15840 } for US Letter
Tables: Set both columnWidths array AND individual cell width with WidthType.DXA — never use percentage widths (breaks Google Docs compatibility)
Bullet lists: Use LevelFormat.BULLET with numbering config; never insert Unicode bullet characters •
Page breaks: Must be nested inside Paragraph elements
Images: Require explicit type parameter: png, jpg, gif, bmp, or svg

Editing Existing Documents (XML)

.docx files are ZIP archives — unpack, edit XML, repack:

# Unpack
mkdir unpacked && cp input.docx unpacked/input.zip
cd unpacked && unzip input.zip -d contents

# Edit word/document.xml (main body), word/styles.xml, etc.

# Repack
cd contents && zip -r ../output.docx .

XML Editing Standards

Use smart quotes as XML entities: ’ (apostrophe), “/” (open/close quotes)
Tracked changes: <w:ins> for insertions, <w:del> for deletions, preserving <w:rPr> formatting blocks
Comments: marker elements are siblings of text runs, never nested inside them

# Python for XML manipulation
uv run --with lxml - <<'EOF'
from lxml import etree
tree = etree.parse("unpacked/contents/word/document.xml")
# ... modify tree ...
tree.write("unpacked/contents/word/document.xml", xml_declaration=True, encoding="UTF-8")
EOF

Python (python-docx) for Simple Tasks

For straightforward creation/editing when the docx npm approach is overkill:

uv run --with python-docx - <<'EOF'
from docx import Document
from docx.shared import Inches, Pt

doc = Document()
doc.add_heading("Title", 0)
doc.add_paragraph("Body text.")

table = doc.add_table(rows=2, cols=3)
table.cell(0, 0).text = "Header"

doc.save("output.docx")
EOF

Format Conversion (pandoc)

# docx → PDF (requires LaTeX or LibreOffice)
pandoc input.docx -o output.pdf

# docx → HTML
pandoc input.docx -o output.html

# Markdown → docx
pandoc input.md -o output.docx

# With a reference template
pandoc input.md --reference-doc=template.docx -o output.docx

Arch package: sudo pacman -S pandoc

More from this repository

same repository

git-workflow

icub3d/dotfiles

Dotfiles-specific git workflow manager. Use when committing, branching, managing symlink history, rebasing, or handling the multi-machine/multi-platform nature of this repo.

2026-06-040

kubernetes-operator

icub3d/dotfiles

Kubernetes cluster operator for the Marshian Galaxy home lab. Use when managing workloads, namespaces, node operations, Cilium networking, or kubectl/helm workflows on the k8s0–k8s4 cluster.

2026-06-040

rust-smith

icub3d/dotfiles

Specialized in Rust toolchains, cargo config optimizations, crate profiling, build script setups, and memory/performance optimizations (including the cached crate memoization strategy). Use when developing or optimizing Rust projects.

2026-06-040

ssh-config-manager

icub3d/dotfiles

SSH config manager for the Marshian Galaxy multi-host setup. Use when adding/editing Host blocks, configuring ProxyJump chains, managing identities, or debugging SSH connectivity to cluster nodes, VMs, and git.marsh.gg.

2026-06-040

systemd-manager

icub3d/dotfiles

Manages systemd user and system unit files in the dotfiles repo — services, timers, socket activation, device dependencies, and install/reload workflows. Use when creating or editing .service, .timer, or .socket files in helpers/ or dotfiles/.config/systemd/.

2026-06-040

wireguard-configurator

icub3d/dotfiles

WireGuard VPN configuration manager for the Marshian Galaxy. Use when adding/removing peers, rotating keys, editing interface config on the wireguard VM, or debugging VPN connectivity.

2026-06-040

Source

icub3d

icub3d/dotfiles

View GitHub Repository View Creator Repositories

Install command

Download

Run Skill in Manus

name	docx
description	Use this skill whenever the user wants to create, edit, or extract content from Word documents (.docx files). Includes creating new documents, editing existing ones, extracting text, working with tables, images, tracked changes, headers/footers, and format conversion.

DOCX Processing

Strategy by Task

Task	Approach
Create new document	JavaScript `docx` npm package
Edit existing document	Unpack XML → edit → repack
Extract text	`pandoc` or unpack
Convert to other formats	`pandoc`

Text Extraction

# Best option — pandoc converts to plain text or markdown
pandoc input.docx -t plain -o output.txt
pandoc input.docx -t markdown -o output.md

# Python alternative
uv run --with python-docx - <<'EOF'
from docx import Document
doc = Document("input.docx")
for para in doc.paragraphs:
    print(para.text)
EOF

Creating New Documents (docx npm)

Preferred for new documents — richer API than python-docx.

npm install docx

const { Document, Packer, Paragraph, TextRun, Table, TableRow, TableCell, WidthType } = require("docx");
const fs = require("fs");

const doc = new Document({
    sections: [{
        properties: {
            page: {
                size: { width: 12240, height: 15840 },  // US Letter in DXA (1440 DXA = 1 inch)
            },
        },
        children: [
            new Paragraph({
                children: [new TextRun({ text: "Hello World", bold: true })],
            }),
        ],
    }],
});

Packer.toBuffer(doc).then(buffer => fs.writeFileSync("output.docx", buffer));

Critical Rules for docx-js

Page size: Default is A4; use { width: 12240, height: 15840 } for US Letter
Tables: Set both columnWidths array AND individual cell width with WidthType.DXA — never use percentage widths (breaks Google Docs compatibility)
Bullet lists: Use LevelFormat.BULLET with numbering config; never insert Unicode bullet characters •
Page breaks: Must be nested inside Paragraph elements
Images: Require explicit type parameter: png, jpg, gif, bmp, or svg

Editing Existing Documents (XML)

.docx files are ZIP archives — unpack, edit XML, repack:

# Unpack
mkdir unpacked && cp input.docx unpacked/input.zip
cd unpacked && unzip input.zip -d contents

# Edit word/document.xml (main body), word/styles.xml, etc.

# Repack
cd contents && zip -r ../output.docx .

XML Editing Standards

Use smart quotes as XML entities: ’ (apostrophe), “/” (open/close quotes)
Tracked changes: <w:ins> for insertions, <w:del> for deletions, preserving <w:rPr> formatting blocks
Comments: marker elements are siblings of text runs, never nested inside them

# Python for XML manipulation
uv run --with lxml - <<'EOF'
from lxml import etree
tree = etree.parse("unpacked/contents/word/document.xml")
# ... modify tree ...
tree.write("unpacked/contents/word/document.xml", xml_declaration=True, encoding="UTF-8")
EOF

Python (python-docx) for Simple Tasks

For straightforward creation/editing when the docx npm approach is overkill:

uv run --with python-docx - <<'EOF'
from docx import Document
from docx.shared import Inches, Pt

doc = Document()
doc.add_heading("Title", 0)
doc.add_paragraph("Body text.")

table = doc.add_table(rows=2, cols=3)
table.cell(0, 0).text = "Header"

doc.save("output.docx")
EOF

Format Conversion (pandoc)

# docx → PDF (requires LaTeX or LibreOffice)
pandoc input.docx -o output.pdf

# docx → HTML
pandoc input.docx -o output.html

# Markdown → docx
pandoc input.md -o output.docx

# With a reference template
pandoc input.md --reference-doc=template.docx -o output.docx

Arch package: sudo pacman -S pandoc