kimi-docx

thvroyal/kimi-skills · updated May 12, 2026

MDX-style export adds YAML metadata + attribution linking explainx.ai and this canonical listing URL.

$npx skills add https://github.com/thvroyal/kimi-skills --skill kimi-docx
0 commentsdiscussion
summary

To preserve ANY formatting from the source document, MUST unzip and parse XML.

skill.md

Part 1: Goals

⚠️ When to Unzip vs Read

To preserve ANY formatting from the source document, MUST unzip and parse XML.

Read tool returns plain text only — fonts, colors, alignment, borders, styles are lost.

Need Method
Text content only (summarize, analyze, translate) Read tool is fine
Formatting info (copy styles, preserve layout, template filling) Unzip and parse XML
Structure + comments/track changes pandoc input.docx -t markdown

Core Principles

  1. Preserve formatting — When editing existing documents, retain original formatting. Clone and modify, never recreate.

  2. Correct feature implementation — Comments need multi-file sync. Track Changes need revision marks. Use the right structure.

Never use python-docx/docx-js as fallback. These libraries produce lower quality output than direct XML manipulation.

Source Principle

Template provided = Act as form-filler, not designer.

  • Format is the user's decision
  • Task: replace placeholders, not redesign
  • Like filling a PDF form—do not redesign

No template = Act as designer. Design freely based on scenario.

For .doc (legacy format), first convert with libreoffice --headless --convert-to docx.


Part 2: Execution

File Structure

docx/
├── SKILL.md                      ← This file (entry point + reference)
├── references/
│   └── EditingGuide.md           → Complete Python editing tutorial (comments, track changes, 5-file sync)
├── scripts/
│   ├── docx                      → Unified entry (the only script to call)
│   ├── fix_element_order.py      → Auto-fix XML element ordering
│   ├── validate_docx.py          → Business rule validation
│   ├── generate_backgrounds.py       → Morandi style backgrounds
│   ├── generate_inkwash_backgrounds.py → Ink-wash style backgrounds
│   └── generate_chart.py         → matplotlib (only for heatmaps/3D/radar; simple charts must use native)
├── assets/
│   └── templates/
│       ├── KimiDocx.csproj       → Project file template (for creating new docs)
│       ├── Program.cs            → Program entry template
│       ├── Example.cs            → Complete example (cover+TOC+charts+back cover)
│       ├── CJKExample.cs         → CJK content patterns (quote escaping, fonts)
│       └── xml/                  → XML templates for comments infrastructure
└── validator/                    → OpenXML validator (pre-compiled binary, AI does not modify)

Creating new documents: Use C# SDK with ./scripts/docx build → See Example.cs for patterns, CJKExample.cs for CJK content Editing existing documents: Use Python + lxml → See references/EditingGuide.md for complete tutorial

⚠️ Do NOT mix these approaches. C# SDK for creation, Python for editing. Never use python-docx/docx-js.

Environment Setup

First time, execute in the SKILL directory:

cd /app/.kimi/skills/kimi-docx/
./scripts/docx init

Fixed Path Conventions (cannot be changed):

Path Purpose
/app/.kimi/skills/kimi-docx/ SKILL directory, where commands are executed
/tmp/docx-work/ Working directory, edit Program.cs here
/mnt/okcomputer/output/ Output directory, final deliverables
/mnt/okcomputer/upload/ User upload location (input files)

Script Commands (./scripts/docx <cmd>):

Command Purpose
env Show environment status (no changes)
init Setup dependencies + workspace
build [out] Compile, run, validate (default: output/output.docx)
validate FILE Validate existing docx

The script automatically handles:

  • Detects dotnet/python3 (required), pandoc/playwright/matplotlib (optional)
  • Installed → use directly; Not installed → auto-install; Broken → repair
  • Initializes working directory, copies template files

Build Process

Must use ./scripts/docx build, do not execute dotnet build && dotnet run separately (skips validation).

Program.cs Output Path Convention (Critical)

Program.cs must get output path from command line arguments, otherwise build script cannot find the generated file:

// Correct - get output path from command line arguments
string outputFile = args.Length > 0 ? args[0] : "/mnt/okcomputer/output/output.docx";

// Wrong - hardcoded path causes build failure
string outputFile = "my_document.docx";  // Script can't find file!
Step Action Notes
1. Compile dotnet build Provides fix suggestions on failure
2. Generate dotnet run -- <output path> Path passed via command line args
3. Auto-fix fix_element_order.py Fixes XML element ordering issues
4. OpenXML validation validator/ Mandatory
5. Business rules validate_docx.py Mandatory
6. Statistics Character + word count Optional (requires pandoc)

Validation is mandatory: On failure, file is kept but warnings are shown. Check error messages to fix issues.

Standalone Validation

cd /app/.kimi/skills/kimi-docx/
./scripts/docx validate /mnt/okcomputer/output/report.docx

Content Verification (Mandatory)

pandoc is the SOURCE OF TRUTH. OpenXML validator checks structure; pandoc shows actual content.

Before delivery, verify with pandoc:

  • pandoc output.docx -t plain — check text completeness
  • For revisions/comments: add --track-changes=all to verify marker positions

⚠️ Critical: comments.xml exists ≠ comments visible. Count mismatch = doc_tree not saved. See references/EditingGuide.md §5.3.


Part 3: Quality Standards

Delivery Standard

Generic styling and mediocre aesthetics = mediocre delivery.

Deliver studio-quality Word documents with deep thought on content, functionality, and styling. Users often don't explicitly request advanced features (covers, TOC, backgrounds, back covers, footnotes, charts)—deeply understand needs and proactively extend.

Language Consistency

Document language = User conversation language (including filename, body text, headings, headers, TOC hints, chart labels, and all other text).

Headers and Footers - REQUIRED BY DEFAULT

Most documents MUST include headers and footers. The specific style (alignment, format, content) should match the document's overall design.

  • Header: Typically document title, company name, or chapter name
  • Footer: Typically page numbers (format flexible: "X / Y", "Page X", "— X —", etc.)
  • Cover/Back cover: Use TitlePage setting to hide header/footer on first page

Professional Elements (Critical)

Create documents that exceed user expectations, proactively add professional elements, don't wait for users to ask. Delivery standard: Visual quality of a top designer in 2024.

Cover & Visual:

  • Formal documents (proposals, reports, financials, bids, contracts) / creative documents (invitations, greeting cards) must have cover and back cover
  • Covers must have designer-quality background images
  • Body pages can optionally include backgrounds to enhance visual appeal

Structure:

  • Long documents (3+ sections) add TOC, must add refresh hint after TOC

Data Presentation:

  • When comparing data or showing trends, use charts instead of plain text lists
  • Tables use light gray headers or three-line style, avoid Word default blue

Links & References:

  • URLs must be clickable hyperlinks
  • Multiple figures/tables add numbering and cross-references ("see Figure 1", "as shown in Table 2")
  • Academic/legal/data analysis citation scenarios implement correct in-text click-to-jump references with corresponding footnotes/endnotes

TOC Refresh Hint

Word TOC is field code, page numbers may be inaccurate when generated. Must add gray hint text after TOC, informing users to manually refresh:

Table of Contents
─────────────────
Chapter 1 Overview .......................... 1
Chapter 2 Methods ........................... 3
...

(Hint: On first open, right-click the TOC and select "Update Field" to show correct page numbers)

Hint text requirements:

  • Visually subtle — gray color, smaller font size, should not compete with actual TOC entries
  • Language: Matches user conversation language

Only When User Explicitly Requests

Feature Reason
Watermark Changes visual state. SDK limitation: VML watermark classes don't serialize correctly; must write raw XML to header.
Document protection Restricts editing
Mail merge fields Requires data source

Chart Selection Strategy (Critical)

Default to native Word charts, editable, small file size, professional.

Chart Type Method Notes
Pie chart Native Example.csAddPieChart()
Bar chart Native Example.csAddBarChart()
Line chart Native Reference bar chart structure, use c:lineChart
Horizontal bar Native Reference bar chart structure, use barDir="bar"
Heatmap, 3D, radar matplotlib Word native doesn't support
Complex statistics (box plot, etc.) matplotlib Word native doesn't support

Native charts are preferred (editable, smaller files), but matplotlib is acceptable for data analysis scenarios.

Inserting Images/Charts

Any PNG (matplotlib charts, backgrounds, photos) must be inserted using AddInlineImage():

AddInlineImage(body, mainPart, "/path/to/image.png", "Description", docPrId++);

Critical:

  • Chart labels/titles must match document language (e.g., Chinese labels for Chinese docs)
  • Build output shows X images — if 0, images were not inserted

Content Constraints

Word/Page Count Requirements

User Request Execution Standard
Specific word count (e.g., "3000 words") Actual output within ±20%
Specific page count (e.g., "5 pages") Exact match
Range (e.g., "2000-3000 words") Within range
Minimum (e.g., "at least 5000 words") No more than 2x the requirement

Forbidden: Padding word count with excessive bullet point lists. Maintain information density.

Outline Adherence

  • User provides outline: Follow strictly, no additions, deletions, or reordering
  • No outline provided: Use standard structure
    • Academic: Introduction → Literature → Methods → Results → Discussion → Conclusion
    • Business: Executive Summary → Analysis → Recommendations
    • Technical: Overview → Principles → Usage → Examples → FAQ

Scene Completeness

Think one step ahead of the user, complete elements the scenario needs. Examples below are not exhaustive — apply this principle to ALL document types:

  • Exam paper → Name/class/ID fill areas, point allocation per question (consider total), grading section
  • Contract → Signature and seal areas for both parties, date, contract number, attachment list
  • Meeting minutes → Attendees, absentees, action items with owners, next meeting time

Design Philosophy

Color Scheme

Low saturation tones, avoid Word default blue and matplotlib default high saturation.

Flexibly choose color schemes based on document scenario:

Style Palette Suitable Scenarios
Morandi Soft muted tones Artistic, editorial
Earth tones Brown, olive, natural Environmental, organic
Nordic Cool gray, misty blue Minimalist, tech
Japanese Wabi-sabi Gray, raw wood, zen Traditional, contemplative
French elegance Off-white, dusty pink Luxury, feminine
Industrial Charcoal, rust, concrete Manufacturing, engineering
Academic Navy, burgundy, ivory Research, education
Ocean mist Misty blue, sand Marine, wellness
Forest moss Olive, moss green Nature, sustainability
Desert dusk Ochre, sandy gold Warm, regional

Color scheme must be consistent within the same document.

Layout

White space (margins, paragraph spacing), clear hierarchy (H1 > H2 > body), proper padding (text shouldn't touch borders).

Pagination Control

Word uses flow layout, not fixed pages. Control pagination with these properties:

Property XML Effect
Keep with next <w:keepNext/> Heading stays on same page as following paragraph
Keep lines together <w:keepLines/> Paragraph won't break across pages
Page break before <w:pageBreakBefore/> Force new page (for H1)
Widow/orphan control <w:widowControl/> Prevent single lines at top/bottom of page
// Example: H1 always starts on new page, stays with next paragraph
new ParagraphProperties(
    new ParagraphStyleId { Val = "Heading1" },
    new PageBreakBefore(),
    new KeepNext(),
    new KeepLines()
)

Table pagination:

// Allow row to break across pages (avoid large blank areas)
new TableRowProperties(
    new CantSplit { Val = false }  // false = can split
)

// Repeat header row on each page
new TableRowProperties(
    new TableHeader()
)

Part 4: Technical Reference

Choose your path:

Task Stack Reference
Create new document C# + OpenXML SDK 4.1-4.6 + Example.cs
Edit existing document Python + lxml 4.7 + references/EditingGuide.md

4.1 SDK Fundamentals

Schema Compliance (MEMORIZE THESE)

OpenXML has strict element ordering requirements. Wrong order = Word cannot open the file.

Required Styles

// Normal style must exist - all Heading styles use basedOn="Normal"
styles.Append(new Style(
    new StyleName { Val = "Normal" },
    new StyleParagraphProperties(
        new SpacingBetweenLines { After 
how to use kimi-docx

How to use kimi-docx on Cursor

AI-first code editor with Composer

1

Prerequisites

Before installing skills in Cursor, ensure your development environment meets these requirements:

  • Cursor installed and configured on your development machine
  • Node.js version 16.0+ with npm package manager (verify with node --version)
  • Active project directory or workspace where you want to add kimi-docx
2

Execute installation command

Execute the skills CLI command in your project's root directory to begin installation:

$npx skills add https://github.com/thvroyal/kimi-skills --skill kimi-docx

The skills CLI fetches kimi-docx from GitHub repository thvroyal/kimi-skills and configures it for Cursor.

3

Select Cursor when prompted

The CLI will show a list of available agents. Use arrow keys to navigate and space to select Cursor:

◆ Which agents do you want to install to?
│ ── Universal (.agents/skills) ── always included ────
│ • Amp
│ • Antigravity
│ • Cline
│ • Codex
│ ●Cursor(selected)
│ • Cursor
│ • Windsurf
4

Verify installation

Confirm successful installation by checking the skill directory location:

.cursor/skills/kimi-docx

Reload or restart Cursor to activate kimi-docx. Access the skill through slash commands (e.g., /kimi-docx) or your agent's skill management interface.

Security & Verification Notice

We perform automated surface-level scans (Gen AI Scanner, Socket, Snyk) during installation. These checks detect common vulnerabilities but do not guarantee complete security. Always review skill source code and verify the publisher's reputation before production use.

Skills execute code in your development environment. Always verify the publisher's identity, review recent commits, and test in isolated environments before production deployment.

List & Monetize Your Skill

Submit your Claude Code skill and start earning

GET_STARTED →

Use Cases

Task Automation & Efficiency

Automate repetitive workflows and reduce manual effort

Example

Generate reports, summarize documents, draft communications

Save 3-5 hours per week on routine tasks

Knowledge Enhancement

Learn new skills, understand complex topics, get expert guidance

Example

Explain concepts, provide examples, suggest learning resources

Accelerate learning and skill development by 2x

Quality Improvement

Enhance output quality through reviews, suggestions, and refinements

Example

Review drafts, suggest improvements, catch errors

Improve work quality by 30-40% with less effort

Implementation Guide

Prerequisites

  • Claude Desktop or compatible AI client with skill support
  • Clear understanding of task or problem to solve
  • Willingness to iterate and refine outputs

Time Estimate

15-45 minutes depending on use case complexity

Installation Steps

  1. 1.Install skill using provided installation command
  2. 2.Test with simple use case relevant to your work
  3. 3.Evaluate output quality and relevance
  4. 4.Iterate on prompts to improve results
  5. 5.Integrate into regular workflow if valuable

Common Pitfalls

  • Expecting perfect results without iteration
  • Not providing enough context in prompts
  • Using skill for tasks outside its intended scope
  • Accepting outputs without review and validation

Best Practices

✓ Do

  • +Start with clear, specific prompts
  • +Provide relevant context and constraints
  • +Review and refine all outputs before using
  • +Iterate to improve output quality
  • +Document successful prompt patterns

✗ Don't

  • Don't use without understanding skill limitations
  • Don't skip validation of outputs
  • Don't share sensitive information in prompts
  • Don't expect skill to replace human judgment

💡 Pro Tips

  • Be specific about desired format and style
  • Ask for multiple options to choose from
  • Request explanations to understand reasoning
  • Combine AI efficiency with human expertise

When to Use This

✓ Use When

Use when skill capabilities match your task, clear ROI on time saved, and you can validate outputs. Best for repetitive tasks, learning, and quality improvement.

✗ Avoid When

Avoid when task requires deep expertise you can't validate, involves sensitive decisions, or when learning process is more valuable than speed of completion.

Learning Path

  1. 1Familiarize yourself with skill capabilities and limitations
  2. 2Start with low-risk, non-critical tasks
  3. 3Progress to more complex and valuable use cases
  4. 4Build expertise through regular use and experimentation

Discussion

Product Hunt–style comments (not star reviews)
  • No comments yet — start the thread.
general reviews

Ratings

4.651 reviews
  • Hassan Ndlovu· Dec 28, 2024

    Keeps context tight: kimi-docx is the kind of skill you can hand to a new teammate without a long onboarding doc.

  • Benjamin Martin· Dec 24, 2024

    Useful defaults in kimi-docx — fewer surprises than typical one-off scripts, and it plays nicely with `npx skills` flows.

  • William Smith· Dec 24, 2024

    kimi-docx has been reliable in day-to-day use. Documentation quality is above average for community skills.

  • Pratham Ware· Dec 20, 2024

    We added kimi-docx from the explainx registry; install was straightforward and the SKILL.md answered most questions upfront.

  • Layla Brown· Dec 16, 2024

    I recommend kimi-docx for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • William Sharma· Dec 8, 2024

    Registry listing for kimi-docx matched our evaluation — installs cleanly and behaves as described in the markdown.

  • Layla Johnson· Nov 27, 2024

    Solid pick for teams standardizing on skills: kimi-docx is focused, and the summary matches what you get after install.

  • Benjamin Sharma· Nov 15, 2024

    I recommend kimi-docx for anyone iterating fast on agent tooling; clear intent and a small, reviewable surface area.

  • Yash Thakker· Nov 11, 2024

    kimi-docx fits our agent workflows well — practical, well scoped, and easy to wire into existing repos.

  • Layla Yang· Nov 7, 2024

    kimi-docx has been reliable in day-to-day use. Documentation quality is above average for community skills.

showing 1-10 of 51

1 / 6