Are my Word headings preserved? They look wrong in the output.

**Only if you used real heading styles in Word.** This is the single most common cause of "ugly" conversions. If you made a heading by **bumping the font size to 18 and clicking Bold**, Word records it as "Normal text in 18pt bold", not as a heading. Mammoth has no way to know that was supposed to be a Heading 1. **Fix it at the source**: in Word, click the line, then click "Heading 1" in the Styles ribbon. Same for Heading 2, 3 and so on. Re-save, re-drop the file. Now the Markdown gets proper `#`, `##`, `###` prefixes. Pro tip: turn on the **Styles pane** (Alt+Ctrl+Shift+S on Windows) so you can see what every paragraph is tagged as.

Do tables come through?

**Yes, basic tables become GitHub-flavored Markdown pipe tables** (`| a | b |` with a separator row). The text in every cell is preserved, plus bold / italic / links inside cells. **What does not survive**: merged cells (Markdown tables do not support them, the cell content lands in the first column), per-cell background colors, custom borders, vertically rotated text, and nested tables (Markdown forbids them). If your Word doc has a complex table layout, the Markdown will be flatter than the original. For mostly-text tables (a versions list, a feature matrix), the result is clean and usable.

Are images included? What does "embed as base64" do?

**You have two options**: - **Strip images** (default): all images are removed. The Markdown is small, readable, and there are no large `data:` URLs. Best when you only care about the **text content**. - **Embed as base64**: every embedded image becomes a `![alt](data:image/png;base64,...)` inline reference. The Markdown is self-contained (no external files needed) but the file gets **big fast**, a single screenshot can add 200 KB or more. What we **do not** offer: extracting images to separate files. If you need that, run mammoth locally with the `convertImage` option pointing to a directory. **Note**: vector graphics (charts, SmartArt) are not extractable as images at all, they appear as warnings in the panel.

What about footnotes, endnotes and comments?

**Footnotes and endnotes are dropped** by default. Mammoth lists them in the warnings as "footnotes lost". The footnote text exists in a separate XML stream inside the docx (`footnotes.xml`) and there is no clean way to express it in Markdown without polluting the flow. If you need them, the typical workaround is to **manually copy footnote text to the end of the Markdown** under a "Notes" heading. **Comments** (Review > New Comment in Word) are **never carried over**, they live in `comments.xml` and are metadata, not document content. **Track changes** are flattened to the **accepted version** of the text, no markers remain.

Why are my "track changes" not visible in the Markdown?

Because **Markdown has no concept of tracked changes.** When mammoth parses the document XML, it sees the **final accepted text**. The deletion markers, insertion markers and author attributions all live in ` ` and ` ` elements that have no Markdown equivalent. **Workaround if you need the changes visible**: in Word, accept or reject all changes first (Review > Accept All / Reject All) so the doc is clean, then convert. Or, if you specifically need to show **what changed**, export both versions (before and after) to Markdown separately and run them through a **diff tool** (we have a Text Diff tool that does this).

Will my custom Word styles survive?

**No, only the structure does, not the styling.** Markdown is a content format, not a presentation format. It does not care if your Heading 1 is Calibri 24pt blue centered, only that it is a Heading 1. The conversion strips all font, color, alignment, line-height and margin information. **What survives**: bold, italic, strikethrough, links, headings (as levels 1 through 6), lists (bulleted vs numbered), tables (as text), code blocks. **What does not**: fonts, colors, alignment, indentation, line spacing, theme colors, custom paragraph styles. If you need styled output (PDF, print-ready), Markdown is the wrong target format, use the original docx.

Are page breaks and section breaks honored?

**No, both are ignored.** Markdown is a **flow format**: the rendered output reflows to fit whatever width the reader uses, so there is no concept of a "page". Page breaks in Word (Ctrl+Enter), section breaks, column breaks, manual line breaks inside paragraphs are all dropped. The text from page 1 flows directly into the text from page 2 with no separator. **If you want a visible separator**, the convention is a horizontal rule (`---` on its own line). You can search-and-replace that into the converted Markdown manually if you need it. For paginated output, convert the Markdown back to PDF afterwards using **Pandoc**, **Typst** or a static site generator with print CSS.

How is this different from Pandoc?

**Pandoc is the gold standard** for document conversion, supports dozens of input and output formats, has a powerful template system, can output PDF directly. But **Pandoc is a command-line tool you install locally** (not always trivial on a corporate Windows machine without admin rights) and the CLI takes some learning. **This tool is browser-based, zero install, no command line**. Under the hood it uses **mammoth**, which is more **narrowly focused** than Pandoc: - Mammoth only reads .docx, only outputs Markdown or HTML. - Mammoth keeps the conversion **simple and predictable**: it does not invent structure that was not in the source. - Pandoc is **more aggressive** with formatting and can produce subtly different output (extra blank lines, smart quotes, different list markers). **Use this tool** for quick one-off conversions and when you do not have Pandoc handy. **Use Pandoc** for batch processing, complex docs with math, or when you need fine-grained control over the output.

How big a .docx file can I upload?

**Practical limit is around 25 MB**, which already covers a 500-page document with embedded images. The server reads the file into memory, unzips the OpenXML, walks the document tree, and emits Markdown. For a **text-heavy 50-page doc**, expect 1 to 2 seconds. For a **200-page doc with screenshots**, expect 5 to 10 seconds. **What slows it down most**: hundreds of embedded images with "embed as base64" enabled (the base64 encoding is itself CPU work). **Rate limit**: 30 conversions per hour per IP, which is plenty for a normal session. If you hit the limit, wait an hour or batch-process locally with mammoth on your own machine.

DOCX to Markdown Converter - free

Convert Word .docx into clean Markdown, headings, lists and tables kept intact

You have a .docx file from Word, Google Docs export or LibreOffice and you need it as Markdown for your README, static site, blog post or Notion / Obsidian note. Copy-pasting from Word into a Markdown editor leaves you with garbage: smart quotes, broken bullet points, no headings.

This tool reads the OpenXML structure inside your .docx (Word is really a ZIP archive with XML inside), maps paragraph styles to Markdown headings, bullet and numbered lists to `-` and `1.`, tables to GitHub-flavored tables, and bold / italic / links to their Markdown equivalents.

The whole job runs server-side in our Node process using `mammoth`, the same library Pandoc-style tools use under the hood. The file is parsed in memory and discarded immediately, never written to disk, never logged. You see two panes: the raw Markdown on the left (copy or download), and a live HTML preview on the right so you can sanity-check the result before pasting it into your repo.

How to use it

Drop your .docx file into the dropzone, or click "Choose file". Only `.docx` is accepted, the older `.doc` binary format is not supported (re-save it from Word first).
Pick image handling with the switch at the top: "Strip images" (default, fastest, cleanest Markdown) or "Embed as base64" (every image becomes a `data:image/png;base64,...` URL inline in the Markdown).
Hit Convert. The server unzips the .docx, walks the document XML, and returns the converted Markdown plus a list of warnings for anything that did not map cleanly.
Read the warnings panel at the top. Mammoth lists things it could not convert: unsupported styles, lost footnotes, dropped comments. Decide whether you care.
On the left pane you see the raw Markdown. Hit Copy to put it on the clipboard, or Download to save it as a `.md` file with the same base name as your source.
On the right pane you see the HTML preview rendered from the Markdown. This is what a Markdown engine (GitHub, Notion, your static site) will display.
If the result looks wrong, the usual fix is upstream: in Word, apply real heading styles (Heading 1, Heading 2) instead of just bumping the font size. Resave, re-drop.
Nothing is stored. The file is read into a buffer, converted, and the buffer is released. No copy lives on our servers.

When this is useful

Six common situations where this tool replaces 20 minutes of manual cleanup:

Importing a long Word draft into a static site. You wrote a 4000-word article in Word with proper heading styles. You need it as Markdown for Hugo, Astro, Next.js MDX or Jekyll. Drop, convert, paste, done. Headings, lists, links, tables, all preserved.
Migrating internal docs from SharePoint or Google Docs to a wiki. Your team is moving from a Word-based knowledge base to Notion, Obsidian, Outline or BookStack. Batch-export the Word files, run each through this tool, get clean Markdown ready to paste.
Turning a vendor spec into a README. The vendor sent you a 30-page Word spec with numbered headings and tables. Convert to Markdown, drop into your repo as `docs/spec.md`. Search-friendly, diffable, version-controlled.
Preparing content for an LLM context window. You want to feed a Word doc into ChatGPT, Claude or a local model. Markdown is far more token-efficient than raw Word HTML and the model parses structure (headings, lists) better.
Quoting a section in a GitHub issue or pull request. You got a Word file as a bug report. Convert, copy the relevant section, paste into the issue. The structure (the user step list, the table of versions) survives intact.
Translating a legal contract template. You have the original in .docx, you need a clean Markdown version to run through a translation pipeline. Convert, translate the Markdown (where formatting is text not metadata), then re-export.

Questions and answers

Converted cleanly: - Headings based on Word's paragraph styles (Heading 1 → `#`, Heading 2 → `##`, and so on up to `######`). - Bullet and numbered lists (including nested lists, up to ~6 levels). - Bold, italic, strikethrough. - Hyperlinks with the visible text and the target URL. - Tables as GitHub-flavored Markdown pipe tables. - Code blocks when Word applied a monospace style. Dropped or simplified: - Footnotes and endnotes:mammoth flags these in the warnings list, the text is usually lost. - Comments and track changes:never carried over. - Page headers and footers:Markdown has no equivalent. - Page numbers, page breaks, section breaks:Markdown is a flow format, not paginated. - Text boxes, shapes, embedded SmartArt:not extractable as text. - Equations (OMML / MathML): dropped unless you have a separate equation pipeline. Every drop shows up in the warnings panel so you know exactly what is missing.

Convert Word .docx into clean Markdown, headings, lists and tables kept intact

How to use it

Drop your .docx file into the dropzone, or click "Choose file". Only `.docx` is accepted, the older `.doc` binary format is not supported (re-save it from Word first).

Pick image handling with the switch at the top: "Strip images" (default, fastest, cleanest Markdown) or "Embed as base64" (every image becomes a `data:image/png;base64,...` URL inline in the Markdown).

Hit Convert. The server unzips the .docx, walks the document XML, and returns the converted Markdown plus a list of warnings for anything that did not map cleanly.

Read the warnings panel at the top. Mammoth lists things it could not convert: unsupported styles, lost footnotes, dropped comments. Decide whether you care.

On the left pane you see the raw Markdown. Hit Copy to put it on the clipboard, or Download to save it as a `.md` file with the same base name as your source.

On the right pane you see the HTML preview rendered from the Markdown. This is what a Markdown engine (GitHub, Notion, your static site) will display.

If the result looks wrong, the usual fix is upstream: in Word, apply real heading styles (Heading 1, Heading 2) instead of just bumping the font size. Resave, re-drop.

Nothing is stored. The file is read into a buffer, converted, and the buffer is released. No copy lives on our servers.

When this is useful

Six common situations where this tool replaces 20 minutes of manual cleanup:

Importing a long Word draft into a static site. You wrote a 4000-word article in Word with proper heading styles. You need it as Markdown for Hugo, Astro, Next.js MDX or Jekyll. Drop, convert, paste, done. Headings, lists, links, tables, all preserved.
Migrating internal docs from SharePoint or Google Docs to a wiki. Your team is moving from a Word-based knowledge base to Notion, Obsidian, Outline or BookStack. Batch-export the Word files, run each through this tool, get clean Markdown ready to paste.
Turning a vendor spec into a README. The vendor sent you a 30-page Word spec with numbered headings and tables. Convert to Markdown, drop into your repo as `docs/spec.md`. Search-friendly, diffable, version-controlled.
Preparing content for an LLM context window. You want to feed a Word doc into ChatGPT, Claude or a local model. Markdown is far more token-efficient than raw Word HTML and the model parses structure (headings, lists) better.
Quoting a section in a GitHub issue or pull request. You got a Word file as a bug report. Convert, copy the relevant section, paste into the issue. The structure (the user step list, the table of versions) survives intact.
Translating a legal contract template. You have the original in .docx, you need a clean Markdown version to run through a translation pipeline. Convert, translate the Markdown (where formatting is text not metadata), then re-export.

Questions and answers

DOCX to Markdown Converter

Drop your .docx file here

Convert Word .docx into clean Markdown, headings, lists and tables kept intact

How to use it

When this is useful

Questions and answers

Related tools

PDF Text Extractor

HTML / Markdown Converter

XLSX to JSON / CSV Converter

JSON formatter

DOCX to Markdown Converter

Drop your .docx file here

Convert Word .docx into clean Markdown, headings, lists and tables kept intact

How to use it

When this is useful

Questions and answers

Related tools

PDF Text Extractor

HTML / Markdown Converter

XLSX to JSON / CSV Converter

JSON formatter