No. The HTML is parsed and converted entirely in your browser tab. There is no server endpoint, no upload, no telemetry. You can verify this in the Network panel of your browser's developer tools.

What does the sanitize pass do?

` ` tags, inline event handlers (`onclick`, `onload` etc.), ` ` blocks and external iframe embeds are stripped before conversion. That's part of the standard path and protects against hidden script payloads in clipped or scraped HTML.

What happens with images?

Image tags become Markdown image references (`![alt](src)`). External image URLs stay as links — the tool doesn't fetch them, since it makes no network requests. If you want images embedded, download them separately and reference them relatively.

Are tables supported?

Yes, ` ` structures with ` `, ` `, ` ` and ` ` become GitHub Flavored Markdown pipe tables. Complex tables with `colspan`/`rowspan` get a hint block — Markdown pipe tables don't support cell merging.

Does the paste box work with code from the browser inspector?

Yes. From the developer tools you can copy a DOM node as HTML ('Copy outerHTML') and paste it directly into the box. The sanitize pass strips the unnecessary attributes; the resulting Markdown stays slim.

What happens to inline formatting?

` `/` ` → **bold**, ` `/` ` → *italic*, ` ` → `inline code`, ` ` → `[text](URL)`, ` ` → `>` block quote, ` `/` ` → `- ` and `1. ` lists. Unusual or visually-oriented tags (` `, ` `, ` `) are ignored.

How large can the HTML be?

Up to roughly 5 MB per file or paste block is realistic — that covers very long articles or full web exports. Larger is possible, but the browser's RAM is the limit, and the resulting Markdown becomes unwieldy.

Can I use this with arbitrary website URLs?

URL fetching is not in this version — that would carry CORS, rate-limit and ToS implications we deliberately keep out of the browser tool. To convert a website, copy the HTML source from your browser (`Ctrl+U` → Copy All) and paste it into the box.

HTML to Markdown — file or paste mode, in your browser

Why convert HTML to Markdown?

HTML is the source-code format of the web — and very often the data form in which you find knowledge: an email newsletter that’s only searchable as HTML source; a note backup from Evernote or OneNote; a scraped article; a CMS post export. Markdown, by contrast, is the format in which you maintain knowledge long-term — diffable, plain-text, readable in any editor, natively understood by Obsidian and Logseq.

This tool builds the bridge. Drop in an HTML file or HTML source code, the tool parses the DOM structure, sanitizes scripts and inline handlers out, and writes GitHub Flavored Markdown with headings, lists, tables and inline formats preserved.

How does the conversion work technically?

HTML is parsed into a DOM tree via the browser’s native HTML parser — the same mechanism every web page uses internally, but in sandboxed mode without script execution. A sanitize pass strips <script> tags, <style> blocks, inline event handlers, and <iframe> embeds before further processing. That makes it safe to convert even scraped or foreign HTML files.

A proven open-source HTML-to-Markdown library translates the DOM tree into Markdown: headings (<h1> → #, <h2> → ##), paragraphs (<p> → empty line between blocks), lists (<ul> → -, <ol> → 1.), tables (<table> → GFM pipe tables), inline formats (<strong> → **, <em> → *, <code> → `).

What are typical use cases?

Archive email newsletters. Newsletters in HTML format come into your Obsidian vault as Markdown.
Prepare web content for AI prompts. A long article becomes Markdown that fits inside your Claude or GPT prompt.
Note migration from Evernote / OneNote. HTML note exports land as clean .md files in your new note system.
Blog migration to Hugo / Astro. Existing HTML posts become Markdown posts with frontmatter that static-site generators understand.
Wiki content from Confluence exports. HTML exports from Confluence/SharePoint become Markdown pages for Notion alternatives.

What stays — what doesn’t?

Preserved: heading hierarchies (<h1> through <h6>), paragraphs, ordered and unordered lists with nesting, tables without merges as GFM pipe tables, inline formats (bold, italic, inline code, strikethrough), hyperlinks with anchor text, block quotes, code blocks with language hint, images as Markdown references (original src is kept).

Deliberately stripped: scripts, <style> blocks, inline event handlers, external iframes (sanitize pass). That’s non-negotiable — especially with scraped or clipped HTML, it protects against hidden script payloads.

Not 1:1 convertible: tables with colspan/rowspan (hint block, because Markdown pipe tables don’t support cell merging), CSS layout tricks, JavaScript-generated content (the sanitize pass runs before script execution — JS content is not rendered).

How does the tool keep my HTML private?

When you’re parsing an HTML export from a private note app, the last thing you want is the content going to a foreign server. Newsletter archives can also carry confidential tracking IDs or personal greeting fields embedded in the HTML.

None of that happens here. The HTML is parsed and converted to Markdown inside the browser tab via web standards (native HTML parser, WebAssembly). Open the Network panel of your developer tools and watch: no request, no upload, no server communication. The paste box runs entirely client-side as well.

This tool is part of the Markdown converter family:

DOCX to Markdown — Word documents with heading structure and lists.
PDF to Markdown — including scanned PDFs via OCR.
CSV to Markdown — also TSV with delimiter auto-detection.
XLSX to Markdown — Excel including XLS and ODS, multi-sheet support.

HTML to Markdown

How It Works

Bring HTML in

Check sanitize

Take the Markdown

Privacy

How do you use this tool?

Why convert HTML to Markdown?

How does the conversion work technically?

What are typical use cases?

What stays — what doesn’t?

How does the tool keep my HTML private?

How It Works

Bring HTML in

Check sanitize

Take the Markdown

Privacy

Why convert HTML to Markdown?

How does the conversion work technically?

What are typical use cases?

What stays — what doesn’t?

How does the tool keep my HTML private?

Which related converters exist?