Convert PDF to Markdown Online
Extract text from a PDF and export it as a Markdown .md file, with each page as a separate section.
Markdown has become the standard format for documentation, README files, static site generators, and note-taking apps. If you have content locked inside a PDF and need to get it into a Markdown workflow — whether for a wiki, a docs site, or a writing tool like Obsidian or Notion — Dockitt extracts the text page by page and formats it as clean Markdown. Each page becomes a section with a heading, making the structure easy to navigate and edit. The extraction runs entirely in your browser using PDF.js, without uploading anything to a server.
How to use
- Click 'Choose PDF' and select the PDF file you want to convert to Markdown.
- Click 'Extract to Markdown' and wait while each page is processed.
- Download the .md file. Each page of the PDF becomes a ## Page N section in the output.
FAQ
What Markdown structure does the output use?
Each page of the PDF becomes a section introduced by a ## Page N heading. The extracted text from that page follows immediately below. This makes the output easy to read, edit, and split in any Markdown editor.
Will formatting like bold, italics, or tables be preserved?
No. PDF.js extracts the raw text content of each page. Visual formatting such as bold, italics, tables, bullet lists, and column layouts is not preserved. The output is plain text structured with Markdown section headings. For complex formatted documents, manual cleanup of the Markdown will be needed after extraction.
What if the PDF contains only scanned images?
If the PDF is a scan — meaning it contains images of text rather than actual text data — the extraction will produce empty or near-empty sections. PDF.js can only extract text that is encoded as text in the PDF. For scanned PDFs, use the OCR PDF tool on Dockitt first to create a searchable PDF, then convert that to Markdown.
Can I use the output directly in a static site generator?
Yes, with some editing. The output is valid Markdown that any static site generator (Hugo, Jekyll, Astro, etc.) can process. You will likely need to add front matter (title, date, tags) at the top of the file and clean up any text that was oddly extracted due to the original PDF layout. Multi-column layouts in particular may produce lines in an unexpected order.
Can I convert a password-protected PDF to Markdown?
No. The PDF must be unlocked before it can be processed. Use the Dockitt Unlock PDF tool to remove the password first, then convert the unlocked file to Markdown.
Is the conversion done in my browser or on a server?
The conversion runs entirely in your browser using PDF.js. Your file is never uploaded to any server. It stays on your device throughout the entire process.