microsoft/markitdown

Python tool for converting files and office documents to Markdown.

84
/ 100
Verified

Supports 20+ file formats (PDF, Office documents, images, audio, HTML, archives) with modular optional dependencies and preserves document structure like tables and headings. Uses a pluggable converter architecture that streams from file-like objects without creating temporary files, and integrates with LLMs for image descriptions and speech transcription. Includes an MCP server for Claude Desktop integration and optional Azure Document Intelligence backend for advanced layout analysis.

90,677 stars and 3,746,161 monthly downloads. Used by 25 other packages. Actively maintained with 3 commits in the last 30 days. Available on PyPI.

Maintenance 16 / 25
Adoption 25 / 25
Maturity 25 / 25
Community 18 / 25

How are scores calculated?

Stars

90,677

Forks

5,354

Language

Python

License

MIT

Last pushed

Mar 10, 2026

Monthly downloads

3,746,161

Commits (30d)

3

Dependencies

6

Reverse dependents

25

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/microsoft/markitdown"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.