markitdown and markpdfdown

These are **competitors** that address the same use case with different approaches—markitdown handles general file and document conversion to Markdown with broad format support, while markpdfdown specializes specifically in PDF-to-Markdown conversion using LLM-based visual recognition for higher quality extraction.

markitdown
84
Verified
markpdfdown
48
Emerging
Maintenance 16/25
Adoption 25/25
Maturity 25/25
Community 18/25
Maintenance 10/25
Adoption 10/25
Maturity 9/25
Community 19/25
Stars: 90,677
Forks: 5,354
Downloads: 3,746,161
Commits (30d): 3
Language: Python
License: MIT
Stars: 1,669
Forks: 129
Downloads:
Commits (30d): 0
Language: Python
License: Apache-2.0
No risk flags
No Package No Dependents

About markitdown

microsoft/markitdown

Python tool for converting files and office documents to Markdown.

Supports 20+ file formats (PDF, Office documents, images, audio, HTML, archives) with modular optional dependencies and preserves document structure like tables and headings. Uses a pluggable converter architecture that streams from file-like objects without creating temporary files, and integrates with LLMs for image descriptions and speech transcription. Includes an MCP server for Claude Desktop integration and optional Azure Document Intelligence backend for advanced layout analysis.

About markpdfdown

MarkPDFdown/markpdfdown

A high-quality PDF to Markdown tool based on large language model visual recognition. 一款基于大模型视觉识别的高质量PDF转Markdown工具

Related comparisons

Scores updated daily from GitHub, PyPI, and npm data. How scores work