takashiishida/arxiv-to-prompt

Transform arXiv papers into a single LaTeX source that can be used as a prompt for asking LLMs questions about the paper.

65 / 100 (Established)

Automatically downloads arXiv source files, identifies the main LaTeX document, and flattens nested `\input`/`\include` commands into a single file. Options can strip comments, remove appendices, and expand macros to reduce token usage. The tool provides both a CLI and a Python API, with section extraction via hierarchical path notation, figure path extraction, and token counting via OpenAI's tokenizer. It integrates with the `llm` library for piped LLM queries and serves as a foundation for downstream tools such as MCP servers and iOS apps.
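The flattening step described above can be illustrated with a short sketch. This is not the project's implementation or API: the `flatten` function, its parameters, and the in-memory `files` mapping are hypothetical names chosen for illustration, and the regular expressions cover only the simplest `\input{...}`/`\include{...}` forms.

```python
import re

# Matches \input{name} or \include{name}.
INPUT_RE = re.compile(r"\\(?:input|include)\{([^}]+)\}")
# Matches an unescaped % and the rest of that line (does not handle \\% or verbatim).
COMMENT_RE = re.compile(r"(?<!\\)%.*")


def flatten(name, files, strip_comments=False, _seen=()):
    """Recursively inline \\input/\\include targets found in `files`.

    `files` is a hypothetical mapping of relative path -> file contents,
    standing in for an extracted arXiv source directory.
    """
    key = name if name.endswith(".tex") else name + ".tex"
    if key in _seen:
        raise ValueError(f"circular include: {key}")
    text = files[key]
    if strip_comments:
        # Strip comments first so commented-out \input lines are not expanded.
        text = COMMENT_RE.sub("", text)
    # A function replacement is inserted literally, so backslashes survive.
    return INPUT_RE.sub(
        lambda m: flatten(m.group(1).strip(), files, strip_comments, _seen + (key,)),
        text,
    )


if __name__ == "__main__":
    demo = {
        "main.tex": "A\n\\input{sec/intro}\nB % note\n",
        "sec/intro.tex": "Intro 50\\% done",
    }
    print(flatten("main", demo, strip_comments=True))
```

Stripping comments before expansion is a deliberate choice in this sketch: without it, a commented-out `\input` line would still be inlined. A real tool also has to resolve paths on disk and handle cases such as `\include` page breaks, which are omitted here.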

133 stars and 4,594 monthly downloads. Available on PyPI.

Maintenance 13 / 25
Adoption 18 / 25
Maturity 25 / 25
Community 9 / 25


Stars: 133
Forks: 7
Language: Python
License: MIT
Last pushed: Mar 08, 2026
Monthly downloads: 4,594
Commits (30d): 0
Dependencies: 3

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/prompt-engineering/takashiishida/arxiv-to-prompt"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
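For scripted access, the same request can be made from Python using only the standard library. This is a sketch under assumptions: the page shows a single URL, so the `category/owner/repo` path pattern is inferred from it, the helper names `quality_url` and `fetch_quality` are hypothetical, and the response is assumed (not confirmed) to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category, owner, repo):
    """Build the endpoint URL in the same shape as the curl example."""
    parts = [quote(p, safe="") for p in (category, owner, repo)]
    return "/".join([BASE_URL, *parts])


def fetch_quality(category, owner, repo, timeout=10):
    """GET the endpoint and parse the body as JSON (assumed response format)."""
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    data = fetch_quality("prompt-engineering", "takashiishida", "arxiv-to-prompt")
    print(json.dumps(data, indent=2))
```

With the keyless limit of 100 requests/day, it is worth caching the result locally rather than calling the endpoint on every run.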