artitw/text2text

Text2Text Language Modeling Toolkit

52
/ 100
Established

Provides unified APIs for multilingual text processing tasks including LLM inference (with streaming and structured output support), embeddings, semantic search via TF-IDF/BM25 indexing, machine translation, and data augmentation through back-translation. Built on transformer-based models with pluggable pretrained backends (defaulting to Facebook's M2M-100), operating efficiently on commodity hardware and free Colab resources. Exposes sub-word tokenization, language identification, and edit-distance calculations alongside a conversational assistant interface compatible with OpenAI's chat completion schema.

303 stars and 606 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m
Maintenance 0 / 25
Adoption 16 / 25
Maturity 18 / 25
Community 18 / 25

How are scores calculated?

Stars

303

Forks

41

Language

Python

License

Last pushed

Jan 14, 2025

Monthly downloads

606

Commits (30d)

0

Dependencies

18

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/artitw/text2text"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.