LLM Data Labeling MLOps Tools

There are 3 LLM data labeling tools tracked, and 2 of them score above 50 (the Established tier). The highest-rated is datajuicer/data-juicer, at 64/100 with 6,051 stars. Of the top 10, 1 is actively maintained.

Get all 3 projects as JSON:

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=mlops&subcategory=llm-data-labeling&limit=20"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
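The same request can be made from Python. The sketch below uses only the standard library and mirrors the query parameters of the curl command. The helper names `build_url` and `fetch_projects` are illustrative, not part of the API, and because the response schema is not described on this page, the decoded JSON is returned unmodified rather than unpacked into fields.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="mlops", subcategory="llm-data-labeling", limit=20):
    """Assemble the query URL with the same parameters as the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10.0):
    """GET the URL and decode the response body as JSON.

    Assumption: the endpoint returns a JSON body. Its structure is not
    documented here, so the decoded value is returned as-is.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs the network request and counts against the daily quota, so it is worth caching the result locally when experimenting.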

| # | Tool | Description | Score | Tier |
|---|------|-------------|-------|------|
| 1 | datajuicer/data-juicer | Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷 | 64 | Established |
| 2 | dermatologist/pyomop | Python package for managing OHDSI clinical data models. Includes support for... | 63 | Established |
| 3 | duoan/mega-data-factory | 🏭 Mega Scale Multimodal DataPipeline for SOTA Foundation Models | 49 | Emerging |
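The summary figures at the top of the page can be checked against the listing with a few lines of Python. This is a minimal sketch: the rows are copied by hand from the listing above, and the `Tool` type and helper names are hypothetical. The only tier rule assumed is the one stated in the summary (a score above 50 is Established). The lower bound of the Emerging tier is not given, so it is not modeled.

```python
from typing import NamedTuple


class Tool(NamedTuple):
    name: str
    score: int
    tier: str


# Rows copied from the listing on this page.
TOOLS = [
    Tool("datajuicer/data-juicer", 64, "Established"),
    Tool("dermatologist/pyomop", 63, "Established"),
    Tool("duoan/mega-data-factory", 49, "Emerging"),
]

# From the summary: tools that "score above 50" are in the Established tier.
ESTABLISHED_THRESHOLD = 50


def above_threshold(tools, threshold=ESTABLISHED_THRESHOLD):
    """Return the tools whose score is strictly above the threshold."""
    return [tool for tool in tools if tool.score > threshold]


def top_rated(tools):
    """Return the tool with the highest score."""
    return max(tools, key=lambda tool: tool.score)


established = above_threshold(TOOLS)
best = top_rated(TOOLS)
print(f"{len(established)} of {len(TOOLS)} tools score above {ESTABLISHED_THRESHOLD}")
print(f"Highest-rated: {best.name} at {best.score}/100")
```

With the three rows shown, the count and the top-rated tool agree with the summary paragraph.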