LLM Data Labeling Transformer Models

Three LLM data labeling models are tracked. One scores above 50 (Established tier). The highest-rated is allenai/dolma at 65/100, with 1,447 stars and 6,035 monthly downloads.

Get all 3 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=llm-data-labeling&limit=20"

Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
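
For scripted access, the same request can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_url` and `fetch_projects` are hypothetical, and because the response schema is not described on this page, the decoded JSON is returned unchanged rather than parsed into specific fields.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Endpoint taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="transformers", subcategory="llm-data-labeling", limit=20):
    """Build the query URL with the same parameters as the curl command."""
    query = urlencode({"domain": domain, "subcategory": subcategory, "limit": limit})
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Send a GET request and decode the JSON body.

    The response shape is an unknown here, so the decoded value
    is returned as-is for the caller to inspect.
    """
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` issues one request, which counts against the daily limit, so caching the result locally is a sensible habit when experimenting.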

#  Model                              Score  Tier
1  allenai/dolma                      65     Established
   Data and tools for generating and inspecting OLMo pre-training data.
2  waikato-llm/llm-dataset-converter  42     Emerging
   For converting LLM datasets from one format into another.
3  refuel-ai/autolabel                37     Emerging
   Label, clean and enrich text datasets with LLMs.
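
The summary figures at the top of the page can be reproduced from the scores listed above. This is a small sketch with the three rows copied in by hand; the only tier rule stated on the page is that scores above 50 are Established, so the threshold constant reflects that and nothing is assumed about where the Emerging tier begins. The function names are hypothetical.

```python
# Rows copied from the listing: (model, score, tier).
PROJECTS = [
    ("allenai/dolma", 65, "Established"),
    ("waikato-llm/llm-dataset-converter", 42, "Emerging"),
    ("refuel-ai/autolabel", 37, "Emerging"),
]

# The page states that scores above 50 fall in the Established tier.
ESTABLISHED_THRESHOLD = 50


def established(projects, threshold=ESTABLISHED_THRESHOLD):
    """Return the names of projects scoring strictly above the threshold."""
    return [name for name, score, _tier in projects if score > threshold]


def top_project(projects):
    """Return the row with the highest score."""
    return max(projects, key=lambda row: row[1])
```

With the rows above, `established(PROJECTS)` yields a single entry and `top_project(PROJECTS)` returns the allenai/dolma row, matching the summary.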