hikariming/pindata
PinData is a modern, open-source dataset management platform designed specifically for large language model (LLM) training workflows
30 / 100
Emerging
No commits in the last 6 months.
No License
Stale 6m
No Package
No Dependents
Maintenance
2 / 25
Adoption
8 / 25
Maturity
7 / 25
Community
13 / 25
Stars
44
Forks
6
Language
TypeScript
License
—
Category
Last pushed
Jul 07, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/hikariming/pindata"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
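The same request can be made from a script. The following is a minimal sketch, not an official client: the helper names `build_url` and `fetch_quality` are hypothetical, and it assumes that other repositories follow the same `/<owner>/<repo>` path pattern as the curl example. The response format is not documented on this page, so the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def build_url(owner: str, repo: str) -> str:
    """Build the quality-score URL for one repository.

    Assumption: every repository uses the same <owner>/<repo> path
    layout as the hikariming/pindata example.
    """
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body as text (no API key is sent)."""
    with urlopen(build_url(owner, repo), timeout=timeout) as response:
        # Fall back to UTF-8 if the server does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset)
```

Calling `fetch_quality("hikariming", "pindata")` issues the same GET request as the curl command; keyless requests count against the 100 requests/day limit.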
Higher-rated alternatives
NVIDIA-NeMo/Curator
Scalable data preprocessing and curation toolkit for LLMs
74
MigoXLab/dingo
Dingo: A Comprehensive AI Data, Model and Application Quality Evaluation Tool
67
data-prep-kit/data-prep-kit
Open source project for data preparation for GenAI applications
64
cleanlab/cleanlab-studio
Client interface to Cleanlab Studio
56
TheDataStation/pneuma
LLM-Powered Data Discovery System for Tabular Data
46