H0NEYP0T-466/dataset-generator

🧠 DataForge — An intelligent LLM fine-tuning dataset generator that transforms raw prompts into structured ShareGPT-format training data through a multi-stage AI pipeline. Built with FastAPI + React/TypeScript, featuring real-time progress tracking via SSE, and FAISS-powered deduplication. Just prompt it, and watch your dataset build itself. ⚡

23
/ 100
Experimental
No Package No Dependents
Maintenance 13 / 25
Adoption 1 / 25
Maturity 9 / 25
Community 0 / 25

How are scores calculated?

Stars

1

Forks

Language

Python

License

MIT

Last pushed

Mar 13, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/H0NEYP0T-466/dataset-generator"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.