windblow32/DATE
Exploring the Heterogeneity of Tabular Data: A Diversity-aware Data Generator via LLMs
Stars
1
Forks
—
Language
Python
License
—
Category
Last pushed
Dec 26, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/windblow32/DATE"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
VikParuchuri/textbook_quality
Generate textbook-quality synthetic LLM pretraining data
dmanuel64/codablellm
A framework for creating and curating high-quality code datasets tailored for large language models
BhabhaAI/dataformer
Solving data for LLMs - Create quality synthetic datasets!
BothBosu/Synthetic-Data-for-Scam-Detection-Leveraging-LLMs-to-Train-Deep-Learning-Models
This repository contains the source code and synthetic datasets used in the research on scam...
iiis-ai/TemplateMath
[ICLR 2025 DATA-FM] Training and Evaluating Language Models with Template-based Data Generation...