databrickslabs/dbldatagen
Generate relevant synthetic data quickly for your projects. The Databricks Labs synthetic data generator (aka `dbldatagen`) may be used to generate large simulated / synthetic data sets for test, POCs, and other uses in Databricks environments including in Delta Live Tables pipelines
455 stars.
Stars
455
Forks
92
Language
Python
License
—
Last pushed
Apr 08, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/synthetic-data/databrickslabs/dbldatagen"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
benkeen/generatedata
A powerful, feature-rich, random test data generator.
sdv-dev/CTGAN
Conditional GAN for generating synthetic tabular data.
DexForce/EmbodiChain
An end-to-end, GPU-accelerated, and modular platform for building generalized Embodied Intelligence.
synthesized-io/tdk-demo
This is a collection of TDK demo projects that use different databases and options
Stranger6667/hypothesis-graphql
Generate arbitrary queries matching your GraphQL schema, and use them to verify your backend...