jkkummerfeld/text2sql-data

A collection of datasets that pair questions with SQL queries.

Score: 50 / 100 (Established)

Aggregates corrected versions of 10 diverse semantic parsing datasets (Academic, ATIS, Geography, Restaurants, Scholar, Spider, WikiSQL, IMDB, Yelp, Advising) with standardized schema, database instances, and variable annotations for improved evaluation methodology. Provides versioned releases with documented data fixes and contributions tracked via pull requests, enabling reproducible benchmarking across text-to-SQL systems while maintaining backward compatibility for published comparisons.

585 stars. No commits in the last 6 months.

Stale 6m, No Package, No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 24 / 25

Stars: 585
Forks: 116
Language: Python
License:
Last pushed: Mar 03, 2025
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/jkkummerfeld/text2sql-data"

Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
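
The same request can be made from Python. The sketch below is a minimal example using only the standard library, not an official client. The helper names `quality_url` and `fetch_quality` are hypothetical. The `category/owner/repo` path layout is inferred from the single URL shown above, and the code assumes the endpoint returns JSON, which this page does not state.

```python
import json
import urllib.request

# Base path taken from the curl example; everything after it is
# assumed to follow a category/owner/repo layout.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    return f"{BASE_URL}/{category}/{owner}/{repo}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Send a keyless GET request and decode the body.

    Assumes the response body is UTF-8 encoded JSON; adjust the
    decoding if the API returns something else.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("nlp", "jkkummerfeld", "text2sql-data")` issues the same request as the curl command. Keyless access is limited to 100 requests/day, so cache the result if you poll it repeatedly.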