styfeng/DataAug4NLP

Collection of papers and resources for data augmentation for NLP.


Experimental

This curated repository is organized into 15+ NLP task categories (text classification, translation, QA, sequence tagging, parsing, dialogue, multimodal, and more) and maps augmentation techniques to specific problem domains, with linked implementations and benchmark datasets. It is grounded in an ACL 2021 survey paper and accepts community contributions via pull requests. Practitioners can use it to identify task-appropriate augmentation strategies, ranging from backtranslation and synonym replacement to recent LLM-based approaches like GPT3Mix and automated augmentation methods.
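To make one of the techniques named above concrete, here is a minimal sketch of synonym replacement. It is not taken from the repository or from any paper it lists; the `SYNONYMS` table and the `synonym_replace` function are hypothetical names introduced for illustration, and a real pipeline would usually draw synonyms from a thesaurus such as WordNet rather than a hand-written dictionary.

```python
import random

# Hypothetical toy synonym table (an assumption for this sketch, not a real resource).
SYNONYMS = {
    "quick": ["fast", "rapid"],
    "happy": ["glad", "cheerful"],
    "movie": ["film"],
}


def synonym_replace(sentence, n=1, rng=None):
    """Return a copy of `sentence` with up to `n` words swapped for a synonym."""
    rng = rng or random.Random()
    tokens = sentence.split()
    # Positions whose lowercase form has an entry in the synonym table.
    candidates = [i for i, tok in enumerate(tokens) if tok.lower() in SYNONYMS]
    rng.shuffle(candidates)
    # Replace at most n of the eligible positions, chosen at random.
    for i in candidates[:n]:
        tokens[i] = rng.choice(SYNONYMS[tokens[i].lower()])
    return " ".join(tokens)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same variant
    print(synonym_replace("the quick movie made me happy", n=2, rng=rng))
```

Each call produces one augmented variant of the input sentence; calling it several times with different seeds would yield several variants that keep the original label. Words without a table entry are left unchanged, so the token count is preserved.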

831 stars. No commits in the last 6 months.

No License, Stale 6m, No Package, No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 1 / 25

Community 18 / 25


Stars: 831

Forks:

Language: —

License: —

Higher-rated alternatives

varunkumar-dev/TransformersDataAugmentation

Code associated with the "Data Augmentation using Pre-trained Transformer Models" paper

Akshint0407/Automated-Answer-Checker

AI-powered grading system for educators 🔹 Streamlit web app that automates answer sheet...

Anjum48/commonlitreadabilityprize

4th Place solution for the Kaggle CommonLit Readability Prize

yuchen0515/2022-Competition-CUDAOutOfMemory

Our team placed 6th out of 119 teams in E.SUN AI Open Competition Summer 2022 - ASR...

omerfarooq223/AutoGrader-Agent

AI agent that grades student assignments from a ZIP file using LLMs — generates rubrics, detects...
