uds-lsv/bert-stable-fine-tuning

On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines

/ 100

Emerging

Provides empirical analysis of fine-tuning instability across BERT, RoBERTa, and ALBERT on GLUE tasks, identifying vanishing gradients and generalization variance as root causes rather than catastrophic forgetting. Implements stable fine-tuning baselines built on Huggingface Transformers (v2.5.1) with reproducible Docker-based evaluation scripts. The codebase includes diagnostic tools for analyzing optimization dynamics and gradient behavior during training.

138 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 10 / 25

Maturity 16 / 25

Community 17 / 25

How are scores calculated?

Stars

138

Forks

Language

Python

License

Apache-2.0

Related tools

VanekPetr/flan-t5-text-classifier

Fine-tuning of Flan-5T LLM for text classification 🤖 focuses on adapting a state-of-the-art...

MeryylleA/lunariscodex

A high-performance PyTorch toolkit for pre-training modern, Llama-style language models. Based...

kingTLE/literary-alpaca2

从词表到微调这就是你所需的一切

RunxinXu/ChildTuning

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and...

aymen-000/predict-reconstruct-language-models

"Predict and Reconstruct: Joint Objectives for Self-Supervised Language Representation Learning"...

Explore NLP Tools

All categories Trending NLP directory Insights