qiangsiwei/bert_distill

BERT distillation (BERT-based distillation experiments)

41 / 100 (Emerging)

Implements knowledge distillation from BERT into lightweight student models (TextCNN, BiLSTM/GRU) using both Keras and PyTorch, following the approach in "Distilling Task-Specific Knowledge from BERT into Simple Neural Networks." The framework uses a 1:8:1 labeled-to-unlabeled-to-test data split and supports data augmentation techniques (masking, n-gram sampling) to improve student model performance on sentiment classification tasks, achieving ~87-88% accuracy compared to BERT's ~90-91% on the clothing dataset.
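The core of the approach is a distillation objective that trains the small student on the BERT teacher's output logits in addition to the hard labels. Below is a minimal PyTorch sketch of that objective, assuming a BiLSTM student and an equal weighting between the two terms; the class name, hyperparameters, and batch layout are illustrative and not taken from the repository's code.

import torch
import torch.nn as nn
import torch.nn.functional as F

class BiLSTMStudent(nn.Module):
    """Lightweight BiLSTM student mapping token ids to class logits."""
    def __init__(self, vocab_size, embed_dim=128, hidden_dim=256, num_classes=2):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True, bidirectional=True)
        self.fc = nn.Linear(2 * hidden_dim, num_classes)

    def forward(self, token_ids):
        x = self.embed(token_ids)              # (batch, seq, embed_dim)
        _, (h, _) = self.lstm(x)               # h: (2, batch, hidden_dim)
        h = torch.cat([h[-2], h[-1]], dim=-1)  # concat forward/backward final states
        return self.fc(h)                      # (batch, num_classes)

def distill_loss(student_logits, teacher_logits, labels, alpha=0.5):
    """Weighted sum of hard-label cross-entropy and MSE against the teacher's logits."""
    ce = F.cross_entropy(student_logits, labels)      # supervised signal on labeled data
    mse = F.mse_loss(student_logits, teacher_logits)  # soft signal from the BERT teacher
    return alpha * ce + (1.0 - alpha) * mse

On the unlabeled portion of the 1:8:1 split, only the teacher-logit term would apply, which is how augmented, teacher-labeled text can carry most of the training signal for the student.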

314 stars. No commits in the last 6 months.

No License · Stale (6 months) · No Package · No Dependents
Maintenance: 0 / 25
Adoption: 10 / 25
Maturity: 8 / 25
Community: 23 / 25


Stars: 314
Forks: 82
Language: Python
License: None
Last pushed: Jul 30, 2020
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/nlp/qiangsiwei/bert_distill"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
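The same endpoint can also be queried from Python; a small sketch using the requests library follows. Only the URL comes from this page, and the response fields are not documented here, so the example simply pretty-prints whatever JSON the API returns.

import json
import requests

URL = "https://pt-edge.onrender.com/api/v1/quality/nlp/qiangsiwei/bert_distill"

resp = requests.get(URL, timeout=10)  # anonymous access: 100 requests/day
resp.raise_for_status()
print(json.dumps(resp.json(), indent=2, ensure_ascii=False))  # field names depend on the API's schema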