BERT Model Implementations Transformer Models
PyTorch and framework-specific implementations of BERT and BERT-variant architectures (RoBERTa, DistilBERT, etc.), including pretraining, finetuning libraries, and language-specific BERT models. Does NOT include task-specific applications (NER, classification, QA), downstream finetuning notebooks, or non-BERT transformer implementations.
There are 68 bert model implementations models tracked. 2 score above 50 (established tier). The highest-rated is Tongjilibo/bert4torch at 67/100 with 1,335 stars and 180 monthly downloads.
Get all 68 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=transformers&subcategory=bert-model-implementations&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Model | Score | Tier |
|---|---|---|---|
| 1 |
Tongjilibo/bert4torch
An elegent pytorch implement of transformers |
|
Established |
| 2 |
nyu-mll/jiant
jiant is an nlp toolkit |
|
Established |
| 3 |
lonePatient/TorchBlocks
A PyTorch-based toolkit for natural language processing |
|
Emerging |
| 4 |
grammarly/gector
Official implementation of the papers "GECToR – Grammatical Error... |
|
Emerging |
| 5 |
monologg/JointBERT
Pytorch implementation of JointBERT: "BERT for Joint Intent Classification... |
|
Emerging |
| 6 |
backprop-ai/backprop
Backprop makes it simple to use, finetune, and deploy state-of-the-art ML models. |
|
Emerging |
| 7 |
appvision-ai/fast-bert
Super easy library for BERT based NLP models |
|
Emerging |
| 8 |
sagorbrur/bntransformer
Bengali transformer using transformers |
|
Emerging |
| 9 |
sagorbrur/bangla-bert
Bangla-Bert is a pretrained bert model for Bengali language |
|
Emerging |
| 10 |
voidful/TFkit
🤖📇 handling multiple nlp task in one pipeline |
|
Emerging |
| 11 |
taishi-i/nagisa_bert
A BERT model for nagisa |
|
Emerging |
| 12 |
gitabtion/BertBasedCorrectionModels
PyTorch impelementations of BERT-based Spelling Error Correction Models. ... |
|
Emerging |
| 13 |
dccuchile/beto
BETO - Spanish version of the BERT model |
|
Emerging |
| 14 |
iPieter/RobBERT
A Dutch RoBERTa-based language model |
|
Emerging |
| 15 |
gitabtion/SoftMaskedBert-PyTorch
🙈 An unofficial implementation of SoftMaskedBert based on huggingface/transformers. |
|
Emerging |
| 16 |
JetRunner/BERT-of-Theseus
⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT... |
|
Emerging |
| 17 |
menon92/BangalASR
Transformer based Bangla Speech Recognition | Encoder Decoder Architecture |
|
Emerging |
| 18 |
Ethan-yt/guwenbert
GuwenBERT: 古文预训练语言模型(古文BERT) A Pre-trained Language Model for Classical... |
|
Emerging |
| 19 |
ymcui/PERT
PERT: Pre-training BERT with Permuted Language Model |
|
Emerging |
| 20 |
JulesBelveze/bert-squeeze
🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ |
|
Emerging |
| 21 |
nlpaueb/greek-bert
A Greek edition of BERT pre-trained language model |
|
Emerging |
| 22 |
dbmdz/berts
DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models |
|
Emerging |
| 23 |
alexa/ramen
A software for transferring pre-trained English models to foreign languages |
|
Emerging |
| 24 |
rdenadai/BR-BERTo
Transformer model for Portuguese language (Brazil pt_BR) |
|
Emerging |
| 25 |
retarfi/language-pretraining
Pre-training Language Models for Japanese |
|
Experimental |
| 26 |
cakshat/AlloyBERT
Introducing AlloyBERT: a transformer encoder-based model for predicting... |
|
Experimental |
| 27 |
bnosac/golgotha
Contextualised Embeddings and Language Modelling using BERT and Friends using R |
|
Experimental |
| 28 |
TayeeChang/keras_transformers
the implement of transformer family such as bert, alber, roberta, nezha, etc. |
|
Experimental |
| 29 |
Beomi/exbert-transformers
exBERT on Transformers🤗 |
|
Experimental |
| 30 |
psychbruce/FMAT
😷 The Fill-Mask Association Test (FMAT): Measuring Propositions in Natural Language. |
|
Experimental |
| 31 |
shahrukhx01/bert-probe
BERT Probe: A python package for probing attention based robustness to... |
|
Experimental |
| 32 |
isaacus-dev/emubert-creator
The training code behind EmuBert, the largest open-source masked language... |
|
Experimental |
| 33 |
Beomi/KcBERT-Finetune
KcBERT/KcELECTRA Fine Tune Benchmarks code (forked from... |
|
Experimental |
| 34 |
HeegyuKim/language-model
한국어 언어 모델 학습을 위한 프로젝트(Flax, Pytorch with Huggingface Accelerate) |
|
Experimental |
| 35 |
ant-louis/netbert
📶 NetBERT: a domain-specific BERT model for computer networking. |
|
Experimental |
| 36 |
DomHudson/bert-in-production
A collection of resources on using BERT (https://arxiv.org/abs/1810.04805 )... |
|
Experimental |
| 37 |
AshutoshDongare/softskill-NER
Fine tuning 🤗 transformer model for softskill NER task |
|
Experimental |
| 38 |
asiff00/Bengali-Sentence-Error-Correction
Fine-tune mBart 50 for Bengali Sentence Error Correction |
|
Experimental |
| 39 |
gitabtion/ConvBert-PyTorch
🤗An unofficial PyTorch implementation of ConvBert based on huggingface/transformers. |
|
Experimental |
| 40 |
sagorbrur/fillblank
Fill The Blank |
|
Experimental |
| 41 |
PlanTL-GOB-ES/lm-biomedical-clinical-es
Official source for Spanish pretrained biomedical and clinical language... |
|
Experimental |
| 42 |
YRL-AIDA/RuTaBERT
RuTaBERT is a framework for solving column type and property annotation... |
|
Experimental |
| 43 |
Thisen-Ekanayake/HelaBERT
A compact BERT (6-layer) masked language model trained from scratch on a... |
|
Experimental |
| 44 |
phkhanhtrinh23/spelling_correction_project
This spelling correction project helps people fix English spelling mistakes.... |
|
Experimental |
| 45 |
haozhg/lmd
Language Model Decomposition: Quantifying the Dependency and Correlation of... |
|
Experimental |
| 46 |
Pchambet/NLP-from-scratch-to-BERT
End-to-end NLP in 4 notebooks: text preprocessing, TF-IDF,... |
|
Experimental |
| 47 |
lcl-hse/heptabot
A full-text error corrector for English based on transformers and deep learning |
|
Experimental |
| 48 |
Vidhyambika/Next-Word-Prediction-using-BERT-GPT
Predicting the next word for a sentence/word given using BERT |
|
Experimental |
| 49 |
RichardScottOZ/geoscience-transformers-for-predictive-mapping-of-critical-minerals
First pass paper implementation |
|
Experimental |
| 50 |
sfp932705/simple_bert
A pure pytorch from scratch implementation of BERT |
|
Experimental |
| 51 |
shreydan/masked-language-modeling
Transformers Pre-Training with MLM objective — implemented encoder-only... |
|
Experimental |
| 52 |
LennartKeller/roberta2longformer
Convert pretrained RoBerta models to various long-document transformer models |
|
Experimental |
| 53 |
ilanaliouchouche/KANBert
Implementation of an Encoder only MoE usable as an Embedding Model,... |
|
Experimental |
| 54 |
joshstephenson/MorphemeSegmentation
This is a survey of morpheme segmentation techniques including 2 baselines... |
|
Experimental |
| 55 |
Vincentiv/BERT_Finetuning_from_scratch
Notebook on finetuning BERT |
|
Experimental |
| 56 |
sappho192/ffxiv-ja-ko-translator
Japanese→Korean translator model specialized in Final Fantasy XIV based on... |
|
Experimental |
| 57 |
Sean652039/Token-Masking
Token Masking Regularization |
|
Experimental |
| 58 |
tejasvaidhyadev/ALBERT.jl
ALBERT(A Lite BERT for Self-Supervised Learning of Language Representations)... |
|
Experimental |
| 59 |
SumitM0432/XLM-RoBERTa-for-Textual-Entailment
A multilingual model XLM- RoBERTa for the textual entailment of sequence... |
|
Experimental |
| 60 |
DiFronzo/Multilingual-Models
mBERT and XLM-R for encodeing of Scandinavian languages |
|
Experimental |
| 61 |
teticio/inBERTolate
Hit your word count by using BERT to pad out your essays! |
|
Experimental |
| 62 |
mhmdsabry/BERT_with_Residual_vs_Highway
Comparing between residual stream and highway stream in transformers(BERT) . |
|
Experimental |
| 63 |
viktor-shcherb/vive_la_ner
The default way to fine-tune BERT is wrong. Here is why |
|
Experimental |
| 64 |
mdmmn378/spell-magic
Transformer Based Seq2Seq Model for Bangla Spell Correction |
|
Experimental |
| 65 |
UnkindGoose/MultiTask-NLP-model
Multitask model for NER and document-level classification. Project contains... |
|
Experimental |
| 66 |
davydantoniuk/grammarfix-bot
Fine-tuned a Hugging Face transformer model for grammar correction. |
|
Experimental |
| 67 |
gaolichen/simplebert
A simple implementation of transformer models with tensorflow/keras. |
|
Experimental |
| 68 |
cbstanley/dp-bert
Differential privacy with BERT model |
|
Experimental |