GPT Multilingual Training LLM Tools

Tools for training GPT models on non-English languages and domain-specific datasets (poetry, regional languages). Does NOT include general English-language GPT implementations, architecture education, or inference-only tools.

There are 112 gpt multilingual training tools tracked. 1 score above 70 (verified tier). The highest-rated is Nixtla/nixtla at 87/100 with 3,792 stars and 123,695 monthly downloads. 1 of the top 10 are actively maintained.

Get all 112 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=llm-tools&subcategory=gpt-multilingual-training&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

# Tool Score Tier
1 Nixtla/nixtla

TimeGPT-1: production ready pre-trained Time Series Foundation Model for...

87
Verified
2 andrewdalpino/NoPE-GPT

A GPT-style small language model (SLM) with no positional embeddings (NoPE).

48
Emerging
3 akanyaani/gpt-2-tensorflow2.0

OpenAI GPT2 pre-training and sequence prediction implementation in Tensorflow 2.0

42
Emerging
4 samkamau81/FinGPT_

FinGPT is an AI language model designed to understand and generate financial...

38
Emerging
5 VinAIResearch/PhoGPT

PhoGPT: Generative Pre-training for Vietnamese (2023)

37
Emerging
6 teddykoker/image-gpt

PyTorch Implementation of OpenAI's Image GPT

36
Emerging
7 sigdelsanjog/gptmed

pip install gptmed

35
Emerging
8 alibaba/graph-gpt

Generative Pre-trained Graph Eulerian Transformer [ICML2025]

35
Emerging
9 LIYUESEN/druggpt

DrugGPT: A GPT-based Strategy for Designing Potential Ligands Targeting...

35
Emerging
10 milmor/GPT

Implementation of Generative Pretrained Transformer Model in Tensorflow / Keras

33
Emerging
11 aaron-wheeler/MarketGPT

MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial...

33
Emerging
12 AspirinCode/iupacGPT

IUPAC-based large-scale molecular pre-trained model for property prediction...

31
Emerging
13 mkt1412/GraspGPT_public

code implementation of GraspGPT and FoundationGrasp

31
Emerging
14 hunar4321/reweight-gpt

Reweight GPT - a simple neural network using transformer architecture for...

31
Emerging
15 abhaskumarsinha/Corpus2GPT

Corpus2GPT: A project enabling users to train their own GPT models on...

27
Experimental
16 duydvu/gpt-j-6B-vietnamese-news-api

Vietnamese GPT-J API service deployed with Docker & Helm chart

27
Experimental
17 BanglaGPT/bangla-gpt

Training code for BanglaGPT model

26
Experimental
18 Koziev/verslibre

Using transformers to generate Russian poetry

26
Experimental
19 MetaTrustLabs/GPTScan

Indexing three datasets for GPTScan

25
Experimental
20 tufts-ml/G2PT

Graph generative pre-trained transformer

25
Experimental
21 brown-palm/AntGPT

Official code implemtation of paper AntGPT: Can Large Language Models Help...

25
Experimental
22 ariannamethod/postgpt

GPT with metaweights: weights that don't actually exist

25
Experimental
23 PointsCoder/GPT-Driver

Learning to Drive with GPT

25
Experimental
24 Rishikesh-Jadhav/Video-Compression-and-Future-Prediction-Using-GPT

This repository presents a project focused on advanced video compression and...

24
Experimental
25 n4ze3m/footgpt

FootGPT is a GPT-based language model for football news

24
Experimental
26 fraserlove/gpt-alpha

GPT-α is a 124 million parameter decoder-only language model following the...

24
Experimental
27 jmaczan/gpt

Generative Pre-trained Transformer in PyTorch from scratch

24
Experimental
28 MauroCE/DanteGPT

DanteGPT

24
Experimental
29 VachanVY/gpt.jax

Generative Pretrained Model (GPT) in JAX. A step by step guide to train LLMs...

24
Experimental
30 creatorrr/cryptgpt

Pretrain a model on ciphered text so only you can use it

24
Experimental
31 VanekPetr/my-own-GPT

A simple PyTorch re-implementation of the OpenAI GPT (Generative Pretrained...

24
Experimental
32 xavierzheng/pxgpt

Phenotype eXplore GPT. Use multimodel LLM for structural plant phenotyping

24
Experimental
33 PetropoulakisPanagiotis/gpt-practice

GPT code - I completed the tutorial for building GPT components by Andrej...

24
Experimental
34 berkerdemirel/GPT-from-scratch

Re-implementation of Andrej Karpathy's nanoGPT

23
Experimental
35 s4m-mo/tf-gpt

A TensorFlow implementation of GPT.

23
Experimental
36 xtreamsrl/build-your-own-gpt

Repository for the AMLD 2024 Workshop "At the cutting edge of Generative AI...

22
Experimental
37 amskit/in-naamgpt

Generate authentic-sounding Hindi names using a minimalist GPT built from...

22
Experimental
38 nguyenphuminh/planckgpt

Train a GPT from scratch on your laptop

22
Experimental
39 pguso/gpt-from-scratch

Implementation of a small GPT-style transformer from scratch in PyTorch....

22
Experimental
40 nirajsaran2/AdTextGeneration

Text Generator for Amazon Ads. Use Natural Language Generation (NLG)...

22
Experimental
41 aaditya29/GPT-2-124M-

Reproducing GPT-2 (124M) from Scratch in PyTorch

21
Experimental
42 gromdimon/beLLM

beLLM: GPT for belarusian language

21
Experimental
43 ksupasate/GPTForAnything

This is a repository aimed at promoting open-source contributions for...

21
Experimental
44 mkashirin/cattode

Lil GPT and BPE built from scratch using PyTorch.

20
Experimental
45 DanielPuentee/gpt-from-zero

Create your own GPT model from scratch

20
Experimental
46 jbxamora/reversenanogpt

A minimal character-level language model using Transformer architecture in PyTorch

20
Experimental
47 Pacatro/gpoetry

A tiny GPT model to generate spanish poetry

20
Experimental
48 muhammad-fiaz/gpt

A simple implementation based on the "Attention is All You Need" paper,...

20
Experimental
49 Laz4rz/GPT-2

Following Karpathy with GPT-2 implementation and training, writing lots of...

19
Experimental
50 LongpanZhou/Pat-GPT

This is a GPT model but brain rot rizzler... Implements standard GPT-2...

19
Experimental
51 Uokoroafor/gpt_from_scratch

This is a PyTorch implementation of a smaller version of the GPT model

19
Experimental
52 mjub/nlab-gpt

A small, custom GPT trained on nLab text, with an analysis of emergent...

19
Experimental
53 daparasyte/GPT-Models-Text-Generation

Google Colab demos on how to use lighter GPT model versions to generate text...

18
Experimental
54 saforem2/wordplay

Playing with words

18
Experimental
55 mohd-faizy/GPT1-From-Scratch

This project implements GPT-1 using PyTorch, focusing on foundational...

18
Experimental
56 jwchoi95/GPT_MLP

Official source codes for implementing "Accelerating materials language...

17
Experimental
57 antonio-f/GPT_from_scratch

Very simple implementation of GPT architecture using PyTorch and Jupyter.

17
Experimental
58 sumo1/gpt-reproduction-SFT-RLHF

OpenAI...

16
Experimental
59 ashishsalunkhe/DickensSpeaks

Text Generation trained on the Short Stories of Charles Dickens using RNN,...

16
Experimental
60 abdullateefv/PeptideGPT

GPT powered plugin & fine tuned model for natural language interaction with...

16
Experimental
61 zhoucaiNi/poet-gpt-2

Generative LLM specifically trained to generate poems This LLM uses...

16
Experimental
62 mytechnotalent/gpt_from_scratch

This notebook builds a complete GPT (Generative Pre-trained Transformer)...

16
Experimental
63 Michaelgathara/GPT

FineWeb-EDU trained Billion+ Parameter Model

16
Experimental
64 betogaona7/gptpose

GPT pose image generator to condition SD models with ControlNet OpenPose

16
Experimental
65 Ankit9424-prog/Nepali-GPT

A GPT-2 language model trained from scratch on Nepali text using PyTorch

16
Experimental
66 yogeshHax/Big_Dragon_Hatchling

BDH-Dragon: A custom dual-GPU Transformer model optimizing RoPE for...

15
Experimental
67 UgurKap/gpt-implementation

This repository contains my personal implementation and experiments while...

15
Experimental
68 billh0420/MathAssertGPT

Create a Generative Pretrained Transformer model for Metamath to generate...

15
Experimental
69 B4S1C-Coder/GPT-2-from-scratch

GPT-2 Implementation using only PyTorch and Tiktoken

15
Experimental
70 fostiropoulos/ReGPT

Code for our work published at ICJAI 2023 Workshop on Knowledge-Based...

15
Experimental
71 mcpeixoto/gpt

Implementation of a scalled down ChatGPT-like transformer pretraining using PyTorch

15
Experimental
72 iug-htw/GPTAndPrejudice

Research framework for training and interpreting a custom GPT-style language...

13
Experimental
73 ammarhydr/MobilityGPT

PyTorch implementation of MobilityGPT model: https://arxiv.org/abs/2402.03264

13
Experimental
74 kalvin807/sherlock

an attempt to generate code change from issue using LLM

13
Experimental
75 koayon/atp_star

PyTorch and NNsight implementation of AtP* (Kramar et al 2024, DeepMind)

12
Experimental
76 AlexGidiotis/gpt-light

The easiest repo for building GPT applications.

12
Experimental
77 HoangHao1009/hminiGPT

Pre-train GPT model by your txt

12
Experimental
78 derinworks/penr-oz-gpt-example

Implementation of an example GPT for understanding how next character is...

12
Experimental
79 sigdelsanjog/code-llm

pip install gptgpt

12
Experimental
80 SCCSMARTCODE/gpt2-from-scratch

A fundamental implementation of the GPT-2 architecture from scratch,...

12
Experimental
81 sszzz830/TensorFlow-in-GPT-4-advanced-data-analysis-mode

Install TensorFlow Lite on GPT-4 (advanced data analysis mode), or any other...

11
Experimental
82 HomebrewML/HomebrewNLP-MTF

HomebrewNLP in Mesh-TensorFlow flavour for distributed TPU training

11
Experimental
83 lewisnjue/gpt-2

gpt-2

11
Experimental
84 purang2/GPT

OpenAI GPT, Generative Pre-Training

11
Experimental
85 hnliu-git/GPTagger

GPTagger: Extract accurate text tags with the power of GPT.

11
Experimental
86 ravijo/TrumpGPT

A GPT model trained to mimic Donald Trump's style - just for fun

11
Experimental
87 ravijo/GPT101

Getting started with GPT for language modeling

11
Experimental
88 MrFishPL/gpt

I built this repo to prove to my granny that I can implement GPT.

11
Experimental
89 patrykniemczyk/gpt

A minimal from-scratch implementation of the GPT architecture with BPE...

11
Experimental
90 billh0420/ClaimGPT250203

Create a Generative Pretrained Transformer model for Metamath

11
Experimental
91 BlazeWild/GPT_FROM_SCRATCH

Minimal GPT implementation from scratch using PyTorch — trains a...

11
Experimental
92 alphatechlogics/FaseehGPT

FaseehGPT is an advanced pipeline for training a GPT-style language model...

11
Experimental
93 onemriganka/GPT-0.5m

A 0.5 million parameter character-level Transformer model in PyTorch, base...

11
Experimental
94 MariuszAndziak/baselineGPT

Basic GPT functionality written from scratch. Project made for educational...

11
Experimental
95 KunkelAlexander/lets-play-with-gpts

Explore GPTs based on a series of Youtube videos.

11
Experimental
96 suryanshgupta9933/Hindi-GPT

Hindi GPT is a transformer based language model trained on Hindi Oscar...

11
Experimental
97 Alvaro8gb/BERTvsGPT

GPT for medical entity recognition in Spanish

11
Experimental
98 den1ksk/GPTClimat

GPT model for climate analysis

10
Experimental
99 wansiqing1226/Younker_CISSP_GPT

A GPT model using fine-tuning techniques for the purpose of CISSP study.

10
Experimental
100 binoydipu/kobigpt

KobiGPT is a character-level GPT language model trained exclusively on the...

10
Experimental
101 azimonti/gpt-playground

My playground for Generative Pre-trained Transformer (GPT) implementation

10
Experimental
102 RISHIT7/GPT-From-Scratch

A repository that aims to model and train a GPT from scratch.

10
Experimental
103 Manas02/ScaffoldGPT

Scaffold Generative Pretraining

10
Experimental
104 MiSaengg/gunhee-RnD-space

R&D for datasets for book genres

10
Experimental
105 zilaeric/othello-gpt-probing

Training and exploration of linear probes into Othello-GPT by Li et al. (2022)

10
Experimental
106 mytechnotalent/ToyGPT

ToyGPT, inspired by Andrej Karpathy’s GPT from scratch, creates a toy...

10
Experimental
107 tedoaba/GPT-from-scratch

GPT Model from scratch

10
Experimental
108 fraserlove/gpt-base

GPT-base is a basic decoder-only language model following the architecture...

10
Experimental
109 hrithiksagar/Reproducing-GPT-2

Reproducing GPT-2 (124M) from scratch, following Mr. Karpathy's tutorial.

10
Experimental
110 ES7/GPT-from-Scratch

In this repository, I have created the GPT architecture, provided the code...

10
Experimental
111 abdussahid26/GPT-2-Model-from-Scratch-to-Generate-Text

Implementation of a GPT-2 model from scratch for text generation. This...

10
Experimental
112 iamNCJ/YuanGPT

GPT-like Large Language Model Pretrained on Inspur's Yuan Dataset

10
Experimental

Comparisons in this category