mytechnotalent/gpt_from_scratch
This notebook builds a complete GPT (Generative Pre-trained Transformer) model from scratch using PyTorch. It covers tokenization, self-attention, multi-head attention, transformer blocks, and text generation, all explained step by step with a simple nursery rhyme corpus.
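To give a flavor of the self-attention step the notebook covers, here is a minimal sketch of a single causal self-attention head in PyTorch. This is an illustration under assumed shapes (the dimensions and names like head_dim are hypothetical), not the notebook's actual code:

```python
import torch
import torch.nn.functional as F

# Minimal causal self-attention head (illustrative sketch, not the notebook's code).
# Assumed shapes: x is (batch, seq_len, embed_dim); head_dim is a hypothetical choice.
torch.manual_seed(0)
batch, seq_len, embed_dim, head_dim = 1, 8, 32, 16

x = torch.randn(batch, seq_len, embed_dim)
W_q = torch.nn.Linear(embed_dim, head_dim, bias=False)
W_k = torch.nn.Linear(embed_dim, head_dim, bias=False)
W_v = torch.nn.Linear(embed_dim, head_dim, bias=False)

q, k, v = W_q(x), W_k(x), W_v(x)                    # each (batch, seq_len, head_dim)
scores = q @ k.transpose(-2, -1) / head_dim ** 0.5  # scaled dot-product scores
mask = torch.tril(torch.ones(seq_len, seq_len))     # causal mask: no attending ahead
scores = scores.masked_fill(mask == 0, float("-inf"))
weights = F.softmax(scores, dim=-1)                 # attention weights sum to 1 per row
out = weights @ v                                   # (batch, seq_len, head_dim)
print(out.shape)                                    # torch.Size([1, 8, 16])
```

Multi-head attention, as covered in the notebook, runs several such heads in parallel and concatenates their outputs.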
Stars: 1
Forks: —
Language: Jupyter Notebook
License: MIT
Category:
Last pushed: Dec 16, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/mytechnotalent/gpt_from_scratch"
Open to everyone: 100 requests/day, no key needed. Get a free key for 1,000 requests/day.
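The same endpoint can be queried from Python. A minimal sketch, assuming the endpoint returns JSON (the response schema is not documented here):

```python
import requests

# Query the quality API for this repo (no key needed, up to 100 requests/day).
url = "https://pt-edge.onrender.com/api/v1/quality/llm-tools/mytechnotalent/gpt_from_scratch"
resp = requests.get(url, timeout=10)
resp.raise_for_status()
print(resp.json())  # payload assumed to be JSON; field names are not documented here
```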
Higher-rated alternatives
Nixtla/nixtla
TimeGPT-1: production-ready pre-trained Time Series Foundation Model for forecasting and...
akanyaani/gpt-2-tensorflow2.0
OpenAI GPT-2 pre-training and sequence prediction implementation in TensorFlow 2.0
andrewdalpino/NoPE-GPT
A GPT-style small language model (SLM) with no positional embeddings (NoPE).
VinAIResearch/PhoGPT
PhoGPT: Generative Pre-training for Vietnamese (2023)
teddykoker/image-gpt
PyTorch Implementation of OpenAI's Image GPT