ranpy13/Learning-LLM
Learning to build an LLM from scratch, following in the footsteps of rasbt/LLMs-from-scratch.
This project provides the code and guidance to build your own large language model (LLM) from the ground up. Starting from foundational code and training data, you end up with a custom, functional LLM capable of generating text and performing specific tasks. It is aimed at machine learning engineers, AI researchers, and data scientists who want to deepen their understanding of LLM architecture and training.
No commits in the last 6 months.
Use this if you are a machine learning practitioner who wants to learn the inner workings of large language models by implementing them yourself.
Not ideal if you are looking for a pre-built LLM to use out-of-the-box for applications without needing to understand its construction.
Stars
8
Forks
1
Language
Jupyter Notebook
License
—
Category
Last pushed
Sep 29, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/ranpy13/Learning-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
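The curl command above can also be scripted. A minimal Python sketch, assuming the endpoint returns JSON (the response field names are not documented here, so decode and inspect the result yourself):

```python
# Sketch of fetching the repository quality record via the API above.
# Assumption: the endpoint returns a JSON object; its exact fields are
# not documented on this page, so inspect the decoded dict yourself.
import json
import urllib.request

BASE = "https://pt-edge.onrender.com/api/v1/quality/transformers"


def quality_url(owner: str, repo: str) -> str:
    """Build the API endpoint URL for a given GitHub repository."""
    return f"{BASE}/{owner}/{repo}"


def fetch_quality(owner: str, repo: str) -> dict:
    """Fetch and decode the quality record (100 requests/day without a key)."""
    with urllib.request.urlopen(quality_url(owner, repo)) as resp:
        return json.load(resp)


# Example: the URL for this repository matches the curl command above.
url = quality_url("ranpy13", "Learning-LLM")
```

Calling `fetch_quality("ranpy13", "Learning-LLM")` performs the same request as the curl example; add your API key (in whatever form the service expects) if you need the 1,000/day quota.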
Higher-rated alternatives
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
mosaicml/llm-foundry
LLM training code for Databricks foundation models
rickiepark/llm-from-scratch
<밑바닥부터 만들면서 공부하는 LLM>(길벗, 2025)의 코드 저장소