srush/LLM-Training-Puzzles
What would you do with 1000 H100s...
A hands-on puzzle collection exploring distributed training primitives across thousands of GPUs, covering memory efficiency and compute pipelining strategies critical to large-scale model training. Each puzzle presents concrete challenges in multi-GPU coordination, requiring implementation of techniques like gradient accumulation, pipeline parallelism, and communication optimization. Designed for Colab execution with progressive difficulty, building on prior puzzle series covering GPU kernels, tensors, autodiff, and transformers.
1,157 stars. No commits in the last 6 months.
Stars
1,157
Forks
72
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jan 10, 2024
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/srush/LLM-Training-Puzzles"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
SepineTam/stata-mcp
Let LLM help you achieve your regression with Stata. Evolve from reg monkey to causal thinker.
datawhalechina/code-your-own-llm
一份全栈式大语言模型参考指南,用最简洁的代码帮助你端到端定义模型从零训练到工程落地的每一个细节
leonid20000/odin-slides
This is an advanced Python tool that empowers you to effortlessly draft customizable PowerPoint...
onejune2018/Awesome-LLM-Eval
Awesome-LLM-Eval: a curated list of tools, datasets/benchmark, demos, leaderboard, papers, docs...
R3gm/InsightSolver-Colab
InsightSolver: Colab notebooks for exploring and solving operational issues using deep learning,...