HuaizhengZhang/AI-Infra-from-Zero-to-Hero
🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.
Organizes research and practical implementations across the full ML/AI infrastructure stack—from data processing and training systems to LLM serving and domain-specific areas like federated learning and edge AI. Curates papers from top systems conferences (OSDI, NSDI, MLSys, SoCC) alongside implementation code, whitepaper guides, and video tutorials to bridge academic research with production deployment. Covers specialized infrastructure for LLMs, video systems, AutoML, GNNs, and reinforcement learning, enabling developers to understand architectural patterns for scaling AI workloads.
3,763 stars. No commits in the last 6 months.
Stars: 3,763
Forks: 370
Language: —
License: MIT
Category:
Last pushed: Jul 25, 2025
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/HuaizhengZhang/AI-Infra-from-Zero-to-Hero"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000/day.
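The same request as the curl command above can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical, and the code assumes the endpoint returns a JSON body, which this page does not confirm. No API key is sent, so the 100 requests/day limit would apply.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl command on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/llm-tools"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay valid.
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0):
    """Fetch the record for a repository.

    Assumption: the response body is JSON. If it is not, json.loads
    raises a ValueError and the caller can inspect the raw response.
    """
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_quality("HuaizhengZhang", "AI-Infra-from-Zero-to-Hero")` issues the same GET request as the curl command; the shape of the returned data is not documented here, so inspect it before relying on any particular field.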
Higher-rated alternatives
thu-pacman/chitu: High-performance inference framework for large language models, focusing on efficiency,...
NotPunchnox/rkllama: Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning...
sophgo/LLM-TPU: Run generative AI models in sophgo BM1684X/BM1688
Deep-Spark/DeepSparkHub: DeepSparkHub selects hundreds of application algorithms and models, covering various fields of...
tomdyson/microllama: The smallest possible LLM API