mosaicml/llm-foundry
LLM training code for Databricks foundation models
Implements end-to-end training, finetuning, evaluation, and inference pipelines with built-in support for efficiency techniques like Flash Attention and Mixture-of-Experts architectures. Integrates with Composer for distributed training optimization and MosaicML's platform for scalable workload orchestration, while supporting both HuggingFace and proprietary models (MPT, DBRX) from 125M to 132B parameters. Includes data preparation utilities for StreamingDataset format, inference export to ONNX/HuggingFace, and in-context learning evaluation on academic benchmarks.
4,397 stars and 4,165 monthly downloads. Available on PyPI.
Stars: 4,397
Forks: 584
Language: Python
License: Apache-2.0
Category:
Last pushed: Oct 27, 2025
Monthly downloads: 4,165
Commits (30d): 0
Dependencies: 22
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/mosaicml/llm-foundry"
Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
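The curl call above can be wrapped in a few lines of Python. Note this is a sketch: the endpoint layout (`/quality/<ecosystem>/<owner>/<repo>`) and the `transformers` ecosystem segment are inferred from the single example URL, not from documented API schema, and the `quality_url` helper name is ours.

```python
from urllib.parse import quote

# Base URL taken from the example curl command above.
API_BASE = "https://pt-edge.onrender.com/api/v1/quality"

def quality_url(ecosystem: str, owner: str, repo: str) -> str:
    """Build the quality-endpoint URL for a repository.

    Hypothetical helper: the path shape is inferred from the one
    example URL on this page, so other ecosystem values are an
    assumption.
    """
    return f"{API_BASE}/{quote(ecosystem)}/{quote(owner)}/{quote(repo)}"

url = quality_url("transformers", "mosaicml", "llm-foundry")
print(url)
# → https://pt-edge.onrender.com/api/v1/quality/transformers/mosaicml/llm-foundry
```

Actually fetching the JSON (for example with `urllib.request.urlopen(url)`) is left out, since the response schema is not shown on this page.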
Related models
AI-Hypercomputer/maxtext
A simple, performant and scalable Jax LLM!
mindspore-lab/mindnlp
MindSpore + 🤗Huggingface: Run any Transformers/Diffusers model on MindSpore with seamless...
rasbt/reasoning-from-scratch
Implement a reasoning LLM in PyTorch from scratch, step by step
rickiepark/llm-from-scratch
Code repository for the Korean-language book *LLM from Scratch: Learn by Building* (Gilbut, 2025)
rllm-team/rllm
Pytorch Library for Relational Table Learning with LLMs.