zjysteven/Awesome-Byte-LLM
A curated list of papers and resources on byte-based large language models (LLMs) โ models that operate directly on raw bytes.
This project is a curated list of research papers and resources related to byte-based large language models (LLMs). These models directly process raw digital information, such as text, audio, or images, without needing to convert it into tokens first. Data scientists, machine learning researchers, and AI engineers can use this to explore the latest advancements in creating more robust, versatile, and efficient AI models.
No commits in the last 6 months.
Use this if you are a researcher or engineer looking for academic papers and resources on cutting-edge byte-based large language models that avoid traditional tokenization.
Not ideal if you are looking for ready-to-use LLM applications or a guide on how to implement byte-based models in a production environment.
Stars
13
Forks
—
Language
—
License
—
Category
Last pushed
Jul 12, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/zjysteven/Awesome-Byte-LLM"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
PaddlePaddle/PaddleNLP
Easy-to-use and powerful LLM and SLM library with awesome model zoo.
meta-llama/llama-cookbook
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started...
arcee-ai/mergekit
Tools for merging pretrained large language models.
changyeyu/LLM-RL-Visualized
๐100+ ๅๅ LLM / RL ๅ็ๅพ๐๏ผใๅคงๆจกๅ็ฎๆณใไฝ่ ๅทจ็ฎ๏ผ๐ฅ๏ผ100+ LLM/RL Algorithm Maps ๏ผ
mindspore-lab/step_into_llm
MindSpore online courses: Step into LLM