Mbeeee111/tokenizer.cpp
📦 Optimize tokenization in C++ for HuggingFace models with a fast, production-ready library supporting BPE, WordPiece, and Unigram methods.
Stars
1
Forks
—
Language
C++
License
Apache-2.0
Category
Last pushed
Mar 13, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/Mbeeee111/tokenizer.cpp"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Higher-rated alternatives
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
megagonlabs/ginza-transformers
Use custom tokenizers in spacy-transformers
Kaleidophon/token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major...
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
Hugging-Face-Supporter/tftokenizers
Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels