mazebrr/language-tokenizer
🧩 Tokenize text efficiently across multiple languages using our robust library, combining Unicode and NLP techniques for accurate text analysis.
Stars: —
Forks: —
Language: Rust
License: WTFPL
Category:
Last pushed: Mar 13, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/transformers/mazebrr/language-tokenizer"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
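For scripted use, the same request can be sketched in Python with only the standard library. This is a minimal sketch under stated assumptions, not the service's documented client: the helper names quality_url and fetch_quality are hypothetical, the category/owner/repo path layout is inferred from the single URL shown above, and the response is assumed to be JSON.

```python
import json
import urllib.request
from urllib.parse import quote

# Base URL taken from the curl command shown above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository.

    Assumption: the path is /<category>/<owner>/<repo>, inferred from
    the one example URL on this page.
    """
    parts = (quote(part, safe="") for part in (category, owner, repo))
    return f"{BASE_URL}/" + "/".join(parts)


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """GET the endpoint and parse the body.

    Assumption: the service returns JSON; this page does not show the
    response format.
    """
    url = quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to make the request.
    print(quality_url("transformers", "mazebrr", "language-tokenizer"))
```

Each call to fetch_quality counts against the daily request limit stated above, so a script that polls many repositories may want to cache results locally.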
Higher-rated alternatives
huggingface/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
megagonlabs/ginza-transformers
Use custom tokenizers in spacy-transformers
Kaleidophon/token2index
A lightweight but powerful library to build token indices for NLP tasks, compatible with major...
NVIDIA/Cosmos-Tokenizer
A suite of image and video neural tokenizers
Hugging-Face-Supporter/tftokenizers
Use Huggingface Transformer and Tokenizers as Tensorflow Reusable SavedModels