unfoldingWord/string-punctuation-tokenizer
Small library that provides functions to tokenize a string into an array of words with or without punctuation
Used by 1 other package. No commits in the last 6 months. Available on npm.
Stars
8
Forks
1
Language
JavaScript
License
MIT
Category
Last pushed
Aug 09, 2023
Commits (30d)
0
Dependencies
1
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/unfoldingWord/string-punctuation-tokenizer"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
forzagreen/n2words
Convert numerical numbers to written numbers, in 52+ languages.
PyThaiNLP/nlpo3
Thai natural language processing library in Rust, with Python and Node bindings.
pemistahl/lingua-rs
The most accurate natural language detection library for Rust, suitable for short text and...
greyblake/whatlang-rs
Natural language detection library for Rust. Try demo online: https://whatlang.org/
wikimedia/sentencex
A sentence segmentation library with wide language support optimized for speed and utility.