IntuitionEngineeringTeam/chars2vec
Character-based word embeddings model based on RNN for handling real world texts
Uses LSTM layers to process character sequences, generating fixed-dimensional embeddings on-the-fly without storing a lookup dictionary—enabling robust handling of misspellings, slang, and out-of-vocabulary words. Trainable on custom datasets via contrastive learning with word pair similarity labels, with five pretrained English models (50-300 dimensions) available. Supports Python 2.7 and 3.0+ with a simple API for vectorizing words and training custom models.
174 stars. No commits in the last 6 months. Available on PyPI.
Stars
174
Forks
39
Language
Python
License
Apache-2.0
Category
Last pushed
Oct 09, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/IntuitionEngineeringTeam/chars2vec"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
shibing624/text2vec
text2vec, text to vector....
ddangelov/Top2Vec
Top2Vec learns jointly embedded topic, document and word vectors.
predict-idlab/pyRDF2Vec
🐍 Python Implementation and Extension of RDF2Vec
IITH-Compilers/IR2Vec
Implementation of IR2Vec, LLVM IR Based Scalable Program Embeddings
stephantul/reach
Load embeddings and featurize your sentences.