IntuitionEngineeringTeam/chars2vec

Character-based word embeddings model based on RNN for handling real world texts

56
/ 100
Established

Uses LSTM layers to process character sequences, generating fixed-dimensional embeddings on-the-fly without storing a lookup dictionary—enabling robust handling of misspellings, slang, and out-of-vocabulary words. Trainable on custom datasets via contrastive learning with word pair similarity labels, with five pretrained English models (50-300 dimensions) available. Supports Python 2.7 and 3.0+ with a simple API for vectorizing words and training custom models.

174 stars. No commits in the last 6 months. Available on PyPI.

Stale 6m No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 25 / 25
Community 21 / 25

How are scores calculated?

Stars

174

Forks

39

Language

Python

License

Apache-2.0

Last pushed

Oct 09, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/IntuitionEngineeringTeam/chars2vec"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.