yannvgn/laserembeddings
LASER multilingual sentence embeddings as a pip package
Replaces heavy C++ dependencies (Moses, fastBPE) with pure-Python alternatives (Sacremoses, subword-nmt) for zero-config installation. Produces language-agnostic 1024-dimensional vectors using Facebook Research's pre-trained encoder, enabling zero-shot cross-lingual transfer and multilingual semantic similarity tasks. Supports 93+ languages with optional tokenization extras for Chinese (jieba) and Japanese (MeCab).
224 stars and 2,671 monthly downloads. Used by 1 other package. No commits in the last 6 months. Available on PyPI.
Stars
224
Forks
29
Language
Python
License
BSD-3-Clause
Category
Last pushed
Aug 11, 2023
Monthly downloads
2,671
Commits (30d)
0
Dependencies
5
Reverse dependents
1
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/yannvgn/laserembeddings"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Featured in
Related tools
embeddings-benchmark/mteb
MTEB: Massive Text Embedding Benchmark
harmonydata/harmony
The Harmony Python library: a research tool for psychologists to harmonise data and...
embeddings-benchmark/results
Data for the MTEB leaderboard
fresh-stack/freshstack
This repository helps you evaluate your models on the FreshStack benchmark!
Hironsan/awesome-embedding-models
A curated list of awesome embedding models tutorials, projects and communities.