FlagOpen/FlagEmbedding
Retrieval and Retrieval-augmented LLMs
Provides dense, sparse, and multi-vector embedding models (including BGE-M3 supporting 100+ languages and 8K context) alongside rerankers and multimodal variants for comprehensive semantic search and RAG pipelines. Built on transformer architectures with support for in-context learning, token compression, and unified retrieval methods—integrates seamlessly with vector databases and LLM frameworks via HuggingFace.
11,395 stars. Used by 10 other packages. Actively maintained with 17 commits in the last 30 days. Available on PyPI.
Stars
11,395
Forks
842
Language
Python
License
MIT
Category
Last pushed
Mar 10, 2026
Commits (30d)
17
Dependencies
9
Reverse dependents
10
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/FlagOpen/FlagEmbedding"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related tools
Blaizzy/mlx-embeddings
MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...
qdrant/fastembed
Fast, Accurate, Lightweight Python library to make State of the Art Embedding
Merck/Sapiens
Sapiens is a human antibody language model based on BERT.
amansrivastava17/embedding-as-service
One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques
IlyasMoutawwakil/py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI...