jina-ai/mlx-retrieval

Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX

/ 100

Emerging

Implements LoRA fine-tuning with contrastive losses (InfoNCE, NT-Xent) and hard negative mining, leveraging MLX Data for efficient streaming from local JSONL or Elasticsearch sources. Integrates with MTEB for evaluation and Weights & Biases for experiment tracking, supporting gradient accumulation to simulate large batch sizes on resource-constrained Apple Silicon hardware. Uses query/document prompt tokens with mean pooling for embedding generation, mirroring Jina's v3/v4 architectures.

177 stars. No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 2 / 25

Adoption 10 / 25

Maturity 15 / 25

Community 10 / 25

How are scores calculated?

Stars

177

Forks

Language

Python

License

Apache-2.0

Featured in

Embeddings Are Easier Than Whatever You're Doing Instead

Higher-rated alternatives

FlagOpen/FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Blaizzy/mlx-embeddings

MLX-Embeddings is the best package for running Vision and Language Embedding models locally on...

qdrant/fastembed

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Merck/Sapiens

Sapiens is a human antibody language model based on BERT.

amansrivastava17/embedding-as-service

One-Stop Solution to encode sentence to fixed length vectors from various embedding techniques

Explore Embedding Tools

All categories Trending Embeddings directory Insights