unum-cloud/UForm

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ๐Ÿ”œ video, up to 5x faster than OpenAI CLIP and LLaVA ๐Ÿ–ผ๏ธ & ๐Ÿ–‹๏ธ

58
/ 100
Established

Based on the README, here's a technical summary: Combines Matryoshka-style embeddings down to 64 dimensions with quantization-aware training for fast semantic search via USearch, while generative models leverage ViT encoders paired with compact language models (Qwen, LLaMA) for image captioning and VQA. Exports native ONNX models with bfloat16 support across Python, JavaScript, and Swift for edge deployment from servers to mobile devices.

1,221 stars and 2,280 monthly downloads. Available on PyPI.

Maintenance 6 / 25
Adoption 18 / 25
Maturity 18 / 25
Community 16 / 25

How are scores calculated?

Stars

1,221

Forks

76

Language

Python

License

Apache-2.0

Last pushed

Oct 30, 2025

Monthly downloads

2,280

Commits (30d)

0

Dependencies

4

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/unum-cloud/UForm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.