towhee-io/towhee
Towhee is a framework dedicated to making neural data processing pipelines simple and fast.
Supports multimodal unstructured data (text, images, video, audio) with 140+ pre-built operators spanning CV, NLP, and audio domains using a Pythonic method-chaining API. Leverages LLM-based pipeline orchestration with prompt management and knowledge retrieval, while offering pre-configured ETL pipelines for RAG, image search, and video deduplication. Can compile Python pipelines to high-performance Docker containers via Triton Inference Server, supporting TensorRT, PyTorch, and ONNX backends for CPU/GPU deployment.
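The method-chaining style mentioned above can be illustrated with a minimal sketch in plain Python. This does not use Towhee: the `ChainPipeline` class, its `map` and `run` methods, and the two toy operators are hypothetical names chosen for illustration, and Towhee's real operators and signatures should be taken from its own documentation.

```python
from typing import Any, Callable, Dict, List, Tuple


class ChainPipeline:
    """Hypothetical stand-in for a method-chaining pipeline (not Towhee's API)."""

    def __init__(self, input_name: str) -> None:
        self._input_name = input_name
        # Each step reads one named column and writes another.
        self._steps: List[Tuple[str, str, Callable[[Any], Any]]] = []

    def map(self, src: str, dst: str, op: Callable[[Any], Any]) -> "ChainPipeline":
        """Register an operator and return self so calls can be chained."""
        self._steps.append((src, dst, op))
        return self

    def run(self, value: Any, output: str) -> Any:
        """Apply the registered steps in order and return the requested column."""
        columns: Dict[str, Any] = {self._input_name: value}
        for src, dst, op in self._steps:
            columns[dst] = op(columns[src])
        return columns[output]


def tokenize(text: str) -> List[str]:
    # Toy "operator": lowercase the text and split on whitespace.
    return text.lower().split()


def count_tokens(tokens: List[str]) -> int:
    # Toy "operator": count the tokens produced by the previous step.
    return len(tokens)


pipeline = (
    ChainPipeline("text")
    .map("text", "tokens", tokenize)
    .map("tokens", "count", count_tokens)
)
result = pipeline.run("Neural data processing made simple", output="count")
print(result)
```

Because each `map` call returns the pipeline itself, steps read top to bottom in the order they execute, and intermediate results stay addressable by name.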
3,458 stars. No commits in the last 6 months. Available on PyPI.
Stars: 3,458
Forks: 262
Language: Python
License: Apache-2.0
Category:
Last pushed: Oct 18, 2024
Commits (30d): 0
Dependencies: 9
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/embeddings/towhee-io/towhee"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
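The same request can be sketched in Python using only the standard library. The helper names `quality_url` and `fetch_quality` are hypothetical. The sketch also assumes that other repositories follow the same `owner/repo` URL pattern as the example above, and it returns the response body as raw text because the response format is not documented here.

```python
import urllib.request
from urllib.parse import quote

# Base path copied from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality/embeddings"


def quality_url(owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (URL pattern is an assumption)."""
    return f"{BASE_URL}/{quote(owner, safe='')}/{quote(repo, safe='')}"


def fetch_quality(owner: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the endpoint without an API key and return the body as text."""
    with urllib.request.urlopen(quality_url(owner, repo), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_quality("towhee-io", "towhee")` issues one request, which counts against the keyless daily limit.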
Related tools
deepset-ai/haystack-tutorials
Here you can find all the Tutorials for Haystack 📓
unum-cloud/USearch
Fast Open-Source Search & Clustering engine × for Vectors & Arbitrary Objects × in C++, C,...
aryn-ai/sycamore
🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
MaartenGr/PolyFuzz
Fuzzy string matching, grouping, and evaluation.
pingcap/pytidb
TiDB AI SDK: Unified Multi-Modal Data Platform for AI Apps & Agents - https://pingcap.github.io/ai/