av1d/rk3588_npu_llm_server
Provides HTTP access to an LLM running on the RK3588 NPU. Returns a JSON response.
32 / 100
Emerging
No commits in the last 6 months.
Archived
Stale 6m
No Package
No Dependents
Maintenance: 0 / 25
Adoption: 7 / 25
Maturity: 9 / 25
Community: 16 / 25
Stars: 28
Forks: 6
Language: C++
License: MIT
Category
Last pushed: May 29, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/av1d/rk3588_npu_llm_server"
Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
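The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the helper names `build_quality_url` and `fetch_quality` are hypothetical, the path layout (category, owner, repository) is inferred from the single curl example above, and the response is assumed to be JSON, which this page does not confirm.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL, assuming the /<category>/<owner>/<repo> layout
    seen in the curl example (an inference, not documented behavior)."""
    parts = [quote(part, safe="") for part in (category, owner, repo)]
    return f"{BASE_URL}/{'/'.join(parts)}"


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch the quality record and decode it as JSON.

    The response schema is not described on this page, so the decoded
    object is returned unchanged rather than mapped to named fields.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    # Prints the URL only; call fetch_quality(...) to perform the request.
    print(build_quality_url("llm-tools", "av1d", "rk3588_npu_llm_server"))
```

Because the keyless tier is limited to 100 requests/day, it is probably worth caching the result of `fetch_quality` rather than calling it repeatedly.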
Higher-rated alternatives
thu-pacman/chitu
High-performance inference framework for large language models, focusing on efficiency,...
85
sophgo/LLM-TPU
Run generative AI models in sophgo BM1684X/BM1688
60
NotPunchnox/rkllama
Ollama alternative for Rockchip NPU: An efficient solution for running AI and Deep learning...
60
Deep-Spark/DeepSparkHub
DeepSparkHub selects hundreds of application algorithms and models, covering various fields of...
56
tomdyson/microllama
The smallest possible LLM API
52