Frikallo/parakeet.cpp
Ultra fast and portable Parakeet implementation for on-device inference in C++ using Axiom with MPS+Unified Memory
Supports multiple decoder types (CTC, TDT, RNNT) with optional beam search and ARPA language model fusion, plus phrase boosting for domain-specific vocabulary. Built entirely in C++ on Axiom's Metal tensor compiler without external runtimes; encoder inference fuses into optimized MPSGraph operations. Offers streaming models with configurable latency, speaker diarization, VAD preprocessing, and multi-format audio input (WAV/FLAC/MP3/OGG), plus a flat C API for FFI bindings to Python, Swift, Go, and Rust.
Stars: 244
Forks: 7
Language: C++
License: MIT
Category:
Last pushed: Mar 15, 2026
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Frikallo/parakeet.cpp"
Open to everyone: 100 requests/day with no key needed, or 1,000 requests/day with a free key.
Higher-rated alternatives
shibing624/parrots
Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy
MainRo/deepspeech-server
A testing server for a speech-to-text service based on coqui.ai
altunenes/parakeet-rs
Very fast speech-to-text, diarization, and streaming (even on CPU) with NVIDIA Parakeet in Rust
thewh1teagle/pyannote-rs
pyannote audio diarization in Rust
PaddlePaddle/Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS,...