Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

54
/ 100
Established

Provides modular audio AI capabilities spanning text-to-speech, speech-to-text, voice activity detection, speaker diarization, and speech enhancement via MLX inference on Apple Silicon. Built as composable Swift packages with streaming support and automatic HuggingFace model loading, it integrates codecs (SNAC, Encodec, Vocos) and supports multiple model families (Qwen3, Fish Audio, Soprano, Voxtral, Sortformer) with native async/await APIs.

446 stars.

No Package No Dependents
Maintenance 13 / 25
Adoption 10 / 25
Maturity 13 / 25
Community 18 / 25

How are scores calculated?

Stars

446

Forks

56

Language

Swift

License

MIT

Last pushed

Mar 17, 2026

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/Blaizzy/mlx-audio-swift"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.