Blaizzy/mlx-audio-swift

A modular Swift SDK for audio processing with MLX on Apple Silicon

/ 100

Established

Provides modular audio AI capabilities spanning text-to-speech, speech-to-text, voice activity detection, speaker diarization, and speech enhancement via MLX inference on Apple Silicon. Built as composable Swift packages with streaming support and automatic HuggingFace model loading, it integrates codecs (SNAC, Encodec, Vocos) and supports multiple model families (Qwen3, Fish Audio, Soprano, Voxtral, Sortformer) with native async/await APIs.

446 stars.

No Package No Dependents

Maintenance 13 / 25

Adoption 10 / 25

Maturity 13 / 25

Community 18 / 25

How are scores calculated?

Stars

446

Forks

Language

Swift

License

MIT

Compare

mlx-audio-swift and speech-swift mlx-audio-swift and mlx-swift-asr

Related tools

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...

FluidInference/FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...

phuc-nt/my-translator

Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only

pot-app/pot-desktop

🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.

soniqo/speech-swift

AI speech toolkit for Apple Silicon — ASR, TTS, speech-to-speech, VAD, and diarization powered...

Explore Voice AI Tools

All categories Trending Voice AI directory Insights