AppDevGuy/OSSSpeechKit
OSSSpeechKit is a native iOS speech wrapper around Apple's AVFoundation and Speech frameworks.
It provides simplified multi-language voice selection and unified speech-to-text (STT) and text-to-speech (TTS) APIs built on the AVFoundation and Speech frameworks, supporting 47 languages with on-device recognition (iOS 13+). A lightweight `OSSVoice` abstraction hides Apple's complex voice enumeration logic and handles language-country locale mapping and voice quality tiers. Both TTS and STT take minimal code, and speech rate, pitch, and volume can be fine-tuned through `OSSUtterance` parameters.
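To illustrate the rate, pitch, and volume tuning mentioned above, here is a minimal sketch written against plain AVFoundation rather than OSSSpeechKit's own API, which this listing does not show. `SpeechTuning` and `makeUtterance` are hypothetical names introduced only for this example. The ranges are the ones AVFoundation documents for `AVSpeechUtterance`. How `OSSUtterance` exposes these values may differ.

```swift
import Foundation
#if canImport(AVFoundation)
import AVFoundation
#endif

/// Hypothetical helper (not part of OSSSpeechKit): holds the three tuning
/// values and clamps them to the ranges documented for AVSpeechUtterance.
struct SpeechTuning {
    let rate: Float    // 0.0 ... 1.0, AVFoundation default is 0.5
    let pitch: Float   // 0.5 ... 2.0, AVFoundation default is 1.0
    let volume: Float  // 0.0 ... 1.0, AVFoundation default is 1.0

    init(rate: Float = 0.5, pitch: Float = 1.0, volume: Float = 1.0) {
        self.rate = min(max(rate, 0.0), 1.0)
        self.pitch = min(max(pitch, 0.5), 2.0)
        self.volume = min(max(volume, 0.0), 1.0)
    }
}

#if canImport(AVFoundation)
/// Builds an utterance using only AVFoundation calls.
/// `languageCode` is a BCP 47 code such as "en-AU".
func makeUtterance(_ text: String,
                   languageCode: String,
                   tuning: SpeechTuning) -> AVSpeechUtterance {
    let utterance = AVSpeechUtterance(string: text)
    // The initializer is failable; a nil voice falls back to the system default.
    utterance.voice = AVSpeechSynthesisVoice(language: languageCode)
    utterance.rate = tuning.rate
    utterance.pitchMultiplier = tuning.pitch
    utterance.volume = tuning.volume
    return utterance
}
#endif

// Out-of-range inputs are clamped instead of being passed through.
let tuning = SpeechTuning(rate: 0.45, pitch: 3.0, volume: 0.8)
print(tuning.pitch)
```

On an Apple platform, the resulting utterance would be passed to `AVSpeechSynthesizer.speak(_:)`. The synthesizer has to be kept alive (for example as a property) for the duration of playback.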
181 stars. No commits in the last 6 months.
Stars: 181
Forks: 42
Language: Swift
License: MIT
Category:
Last pushed: Apr 22, 2024
Commits (30d): 0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/voice-ai/AppDevGuy/OSSSpeechKit"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
k2-fsa/sherpa-ncnn
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn...
FluidInference/FluidAudio
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity...
phuc-nt/my-translator
Real-time speech translation — macOS & Windows, free TTS, no server, your API keys only
Blaizzy/mlx-audio-swift
A modular Swift SDK for audio processing with MLX on Apple Silicon
pot-app/pot-desktop
🌈 Cross-platform software for translating selected text and OCR.