All Voice AI Tools

6,981 tools ranked by quality score · Page 26 of 70

Showing 2501–2600 of 6,981
# Tool Score Tier
2501 mpoyraz/wav2vec2-turkish

Turkish Speech Recognition using Facebook's Wav2vec 2.0 models

29
Experimental
2502 6Morpheus6/Chattered

All in one Gradio interface for chatterbox. Voice cloning from uploaded...

29
Experimental
2503 hay/audio2text

Python command line utility wrappers for Whispercpp and other speech-to-text...

29
Experimental
2504 parzibyte/tts-js

Demo of speechSynthesis with JavaScript: TTS, or speech synthesis

29
Experimental
2505 robinhad/voice-recognition-ua

Training scripts for Speech-To-Text models for Ukrainian language

29
Experimental
2506 Hamahmi/kaldi-tut

This is a Kaldi tutorial for beginners

29
Experimental
2507 ARK018/multi-voice-sdk

A universal Text-to-Speech (TTS) SDK. Easily generate and manage audio...

29
Experimental
2508 marcominerva/TranslatorService

A lightweight library that uses Cognitive Translator Service for text...

29
Experimental
2509 DillionLowry/NeuralCodecs

Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia

29
Experimental
2510 javichur/fitness-voice

AI voice-controlled trainer in your web browser, using NLP (wit.ai), body...

29
Experimental
2511 lord-lethris/ComfyUI-lethris-dia2

ComfyUI custom nodes for the Dia2 TTS model — generate speech, timestamps,...

29
Experimental
2512 GuruCharan94/az-podcast-transcriber

A podcast transcription service built on Azure that transcribes any new...

29
Experimental
2513 themaxdit1175/soundpad-download-plus-subscription

Get Soundpad Download Plus on GitHub: a complete, high-performance toolkit...

29
Experimental
2514 aflr-archive/apiaudio-python

api.audio Python SDK

29
Experimental
2515 speechly/browser-client-example

A demo app showcasing Speechly browser-client and detailed api responses.

29
Experimental
2516 ElsebaiyMohamed/Modablag

This project presents a comprehensive study on video dubbing techniques and...

29
Experimental
2517 techiaith/docker-huggingface-stt-cy

Welsh speech recognition for the Welsh language with HuggingFace // Speech...

29
Experimental
2518 morfeusys/porfir

Porfirevich voice assistant

29
Experimental
2519 azu/vscode-read-aloud-text

VSCode extension that reads text aloud, such as Markdown and text, etc...

29
Experimental
2520 StanGirard/quivr-whisper

Talk to your second brain personal assistant using speech 🧠

29
Experimental
2521 LianjiaTech/bella-whisper

bella-whisper is a series of OpenAI-based...

29
Experimental
2522 Fraunhofer-AISEC/towards-resistant-audio-adversarial-examples

Generation tool for offset-resistant audio adversarial examples against DeepSpeech

29
Experimental
2523 alecokas/BiLatticeRNN-Confidence

Confidence Estimation for Black Box Automatic Speech Recognition Systems...

29
Experimental
2524 mikex86/DeepSpeech-Java-Bindings

Java Bindings for the C++ library DeepSpeech

29
Experimental
2525 prohetamine/tor-speech

🔉 Yandex & Google + Tor

29
Experimental
2526 vietai/ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

29
Experimental
2527 nidi3/swiss-wowbagger

Let yourself be insulted in Swiss German. Nicer cursing in Bernese German.

29
Experimental
2528 simalexan/speechy

Voice command tool for an easy web speech recognition for your web...

29
Experimental
2529 amitpatil321/VoiceForm

Voice-controlled form that can be filled, cleared, and submitted using only...

29
Experimental
2530 nalbion/whisper-server

streaming speech to text server using Whisper

28
Experimental
2531 FS-17/SpeechDataBuilder

Browser-based open-source tool for creating high-quality TTS/STT datasets....

28
Experimental
2532 AceCentre/pasco

Phrase Auditory Scanning COmmunicator - AAC App for iOS and the Web

28
Experimental
2533 taeyoun811/Whisfusion

Whisfusion: Parallel ASR Decoding via a Diffusion Transformer

28
Experimental
2534 yc9701/pansori-tedxkr-corpus

Korean ASR Corpus generated from TEDx talks

28
Experimental
2535 heyfoz/python-youtube-transcription

This repository contains Python scripts and a local Flask web application...

28
Experimental
2536 jianchang512/parakeet-api

A local speech transcription service based on the NVIDIA Parakeet-tdt-0.6b model. It provides an OpenAI API-compatible interface and a simple web user interface.

28
Experimental
2537 Leonard2310/LibrAI

iOS app with AI for an immersive audiobook experience, text-to-speech and...

28
Experimental
2538 jreremy/conformer

PyTorch implementation of Conformer with training script for end-to-end...

28
Experimental
2539 cadia-lvl/WebRICE

WebRICE (Web Reader ICE) is an open source web reader in development at...

28
Experimental
2540 Forced-Alignment-and-Vowel-Extraction/fave-asr

Interface for automated transcription and time alignment of conversational...

28
Experimental
2541 alexogeny/cortana

Your own personal assistant thanks to chat-gpt, whisper, and elevenlabs tts

28
Experimental
2542 maetshju/flux-blstm-implementation

An implementation of the Graves & Schmidhuber (2005) bidirectional LSTM in Flux.

28
Experimental
2543 nearkyh/AWS-Polly

How to use Amazon Polly TTS (Text To Speech)

28
Experimental
2544 theamazing0/global-subtitles-main

Closed Captioning Everywhere, With Assembly AI

28
Experimental
2545 speechly/react-client

A React client library for the Speechly API

28
Experimental
2546 noir-neo/UniSpeech

iOS speech framework native plugin for Unity

28
Experimental
2547 BleachDev/tts-grabber

Every Google, Azure & IBM text to speech voice for free.

28
Experimental
2548 cosmoquester/speech-recognition

Develop speech recognition models with TensorFlow 2

28
Experimental
2549 M86xKC/edge-tts

Simple TTS using MS Edge built-in voices

28
Experimental
2550 18F/tts-buy-cloudgov-vulnerability-scanner

Solicitation and acquisition documents created for the cloud.gov...

28
Experimental
2551 yokawasa/vscode-translator-voice

VS Code extension for multi-language text translation and TTS...

28
Experimental
2552 tasmirz/EyeWear

Eyewear with OCR and live WebRTC based calling for the visually impaired....

28
Experimental
2553 Ralireza/spoken-digit-recognition

Classifying English spoken digits with a Hidden Markov Model

28
Experimental
2554 Tristan296/Universal-MacAssistant

Advanced Personal Assistant created for macOS that utilises AppleScripts,...

28
Experimental
2555 Lqm1/openai-workers-ai

A Cloudflare Workers-based, OpenAI-compatible API project that provides...

28
Experimental
2556 Sukumar9944/Speech-to-Text-with-ChatGPT

This Python application combines speech recognition with the power of...

28
Experimental
2557 matusstas/openai-whisper-microservice

This is an OpenAI Whisper automatic speech recognition microservice

28
Experimental
2558 yanorei32/aitalked

W.I.P. GynoidTalk / VOICEROID2 Low-Level Rust Binding Library based on...

28
Experimental
2559 chienhsiang-hung/voice-and-wav-cloning

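Generates high-quality voice and video clones from a small number of voice and video samples (AI talking-head narration generation), and offers a range of audio processing techniques to improve sound quality and realism.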

28
Experimental
2560 hi-paris/wavlm-vocoder-french

WavLM-to-Audio neural vocoder for French speech reconstruction — layer...

28
Experimental
2561 ryanlintott/OEVoice

Old English text-to-speech using AVSpeechSynthesis and IPA pronunciations.

28
Experimental
2562 adhadse/Deepdubpy

A complete end-to-end Deep Learning system to generate high quality human...

28
Experimental
2563 ActiveNick/Unity-SpeechWithLUIS

Sample Unity project used to demonstrate the integration of Speech...

28
Experimental
2564 fikrikarim/volocal

Fully local voice AI for iOS

28
Experimental
2565 SohamRatnaparkhi/Voice-Assistant

Voice Assistant coded in Python!

28
Experimental
2566 huuquyet/PhoWhisper-next

Demo using PhoWhisper models of VinAI built with Transformers.js + Next.js

28
Experimental
2567 codekraft-studio/vue-speech

Vue integration and components for the Web Speech API

28
Experimental
2568 Helther/voice-pick-tbot

Text To Speech Synthesis Telegram Bot with voice customization

28
Experimental
2569 systoolz/dosbtalk

unofficial API implementation for Text-to-Speech Engine by First Byte

28
Experimental
2570 i-bardinov/Godot-Android-Text-to-Speech

Godot Android Text to Speech plugin for Godot Engine 3.4 or higher

28
Experimental
2571 WindQAQ/tensorflow-wavenet

Implementation of the WaveNet network based on TensorFlow.

28
Experimental
2572 IBM/iot-mic-sts-ifttt-slack

WARNING: This repository is no longer maintained. This repository...

28
Experimental
2573 void-xtreme/audible-text-editor

An automated Sinhala audio Text Editor for visually impaired and blind students

28
Experimental
2574 jaganadhg/nemoexamples

Experiments with NVIDIA NeMo

28
Experimental
2575 hugobloem/esp-ha-speech

Local speech recognition on an ESP32 for Home Assistant

28
Experimental
2576 18F/tts-buy-code-review

Solicitation documents for the code review procurement being undertaken by TTS.

28
Experimental
2577 opensource-spraakherkenning-nl/asr_nl

Dutch Speech Recognition webservice

28
Experimental
2578 jeantimex/F5-TTS-Server

F5-TTS server APIs for voice cloning and text-to-speech generation with...

28
Experimental
2579 chimechallenge/chime-utils

Scripts for data generation, scoring and data manifest preparation for...

28
Experimental
2580 xiaominfc/aliyun_nls_c_demo

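Alibaba Cloud's real-time speech recognition (ASR) service does not provide a C SDK. The author needed one for a project, studied the Java SDK implementation, and wrote this C demo.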

28
Experimental
2581 sap1119/voice-agent-0.01

A self-hosted, AI-powered voice assistant system with real-time voice...

28
Experimental
2582 mathquis/node-picotts

SVOX PicoTTS binding for Node.js

28
Experimental
2583 seven-io/go-client

Official Go API Client for seven.io

28
Experimental
2584 Ranjit2111/AI-Interview-Agent

Multi-agent AI system for interview practice. Features adaptive questioning,...

28
Experimental
2585 veralvx/xtts-finetune

XTTS fine-tuning via CLI

28
Experimental
2586 aria-music/zundacord

Japanese Text-to-speech bot for Discord, powered by VOICEVOX

28
Experimental
2587 jlia0/RealityTalk

RealityTalk: Real-Time Speech-Driven Augmented Presentation for AR Live Storytelling

28
Experimental
2588 sera619/S4M-2.0

German supported VoiceAssist without BigData

28
Experimental
2589 darsh-1010/Jarvis-A-Voice-Based-Assistant-Powered-by-LLaMA

Jarvis is a voice-based assistant built in Python that simplifies daily...

28
Experimental
2590 MelvilQ/stacksrs

A simple Spaced Repetition app for Android.

28
Experimental
2591 Yukaii/gakuon

Review Anki cards using Generative AI voice

28
Experimental
2592 sinProject-Inc/talk

Listening and Speaking

28
Experimental
2593 EtienneAb3d/SRT-Sync

Synchronize SRT timestamps over an existing accurate transcription

28
Experimental
2594 robotology/natural-speech

This repository contains a codebase to build automatic speech recognition...

28
Experimental
2595 dialpad/mucs_2021_dialpad

Dialpad team's submission to the MUCS 2021 workshop

28
Experimental
2596 NICEElevateAI/ElevateAIJavaSDK

Java SDK for ElevateAI

28
Experimental
2597 lelosaiyan/J.A.R.V.I.S.

A voice virtual desktop assistant for Windows 7/10

28
Experimental
2598 MazueraAlvaro/speech-recognition-asterisk

A script for speech recognition in Asterisk

28
Experimental
2599 speechly/react-example-repo-filtering

An example app for filtering data with Speechly and React

28
Experimental
2600 Babakinha/Dectalk

A Simple package for using Dectalk

28
Experimental