isaacwiafe/speech_data_ghana_ug

The dataset comprises of 5000 hours speech corpus in Akan, Ewe, Dagbani, Daagare, and Ikposo. Each language includes 1000 hours of audio speech from indigenous speakers of the language and 100 hours of transcription.

/ 100

Experimental

No commits in the last 6 months.

Stale 6m No Package No Dependents

Maintenance 0 / 25

Adoption 1 / 25

Maturity 9 / 25

Community 0 / 25

How are scores calculated?

Stars

Forks

—

Language

HTML

License

—

Category

llm-scaling-architecture

Last pushed

Dec 29, 2024

Commits (30d)

GitHub

LLM Scaling Architecture · 49 tools

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/llm-tools/isaacwiafe/speech_data_ghana_ug"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Higher-rated alternatives

aalok-sathe/surprisal

A unified interface for computing surprisal (log probabilities) from language models! Supports...

EvolvingLMMs-Lab/lmms-engine

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

FunnySaltyFish/Better-Ruozhiba

【逐条处理完成】人为审核+修改每一条的弱智吧精选问题QA数据集

reasoning-machines/pal

PaL: Program-Aided Language Models (ICML 2023)

microsoft/monitors4codegen

Code and Data artifact for NeurIPS 2023 paper - "Monitor-Guided Decoding of Code LMs with Static...

Explore LLM Tools

All categories Trending LLM Tool directory Insights