UzVoice β an open infrastructure for Uzbek speech-to-text and text-to-speech, trained on community-contributed voice data covering all 13 regional dialects of Uzbekistan and Karakalpakstan.
Whisper and Google STT recognize "literary Tashkent Uzbek" with mediocre accuracy and completely fail on Khorezm, Karakalpak, or Surkhandarya dialects. That's not a corner case β it's how 60% of the country speaks.
From standard Tashkent to rare Karakalpak β each dialect gets dedicated training data and benchmarks.
Voice contributors are rewarded with points, regional contests, and referral bonuses. Rare dialects earn x3-x5 multipliers.
Sub-500ms streaming STT for call centers, government services, and education. Fully self-hostable for sensitive workloads.
Trained on consented data, model cards published, evaluation benchmarks open. No black-box surprises in production.
Every API request improves the model: low-confidence audio is routed to human annotators, high-confidence becomes pseudo-labels.
Uzbek-specific G2P rules, fricatives like "q" and "Κ»" properly modeled, code-switching to Russian and Tajik handled.
Rare regional dialects get higher reward multipliers in our Telegram collection bot. The rarer the speech, the more valuable the contribution.
Public progress: what's shipped, what's in flight, what's next.
Live and accepting voice contributions with consent + gamification.
Label Studio + MinIO storage + dialect-aware annotation schema.
Whisper large-v3-turbo bootstraps annotators with pre-filled drafts.
Targeting May 2026 β community contributions + paid annotators.
Public access for select partners (call centers, education, gov).
Synthesize speech in any of the 13 dialects, voice cloning for licensed talent.
CC-BY-NC dataset for academic research, partnerships with national universities.
Open the Telegram bot, agree to consent, choose your region, and start reading prompts. Earn points, climb regional leaderboards, invite friends.
Open @uzvoicerobot βLooking for early API access, custom dialect models, or full on-premise deployment for your call center, school, or government service?
Get in touch β