开源语音合成工作室
Voicebox是一款开源、本地优先的语音合成工作室,可作为ElevenLabs的免费替代方案。它允许用户仅凭几秒钟的音频样本即可克隆声音,并支持使用Qwen3-TTS、LuxTTS等五种引擎,以23种语言生成语音。项目内置音高调整、混响等后期效果,以及一个多音轨时间线编辑器,便于制作对话、播客等叙事内容。所有语音模型和数据均在本地运行,确保了完全的隐私安全。其提供的REST API便于开发者将语音合成功能集成到自己的应用程序中。该工具基于Tauri(Rust)构建,性能高效,支持macOS(MLX/Metal)、Windows(CUDA)、Linux等多种平台与硬件加速。
📖 README
VoiceboxThe open-source voice synthesis studio.
Clone voices. Generate speech. Apply effects. Build voice-powered apps.
All running locally on your machine.voicebox.sh •
Docs •
Download •
Features •
APIClick the image above to watch the demo video on voicebox.shWhat is Voicebox?Voicebox is a local-first voice cloning studio — a free and open-source alternative to ElevenLabs. Clone voices from a few seconds of audio, generate speech in 23 languages across 5 TTS engines, apply post-processing effects, and compose multi-voice projects with a timeline editor.
• Complete privacy — models and voice data stay on your machine
• 5 TTS engines — Qwen3-TTS, LuxTTS, Chatterbox Multilingual, Chatterbox Turbo, and HumeAI TADA
• 23 languages — from English to Arabic, Japanese, Hindi, Swahili, and more
• Post-processing effects — pitch shift, reverb, delay, chorus, compression, and filters
• Expressive speech — paralinguistic tags like `[laugh]`, `[sigh]`, `[gasp]` via Chatterbox Turbo
• Unlimited len...
📊 项目信息
- 语言
- TypeScript
- Stars
- ⭐ 19,047
- Forks
- 2,195
- 今日新增
- +880
- 排名
- #4
- 收录
- 总榜
- 趋势日期
- 2026年4月16日
- 最后推送
- 2026/4/16
🏷️ 标签
aicudamlxqwen3-ttsqwen3-tts-uivoice-aivoice-clonewhisper


