OpenVoice
Talk to your OpenClaw from your phone, 24/7.
At the gym, on a walk, wherever — just open Discord and speak.
Your OpenClaw sits in Discord Voice Chat ready to cook whenever you are. Say its name to activate it, and you can start working together.
Simply add your OpenClaw to your voice channel(s)
Full transcriptions sent to voice chat channel
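Curious how name-based activation can work? Here is a minimal sketch, assuming the agent activates when a transcribed utterance starts with its name. The function name `extract_command` and the matching rule are illustrative assumptions, not the code shipped in the private repo. "Midir" is just the example agent name used on this page.

```python
import re
from typing import Optional

WAKE_NAME = "Midir"  # example agent name; replace with your own


def extract_command(transcript: str, name: str = WAKE_NAME) -> Optional[str]:
    """Return the spoken command if the transcript opens with the agent's name.

    Returns None when the name is absent, so ordinary channel chatter is ignored.
    Returns an empty string when only the name was spoken.
    """
    # Optional "hey", then the name as a whole word, then any separator punctuation.
    pattern = rf"^\s*(?:hey\s+)?{re.escape(name)}\b[\s,.:!?-]*(.*)$"
    match = re.match(pattern, transcript, flags=re.IGNORECASE | re.DOTALL)
    if match is None:
        return None
    return match.group(1).strip()
```

Calling `extract_command("Hey Midir, deploy the staging branch")` yields the command text without the wake phrase, while a sentence that never mentions the name yields `None`.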
Think about it
Your OpenClaw already has access to your files, your GitHub, your APIs, your tools. It can deploy code, check emails, manage servers, write scripts — everything.
Now imagine you don't need to type any of that. You don't need your laptop open. You don't even need to look at a screen. You don't even need voice-to-text.
Just say "Hey Midir, deploy the staging branch" — and it's done. That's it. Jarvis in your ear, except it's YOUR agent with YOUR permissions.
"Hey, remember that content idea from last night? Let's flesh it out right now."
↑ On a walk. No screen. Just talking.
"Check if the PR passed CI and merge it if it's clean."
↑ Between sets at the gym.
"Read me the last 3 emails and draft a reply to the one from the client."
↑ Driving home. Hands on the wheel.
You in five minutes ↓
Demo
Watch your OpenClaw respond to voice commands in real time.
Setup
$10 (48hr) or $20. One-time payment, no subscriptions. Lifetime access to the private repo.
Check your email for the GitHub invite. Accept it, then clone the private repo to your machine.
Personalize your setup — add your API keys, choose your voice, pick your model. Your OpenClaw agent joins voice in minutes.
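To give a feel for the personalization step, here is a rough sketch of loading keys, voice, and model from environment variables. Every name in it (`VoiceSettings`, `load_settings`, `DISCORD_BOT_TOKEN`, `VOICE_NAME`, `VOICE_MODEL`) is a hypothetical placeholder. The private repo may use a different file format and different keys, so treat its own setup instructions as the source of truth.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class VoiceSettings:
    discord_token: str  # bot token used to join the voice channel (hypothetical key)
    tts_voice: str      # which voice the agent speaks with
    model: str          # which model answers voice requests


def load_settings(env: Optional[Mapping[str, str]] = None) -> VoiceSettings:
    """Build settings from environment variables, failing early on a missing token."""
    env = os.environ if env is None else env
    token = env.get("DISCORD_BOT_TOKEN", "")
    if not token:
        raise ValueError("DISCORD_BOT_TOKEN is not set")
    return VoiceSettings(
        discord_token=token,
        tts_voice=env.get("VOICE_NAME", "default"),
        model=env.get("VOICE_MODEL", "cheap-model"),
    )
```

Failing early on a missing token keeps a misconfigured agent from silently never joining voice.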
Make it yours
Here's how you can power this however YOU want.
Use MiniMax or Kimi K2.5 instead of your main model — $10/mo is all you need. Keep your main session context clean.
Run Whisper locally for free speech-to-text (STT): no API costs, slightly slower, but completely private. No internet needed.
Paid tier for faster TTS, more voices, higher quality. Worth it if you want the best voice experience.
Use your existing OpenClaw API keys here too. This might clog your session context, so we suggest a cheap model if you want to keep things separate.
Runs as an isolated subagent — zero context used from your main session. Or swap in Gemini Flash for free casual voice chat (1M+ free tokens, ~1s responses). Requires additional setup.
Make it as smart or simple as you want. Spend $50/mo or $10/mo — your choice. Nothing new needed.
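For the local Whisper option above, transcription can look roughly like this. The sketch assumes the open-source `openai-whisper` package (with `ffmpeg` installed) and a hypothetical helper name, `transcribe_locally`. How the repo actually wires Whisper into Discord audio is not shown here.

```python
from typing import Any, Optional


def transcribe_locally(
    audio_path: str,
    model: Optional[Any] = None,
    model_size: str = "base",
) -> str:
    """Transcribe an audio file on this machine and return the recognized text.

    Pass an already loaded model to avoid reloading weights on every call.
    """
    if model is None:
        # Imported lazily so the package is only required when no model is supplied.
        import whisper  # pip install openai-whisper

        model = whisper.load_model(model_size)
    result = model.transcribe(audio_path)  # returns a dict with a "text" field
    return result["text"].strip()
```

Smaller model sizes such as "tiny" or "base" trade accuracy for speed, which matters for voice chat. Loading the model once and passing it in on each call avoids the startup delay.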
FAQ