You

"Run db:migrate on production and verify the schema"

⚡

OpenClaw BOT

"Migration complete. 14 tables updated, all constraints valid. ✅"

You

"I'm at the gym — what exercise should I do next for chest?"

⚡

OpenClaw BOT

"Bench, incline dumbbell press, cable flys. Want me to track your sets? 💪"

You

"Spin up staging and deploy branch feature/auth"

⚡

OpenClaw BOT

"Staging is live at staging-auth.fly.dev — branch deployed. 🚀"

You

"Merge PR #47, run the test suite, deploy to prod"

⚡

OpenClaw BOT

"Merged, 47/47 tests passing. Production deploy at v2.3.1."

You

"Remember that content idea from last night? Let's flesh it out"

⚡

OpenClaw BOT

"Pulled up your notes. The 'AI agents for dentists' thread — want to outline it?"

You

"Pull last 24h analytics and graph the conversion funnel"

⚡

OpenClaw BOT

"Funnel report ready. 12.4% conversion, up 2.1% from yesterday. 📊"

OpenVoice

Give your OpenClaw
a voice.

Talk to your OpenClaw from your phone, 24/7.
At the gym, on a walk, wherever — just open Discord and speak.

Get Access — $10

Secured with Stripe · Lifetime access · One-time payment · No subscription

Launch price ends in 48:00:00 — then $20

✓ Your OpenClaw, with a voice ✓ Your existing API keys ✓ Link to existing Discord ✓ No subscription ✓ Powered by ElevenLabs ✓ Powered by OpenAI Whisper

Your agent. In voice. Right now.

Your OpenClaw sits in Discord Voice Chat ready to cook whenever you are. Say its name, it'll activate and you can start working together.

Simply add your OpenClaw to your voice channel(s)

OpenClaw responding to voice commands in Discord chat

Full transcriptions sent to voice chat channel

Think about it

Your entire OpenClaw.
In your ear.

Your OpenClaw already has access to your files, your GitHub, your APIs, your tools. It can deploy code, check emails, manage servers, write scripts — everything.

Now imagine you don't need to type any of that. You don't need your laptop open. You don't even need to look at a screen. You don't even need voice to text.

Just say "Hey Midir, deploy the staging branch" — and it's done. That's it. Jarvis in your ear, except it's YOUR agent with YOUR permissions.

"Hey, remember that content idea from last night? Let's flesh it out right now."

↑ On a walk. No screen. Just talking.

"Check if the PR passed CI and merge it if it's clean."

↑ Between sets at the gym.

"Read me the last 3 emails and draft a reply to the one from the client."

↑ Driving home. Hands on the wheel.

You in five minutes ↓

Demo

See it in action

Watch your OpenClaw respond to voice commands in real time.

Give your OpenClaw a voice.
— ashen (@ashen_one) March 3, 2026

Watch on X →

Setup

Three steps. Five minutes.

Pay

$10 (48hr) or $20. One-time payment, no subscriptions. Lifetime access to the private repo.

Clone

Check your email for the GitHub invite. Accept it, clone the private repo to your machine.

Run

Personalize your setup — add your API keys, choose your voice, pick your model. Your OpenClaw agent joins voice in minutes.

Make it yours

Optimize your setup

Here's how you can power this however YOU want.

💰

Use a cheaper model

Run a local model like Qwen 3.5 (7B) for free voice chat — no API costs at all. Or use MiniMax, Kimi K2.5, or Gemini Flash ($0–10/mo). Handles casual conversation so your main agent stays clean.

💡 Best for: Keeping costs low

🏠

Run Whisper locally

Whisper runs locally to convert your speech to text — completely free, private, no API costs. Your voice never leaves your machine.

💡 Best for: Privacy & zero ongoing costs

🗣️

Free TTS with Edge

Microsoft Edge TTS — free, unlimited, no API key, no account needed. Dozens of natural-sounding voices out of the box. Or upgrade to ElevenLabs for premium quality and voice cloning.

💡 Edge = free forever. ElevenLabs = premium upgrade.

🔑

Main OpenClaw model

Use your existing OpenClaw API keys here too. Might clog session context — we suggest a cheap model if you want to keep things separate.

💡 Best for: Simplicity

🪶

Run it like an actual Jarvis

Want the full sci-fi experience? Use Gemini Flash for instant responses (~1s), ElevenLabs for a real human voice, and your main OpenClaw for action commands. Fast, natural, hands-free. Your own personal AI assistant.

💡 Speed + voice quality = actual Jarvis vibes.

⚙️

$0/mo is possible

Local Whisper (free STT) + Edge TTS (free voice) + Qwen 3.5 local (free LLM) = $0/month ongoing costs. The only cost is the one-time purchase. Or mix and match — ElevenLabs for better voices, Opus for smarter responses. Your call.

💡 Pay once. Run forever. Zero monthly fees.

FAQ

Common questions

How does it work? +

Your OpenClaw joins a Discord voice channel as a bot. When you speak, Whisper transcribes your voice to text locally (free, private). That text gets sent to your OpenClaw, which processes it like any other message. Then Edge TTS (free) or ElevenLabs (premium) converts the response back to speech and plays it in the voice channel. You talk, it talks back. Full conversation, hands-free.

Why should I pay $10 for this? +

You don't have to. You could probably vibecode this entire thing yourself. But will you? Ever? Or do you just want access NOW and not have to deal with all that? $10 is cheaper than the 3+ hours you'd spend building, debugging, and wiring it all together. Your call.

What's the cheapest way to run this? +

Literally $0/month. Whisper runs locally (free STT). Edge TTS is built in (free voice). Run a local model like Qwen 3.5 for the brain (free). The only cost is the one-time $10 purchase — no subscriptions, no monthly fees, no API keys required. Want premium? ElevenLabs, Gemini Flash, Claude — all optional upgrades.

How will this affect my OpenClaw? +

This IS your OpenClaw. It gives it a voice you can speak to. Whisper transcribes what you say, your OpenClaw processes it, and Edge TTS (or ElevenLabs) turns its response into speech. Same agent, new interface.

Do I need to pay for ElevenLabs? +

No. OpenVoice ships with Microsoft Edge TTS — completely free, unlimited, no API key, no account. Dozens of natural voices built in. ElevenLabs is an optional upgrade if you want premium voices or voice cloning. Most people never need it.

Can I run Whisper locally? +

Yes — run it directly on your OpenClaw's hardware. Won't increase CPU usage by much, your voice never leaves your machine, and it's usually faster than the API since there's no network round trip. You can also use the OpenAI Whisper API if your hardware is older.

How hard is the setup? +

Super easy. Grab your Discord bot token, run the install script, done. Edge TTS works out of the box — no API keys needed. Whisper runs locally too. The README has copy-paste commands. Most people are up and running in under 10 minutes.

What if I use my main API key? +

Totally fine — voice commands are short and won't break the bank. But it will fill your session with voice transcripts and use up context. If you're okay with that, keep everything under one API key or subscription — no need for another one. If you want it clean, use a cheap separate model.

Give your OpenClawa voice.

Your agent. In voice. Right now.

Your entire OpenClaw.In your ear.

See it in action

Three steps. Five minutes.

Pay

Clone

Run

Optimize your setup

Use a cheaper model

Run Whisper locally

Free TTS with Edge

Main OpenClaw model

Run it like an actual Jarvis

$0/mo is possible

Common questions

Give your OpenClaw
a voice.

Your entire OpenClaw.
In your ear.