
Voice Add-on

Addons | 4 min read | Last updated February 10, 2026

What the Voice Add-on Does

The voice add-on gives your OpenClaw instance the ability to process voice messages. When enabled, your instance can receive audio messages through connected messengers, transcribe them to text, and respond. It also supports text-to-speech for sending voice replies back to users.

Without this add-on, voice messages sent to your instance are ignored.

How It Works

Voice processing runs in two directions:

  • Speech-to-text (STT) — Incoming voice messages are transcribed into text so the LLM can understand and respond to them
  • Text-to-speech (TTS) — The LLM's text response is converted to audio and sent back as a voice message

Both directions are handled automatically once the add-on is active. You do not need to configure STT and TTS separately.
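
The round trip described above can be pictured as three stages chained together. The following is only a conceptual sketch, not the platform's actual implementation: the function name `handle_voice_message` and the three stand-in stages are hypothetical and exist purely to show the order of operations.

```python
from typing import Callable


def handle_voice_message(
    audio: bytes,
    transcribe: Callable[[bytes], str],
    generate_reply: Callable[[str], str],
    synthesize: Callable[[str], bytes],
) -> bytes:
    """Run one voice message through STT, the LLM, and TTS, in that order."""
    text = transcribe(audio)        # speech-to-text: audio in, text out
    reply = generate_reply(text)    # the LLM answers the transcribed text
    return synthesize(reply)        # text-to-speech: text in, audio out


# Hypothetical stand-in stages so the sketch runs without any real
# speech or LLM service. They are NOT part of the product.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_generate_reply(text: str) -> str:
    return f"You said: {text}"


def fake_synthesize(text: str) -> bytes:
    return text.encode("utf-8")


voice_reply = handle_voice_message(
    b"hello", fake_transcribe, fake_generate_reply, fake_synthesize
)
```

In the hosted product all three stages run for you once the add-on is active; the sketch only illustrates why an LLM is needed between the two voice steps.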

Pricing

The voice add-on is billed monthly based on your selected pack size. Pricing covers both speech-to-text and text-to-speech processing.

| Pack     | Monthly Price | Best For                            |
|----------|---------------|-------------------------------------|
| Starter  | €2            | Light usage, testing voice features |
| Standard | €8            | Regular voice conversations         |
| Pro      | €25           | High-volume voice processing        |

Usage is measured in voice processing minutes, and each pack includes a monthly allowance of them.

Setting Up the Voice Add-on

  1. Open your instance in the ClawHosters dashboard
  2. Go to Add-ons > Voice
  3. Choose a pack size (Starter, Standard, or Pro)
  4. Confirm your subscription

Voice processing is available immediately after subscribing. Any voice messages received through connected messengers will be transcribed and processed.

Requirements

The voice add-on requires:

  • An active LLM subscription (BYOK or managed pack) — the transcribed text needs an LLM to generate a response
  • At least one connected messenger channel that supports voice messages (Telegram, WhatsApp)

Voice message support on Discord and Slack is partial and depends on how the bot integration is configured.

Supported Messengers

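| Messenger | Voice Input (STT)    | Voice Output (TTS)   |
|-----------|----------------------|----------------------|
| Telegram  | Yes                  | Yes                  |
| WhatsApp  | Yes                  | Yes                  |
| Discord   | Depends on bot setup | Depends on bot setup |
| Slack     | Limited              | Limited              |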

Telegram and WhatsApp have full voice message support. Discord and Slack support varies depending on how the bot integration is configured.

Usage Tracking

Voice processing minutes are tracked in real time on the add-ons page:

  • Minutes used — How many voice processing minutes you have consumed this period
  • Minutes remaining — How many you have left in your pack
  • Usage percentage — A visual indicator of consumption
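
The three figures are related by simple arithmetic. The sketch below shows one plausible way to derive them. The class name `VoiceUsage` is hypothetical, and the 60-minute allowance in the example is an illustrative number, not the actual size of any pack.

```python
from dataclasses import dataclass


@dataclass
class VoiceUsage:
    """Hypothetical model of the figures shown on the add-ons page."""

    minutes_included: float  # monthly allowance of the pack (assumed value)
    minutes_used: float      # minutes consumed so far this period

    @property
    def minutes_remaining(self) -> float:
        # Never report a negative balance.
        return max(self.minutes_included - self.minutes_used, 0.0)

    @property
    def usage_percentage(self) -> float:
        # Cap at 100% and avoid dividing by zero for an empty allowance.
        if self.minutes_included <= 0:
            return 100.0
        return min(self.minutes_used / self.minutes_included * 100.0, 100.0)


# Illustrative numbers only: 15 of an assumed 60 minutes used.
usage = VoiceUsage(minutes_included=60, minutes_used=15)
print(usage.minutes_remaining, usage.usage_percentage)
```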

What Happens When You Run Out

If your voice pack runs out of processing minutes:

  • Incoming voice messages are no longer transcribed
  • Text-to-speech replies are no longer generated
  • Your instance continues working normally for text messages
  • Voice processing resumes when your pack resets at the next billing period or you upgrade to a larger pack
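
The behaviour in the list above amounts to a small decision rule. The following sketch models it under stated assumptions: `plan_message` and `MessagePlan` are hypothetical names, and the sketch assumes voice replies are only considered for incoming voice messages, which the list above does not spell out.

```python
from typing import NamedTuple


class MessagePlan(NamedTuple):
    respond: bool      # does the instance answer this message at all?
    transcribe: bool   # is speech-to-text run on the message?
    voice_reply: bool  # is the answer converted to audio?


def plan_message(kind: str, minutes_remaining: float) -> MessagePlan:
    """Decide how a message is handled, given the remaining voice minutes."""
    if kind == "text":
        # Text messages keep working regardless of the voice balance.
        return MessagePlan(respond=True, transcribe=False, voice_reply=False)
    if kind == "voice":
        # Voice needs minutes for both directions; with none left,
        # the message is neither transcribed nor answered by voice.
        has_minutes = minutes_remaining > 0
        return MessagePlan(
            respond=has_minutes, transcribe=has_minutes, voice_reply=has_minutes
        )
    raise ValueError(f"unknown message kind: {kind!r}")
```

For example, `plan_message("voice", 0)` yields a plan with every flag set to `False`, while `plan_message("text", 0)` still responds.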

Managing Your Subscription

Upgrading

You can upgrade your pack at any time from the add-ons page. The new pack takes effect immediately. Any remaining minutes from the old pack carry over for the current period.

Downgrading

Downgrades take effect at the start of the next billing period. You keep your current pack's allowance until then.

Cancelling

Cancel the voice add-on from the add-ons page. Voice processing stops at the end of the current billing period. Your instance continues working for text messages.

Troubleshooting

Voice messages are not being transcribed

  • Verify the voice add-on is active on the add-ons page
  • Check that your voice pack has remaining minutes
  • Confirm your messenger channel is properly connected
  • Make sure you have an active LLM subscription (BYOK or managed pack); without an LLM, a transcribed message cannot be answered
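
If you keep your own notes on instance state, the checklist above can be run mechanically. This is a minimal sketch: `diagnose_transcription` and its parameters are hypothetical, and you would fill in the values by hand from what the dashboard shows, since no API for reading them is assumed here.

```python
def diagnose_transcription(
    addon_active: bool,
    minutes_remaining: float,
    messenger_connected: bool,
    llm_active: bool,
) -> list[str]:
    """Return the checklist items that fail, in the order listed above."""
    problems = []
    if not addon_active:
        problems.append("Voice add-on is not active")
    if minutes_remaining <= 0:
        problems.append("Voice pack has no remaining minutes")
    if not messenger_connected:
        problems.append("Messenger channel is not connected")
    if not llm_active:
        problems.append("No active LLM subscription")
    return problems


# Example: add-on active and messenger connected, but the pack is used up.
print(diagnose_transcription(True, 0, True, True))
```

An empty list means all four checks pass and the cause lies elsewhere, for example an instance that is not in "Running" status.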

Audio quality is poor in TTS responses

  • TTS quality depends on the voice model used by the platform
  • Short, clear sentences generally produce better audio output
  • Very long responses may be truncated in voice form

"Voice add-on not available" error

  • The voice add-on requires an active instance in "Running" status
  • Instances in error, stopped, or paused states cannot process voice messages
