Voice Input for Claude Cowork

Spokenly is a push-to-talk dictation app for macOS and iOS. The simplest way to use it with Claude Cowork: press a keyboard shortcut, speak, and your words appear in the Cowork text field. For a deeper integration, connect Spokenly via MCP so Cowork can ask you questions by voice during autonomous tasks.

Published March 2026

What is Claude Cowork?

Claude Cowork is Anthropic's agentic AI for knowledge work. Unlike chat, where you ask one question at a time, Cowork takes on entire tasks: research, document creation, data extraction, file organization. You describe the goal, Cowork plans and executes it, and you steer along the way.

Cowork runs inside the Claude Desktop app, available on macOS and Windows. It accesses your local files, connects to services like Google Drive and Gmail, and can even use your computer directly with keyboard and mouse when no connector exists. It requires a paid Claude plan (Pro, Max, Team, or Enterprise).

The Simple Way: Push-to-Talk Hotkey

The fastest way to start is the keyboard shortcut. Install Spokenly, press the hotkey (customizable, works system-wide), speak, and the transcribed text appears right in the Cowork input field. No configuration, no MCP, no integration steps. It works because Spokenly inserts text at the cursor position in any app on your Mac, including Claude Desktop.

This is ideal for task descriptions. Instead of typing a detailed prompt about what you want Cowork to do, just speak it. A 30-second explanation gives Cowork significantly more context than the two sentences you would normally type. The same works for mid-task feedback: when Cowork pauses and asks a question, click the input field, press your Spokenly shortcut, and speak your answer.

The Advanced Way: MCP Integration

For a deeper integration, you can connect Spokenly to Claude Desktop via MCP. This lets Cowork call Spokenly directly when it needs your input during an autonomous task.

Spokenly runs a local MCP server that exposes a voice dictation tool called ask_user_dictation. You add a lightweight bridge script to Claude Desktop's MCP config, and it connects via stdio transport. When Cowork needs your input, it calls this tool instead of showing a plain text prompt. A recording overlay appears in Spokenly with the agent's question. You read it, speak your answer naturally, and press Enter. The transcribed text goes back to Cowork automatically.
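
To make the exchange concrete, here is a minimal Python sketch of the messages involved. MCP tool calls are JSON-RPC 2.0 requests with the method tools/call, and the stdio transport sends one JSON message per line. The argument name "question" and the sample reply are assumptions for illustration; Spokenly's actual tool schema may differ.

```python
import json


def build_tool_call(request_id: int, question: str) -> str:
    """Serialize a JSON-RPC 2.0 `tools/call` request as a single line."""
    message = {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "ask_user_dictation",
            # Hypothetical argument name; the real schema may differ.
            "arguments": {"question": question},
        },
    }
    # json.dumps escapes newlines inside strings, so the result stays on one line.
    return json.dumps(message, separators=(",", ":"))


def extract_answer(response_line: str) -> str:
    """Join the text parts of a `tools/call` result into one answer string."""
    response = json.loads(response_line)
    if "error" in response:
        raise RuntimeError(response["error"].get("message", "unknown JSON-RPC error"))
    parts = response["result"].get("content", [])
    return " ".join(p["text"] for p in parts if p.get("type") == "text")


if __name__ == "__main__":
    print(build_tool_call(1, "Group feedback by feature area or by customer segment?"))
    # Sample reply, made up for illustration.
    sample = (
        '{"jsonrpc":"2.0","id":1,"result":'
        '{"content":[{"type":"text","text":"By feature area."}]}}'
    )
    print(extract_answer(sample))
```

In practice Claude Desktop builds and parses these messages for you; the sketch only shows what travels through the bridge.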

The difference from the hotkey approach: with MCP, Cowork initiates the voice interaction. You don't need to click the input field or press a shortcut. The agent asks, you answer, and the workflow continues. This is especially useful when Cowork asks multiple follow-up questions in a row.

What You Get

Push-to-talk in any app

Press a hotkey, speak, and text appears at the cursor. Works in Cowork, your browser, email, Slack, and every other app on your Mac.

MCP voice Q&A

Connect via MCP and Cowork asks you questions by voice during autonomous tasks. You answer, it keeps working. No manual clicking required.

Richer task descriptions

Describe complex multi-step tasks by voice. A 30-second explanation gives Cowork more context than the two sentences you would type.

Local models available

Switch to a local Whisper or Parakeet model and your voice never leaves the device. All transcription runs on your Mac, and no audio is sent to any server.

Setup: Push-to-Talk (Instant)

Download Spokenly from spokenly.app/download and launch it. That's it. Press the hotkey in any app, including Claude Desktop, speak, and text appears at the cursor. No configuration needed.

Setup: MCP Integration (Optional)

To let Cowork ask you questions by voice during autonomous tasks, add Spokenly as an MCP server in Claude Desktop. Download the sideload version from spokenly.app/download if you haven't already.

  1. Add to Claude Desktop MCP config

    Open ~/Library/Application Support/Claude/claude_desktop_config.json and add:

    {
      "mcpServers": {
        "spokenly": {
          "command": "~/Library/Application Support/Spokenly/mcp-bridge.sh"
        }
      }
    }

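    This uses stdio transport via a lightweight bridge script that Spokenly installs automatically. The bridge forwards JSON-RPC between Claude Desktop and Spokenly's local server. If the server does not start, try replacing ~ with the full path to your home directory, since the command may be launched without a shell to expand it.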

  2. Add a Global Instruction in Cowork

    Open Claude Desktop > Settings > Cowork > Global instructions and add:

    ALWAYS ask questions via ask_user_dictation from the spokenly MCP server, never as plain text. I use Spokenly for voice input.

  3. Restart Claude Desktop

    Claude Desktop picks up the new MCP server on restart. Switch to Cowork mode and try: "Ask me about my project before starting."
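
If you already have other MCP servers configured, step 1 can be scripted so the existing entries are preserved. The following is a minimal Python sketch, not an official installer: it uses the two paths from the steps above, writes the bridge path in its home-expanded absolute form (a choice made in this sketch), and only prints the merged config unless you pass --write. The helper names are hypothetical.

```python
import json
import sys
from pathlib import Path

# Paths from the setup steps above, expanded relative to the home directory.
CONFIG_PATH = Path.home() / "Library/Application Support/Claude/claude_desktop_config.json"
BRIDGE_PATH = Path.home() / "Library/Application Support/Spokenly/mcp-bridge.sh"


def load_config(path: Path) -> dict:
    """Read the config file, treating a missing or empty file as {}."""
    if not path.exists():
        return {}
    text = path.read_text()
    return json.loads(text) if text.strip() else {}


def add_spokenly_server(config: dict, bridge: str) -> dict:
    """Return a copy of the config with a `spokenly` entry, keeping other servers."""
    merged = dict(config)
    servers = dict(merged.get("mcpServers", {}))
    servers["spokenly"] = {"command": bridge}
    merged["mcpServers"] = servers
    return merged


if __name__ == "__main__":
    updated = add_spokenly_server(load_config(CONFIG_PATH), str(BRIDGE_PATH))
    if "--write" in sys.argv:
        CONFIG_PATH.parent.mkdir(parents=True, exist_ok=True)
        CONFIG_PATH.write_text(json.dumps(updated, indent=2) + "\n")
        print(f"Wrote {CONFIG_PATH}")
    else:
        # Dry run by default: show the result without touching the file.
        print(json.dumps(updated, indent=2))
```

Run it once without arguments to review the output, then again with --write, and restart Claude Desktop as in step 3.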

For detailed instructions and troubleshooting, see the full setup guide.
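
As a quick troubleshooting aid, you can check whether an MCP server command answers at all before involving Claude Desktop. This Python sketch sends a single JSON-RPC initialize request over stdio and returns the first JSON reply. It is an illustration under assumptions: the function name is hypothetical, the protocol revision shown is one published MCP version that the server may or may not accept, and how Spokenly's bridge behaves when probed this way is not documented here.

```python
import json
import subprocess


def probe_mcp_server(command: list[str], timeout: float = 5.0) -> dict:
    """Send one `initialize` request over stdio and return the first JSON reply."""
    request = {
        "jsonrpc": "2.0",
        "id": 1,
        "method": "initialize",
        "params": {
            # Assumed protocol revision; the server may negotiate another one.
            "protocolVersion": "2024-11-05",
            "capabilities": {},
            "clientInfo": {"name": "probe", "version": "0.0.1"},
        },
    }
    proc = subprocess.Popen(
        command, stdin=subprocess.PIPE, stdout=subprocess.PIPE, text=True
    )
    try:
        # communicate() writes the request, closes stdin, and collects stdout.
        stdout, _ = proc.communicate(json.dumps(request) + "\n", timeout=timeout)
    except subprocess.TimeoutExpired:
        # The server kept running; stop it and keep whatever it printed so far.
        proc.kill()
        stdout, _ = proc.communicate()
    for line in stdout.splitlines():
        line = line.strip()
        if line.startswith("{"):
            return json.loads(line)
    raise RuntimeError("no JSON-RPC reply received from the server")
```

To try it against the bridge, pass the script's full path as a one-element list, for example probe_mcp_server(["/full/path/to/mcp-bridge.sh"]), with Spokenly running. A RuntimeError suggests the bridge or Spokenly's local server is not responding.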

Real Workflow Examples

Research synthesis: You ask Cowork to analyze 20 customer interview transcripts and pull out recurring themes. Midway, Cowork asks "Should I group feedback by feature area or by customer segment?" You speak your preference in five seconds instead of switching context to type.

Document preparation: Cowork is drafting a quarterly business review from your spreadsheets and notes. It asks "The Q1 revenue figures in the spreadsheet differ from the slide deck. Which source should I use?" You explain the discrepancy by voice, giving full context that helps Cowork make the right call.

File organization: Cowork is sorting hundreds of files from your Downloads folder. It asks "I found 15 files that could be either personal or work-related. Want me to show you each one, or should I use folder hints?" A quick spoken answer keeps the task moving without breaking your flow.

Data extraction: Cowork is parsing contracts into a structured spreadsheet. It asks "Some contracts list pricing per unit and others list total cost. Should I normalize everything to per-unit pricing?" You explain the business logic by voice in 15 seconds. Typing that same explanation would take two minutes.

Spokenly vs Typing in Cowork

| Scenario | Voice (Spokenly) | Typing |
| --- | --- | --- |
| Quick confirmation | 2 seconds | 5 seconds |
| Detailed task description | 30 seconds for a full paragraph | 3-5 minutes |
| Mid-task context | Speak naturally with full nuance | Often shortened to save time |
| Multi-question sessions | Answer each question by voice, one by one | Tab back and forth to type |
| Privacy | Local models, audio stays on device | Text visible on screen |
| Price | Free (local + own API keys) | Included with Cowork |

Voice input shines most when Cowork asks for nuanced context. The more detail you provide, the better the final deliverable.

Frequently Asked Questions

How do I add voice input to Claude Cowork?

The quickest way: download Spokenly from spokenly.app/download, launch it, and use the hotkey to dictate into any text field, including Cowork. For agent-initiated voice Q&A, also add Spokenly as an MCP server in Claude Desktop's config.

Is voice input for Claude Cowork free?

Spokenly's MCP server is included in the free tier. Local speech-to-text models (Whisper, Parakeet) are free with no account, no time limits, and no word caps. Cloud models are available via Spokenly's Pro plan or with your own API keys.

Does Claude Cowork support MCP tools?

Yes. Claude Desktop supports MCP servers, and Cowork tasks can use any MCP tool configured in the app. Spokenly registers a voice dictation tool that Cowork calls when it needs your input.

Can I use Spokenly for both Cowork and Claude Code?

Yes. Both Claude Code and Claude Desktop (Cowork) connect to Spokenly via stdio transport using the same bridge script. You can have both configured at the same time.

Does my voice data stay private?

With local models, all speech-to-text processing happens on your Mac. Audio never leaves your device. You can enable Local Only Mode in Spokenly to ensure no network requests are made.

What platforms does Claude Cowork support?

Cowork is available in Claude Desktop on macOS and Windows. The MCP integration relies on Spokenly's macOS app, so it works only on a Mac, and Spokenly and Claude Desktop need to run on the same machine.

Do I need a Claude Pro subscription for Cowork?

You need a paid Claude plan, but not specifically Pro: Cowork is available on Pro, Max, Team, and Enterprise. Spokenly itself is free with local models.
