
Voice Input for Cursor AI Editor
Spokenly is a push-to-talk dictation app for macOS and iOS. Press a shortcut, speak, and text appears at the cursor in any app. It also integrates with Cursor via MCP (Model Context Protocol), so the agent can ask you questions and get voice answers directly.
Updated March 2026
Why Voice Input in Your Editor
Cursor's AI-powered features work best with detailed context from you. When the agent asks about your project structure, preferred patterns, or what a feature should do, a quick typed "make it work" gives it almost nothing to go on. But a spoken answer naturally includes the details it needs.
"Use the existing auth middleware from the shared package. The endpoint should validate the JWT, extract the user ID, check against the team permissions table, and return a 403 if the user is not an admin. We have a similar pattern in the billing endpoints, look at those for reference." That takes 15 seconds to say. It would take a minute to type. And you'd probably skip most of it.
Because Spokenly integrates with Cursor via MCP, the agent calls it directly when it needs your input. You speak your answer, and Cursor gets the rich context it needs to write better code.
How Spokenly Connects to Cursor
Spokenly runs a local MCP server at localhost:51089. You add it to Cursor's MCP configuration file, and Cursor's AI gets access to a voice dictation tool called ask_user_dictation.
When Cursor's agent or composer needs your input, it calls this tool. A recording prompt appears in Spokenly showing the question. You speak your answer and press Enter. The transcribed text goes back to Cursor, and the AI continues with your detailed response as context.
No bridge scripts, no terminal commands during use. Cursor connects directly to Spokenly's local server over HTTP.
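For readers curious what "calls this tool" looks like on the wire, here is a rough sketch. MCP tool invocations are JSON-RPC 2.0 requests with the method tools/call. The tool name ask_user_dictation comes from the description above; the argument name "question" is an assumption made for illustration, and the real input schema is whatever Spokenly's server advertises. Cursor builds and sends this message itself, so you never write it by hand.

```python
import json


def build_tool_call(question: str, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 request that invokes an MCP tool by name."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "ask_user_dictation",
            # Assumption: "question" is a hypothetical argument name.
            # The actual schema is defined by the Spokenly MCP server.
            "arguments": {"question": question},
        },
    }


# Print the request the way it would be serialized before sending.
print(json.dumps(build_tool_call("Which auth middleware should I reuse?"), indent=2))
```

The transcribed answer comes back as the result of that same request, which is why the agent can continue without any extra glue code.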
What You Get
Voice in your editor
Cursor's agent asks questions, you answer by speaking. Give context about your project, explain design decisions, or describe what you want. Long answers in seconds, without typing a word.
Works with agent and composer
Agent mode, composer, and any workflow that makes MCP tool calls. Voice input works wherever Cursor's AI can call tools.
Quick config
Add one entry to mcp.json and a User Rule. Cursor picks up the voice tool on restart. Under two minutes from install to first voice interaction.
Local models available
Switch to local Whisper or Parakeet and your voice never leaves the device. All transcription stays on your Mac. No audio sent to any server.
Setup Guide
Download the sideload version from spokenly.app/download and launch it. On first launch, the app will offer to set up the Cursor integration automatically.
Or set up manually:
1. Add to Cursor MCP config
Open ~/.cursor/mcp.json and add:
    {
      "mcpServers": {
        "spokenly": {
          "url": "http://localhost:51089"
        }
      }
    }
2. Add a User Rule
Open Cursor Settings > Rules > User Rules and add:
    ALWAYS ask questions via ask_user_dictation from the spokenly MCP server, never as plain text. I use Spokenly for voice input.
3. Restart Cursor
Cursor connects to Spokenly on restart. Test by asking the agent to ask you a question.
See the full setup guide for troubleshooting.
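If you already have other MCP servers configured, pasting the snippet over the file would remove them. The sketch below is one way to script the manual step instead: it merges the spokenly entry into an existing config and offers a simple check that something is listening on the port. The function names (add_spokenly_entry, update_config_file, is_listening) are hypothetical helpers, not part of Spokenly or Cursor, and the code assumes mcp.json is plain JSON with a top-level "mcpServers" object as shown in step 1.

```python
import json
import socket
from pathlib import Path

SPOKENLY_URL = "http://localhost:51089"  # address given in the setup steps


def add_spokenly_entry(config: dict) -> dict:
    """Return a copy of config with the spokenly server added, keeping other servers."""
    servers = dict(config.get("mcpServers", {}))
    servers["spokenly"] = {"url": SPOKENLY_URL}
    return {**config, "mcpServers": servers}


def update_config_file(path: Path) -> None:
    """Read path if it exists, add the spokenly entry, and write the result back."""
    text = path.read_text() if path.exists() else ""
    config = json.loads(text) if text.strip() else {}
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(json.dumps(add_spokenly_entry(config), indent=2) + "\n")


def is_listening(host: str = "localhost", port: int = 51089, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```

To apply it, call update_config_file(Path.home() / ".cursor" / "mcp.json") and restart Cursor. If the voice tool does not appear afterwards, is_listening() returning False suggests Spokenly is not running. A True result only shows that some process accepts connections on that port, not that it is Spokenly.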
Real Workflow Examples
Task planning: You ask Cursor to interview you before generating code. The agent walks through requirements one by one: expected inputs, error handling, UI states, data flow. You answer each question by voice in seconds and the agent assembles a full brief. Ten minutes of talking instead of an hour of writing.
Refactoring: Cursor asks "Should I move these components into a shared library or keep them colocated?" You explain which ones are reused across pages, which are one-off, and how the import paths should look. Voice captures the reasoning that a typed "shared lib" never would.
Long context: Cursor asks "What should the onboarding flow look like for team invites?" Instead of typing a couple of bullet points, you walk through each step: the email template, permissions logic, and what happens when the invite expires. The agent gets everything it needs in one go.
Frequently Asked Questions
How do I add voice input to Cursor?
Download Spokenly from spokenly.app/download. Open ~/.cursor/mcp.json and add a "spokenly" entry pointing to http://localhost:51089. Then add a Cursor User Rule telling the agent to use the ask_user_dictation tool for all questions. Restart Cursor.
Is Spokenly free to use with Cursor?
Yes. Spokenly's MCP server is free, and local speech-to-text models have no limits. No subscription needed.
Does voice input work in Cursor's agent mode and composer?
Yes. Agent mode, composer, and any workflow that supports MCP tool calls can use Spokenly's voice dictation.
Can I use keyboard and voice together in Cursor?
Yes. When Cursor calls the voice tool, a recording prompt appears. Speak and press Enter, or press Escape to skip and type instead. You always have the choice.
Does my voice data stay on my Mac?
With local models (Whisper, Parakeet), all speech-to-text happens on your Mac. Audio never leaves your device. For maximum accuracy across languages, Spokenly also supports cloud models like GPT-4o Transcribe and Deepgram Nova.
What languages does voice input support?
Spokenly supports 100+ languages with both local and cloud models. For coding conversations in English, local models work well. For multilingual workflows, cloud models like Deepgram Nova offer the best accuracy across languages.
Talk to Cursor
Set up in two minutes. Free with local speech-to-text models.
Download Spokenly