Verdict at a Glance
Aqua Voice targets a specific niche in 2026: cloud dictation for developers and AI tool users. The Avalon model is tuned for coding vocabulary and AI prompts, the $8/mo price undercuts Wispr Flow, and iOS support shipped in April. The catches are real: cloud-only architecture, a 1,000-word lifetime free tier that disappears in 8 minutes, privacy stance that requires opt-in to avoid transcript storage, no HIPAA BAA, and just 49 languages.
Better fit for most users: Spokenly is free with local Parakeet V3 and Whisper, BYOK to OpenAI GPT-4o Transcribe, Deepgram Nova, and Groq at zero markup, and ships an MCP server purpose-built for Claude Code, Cursor, and Codex. Same developer audience, lower cost, offline support.
What works
- Avalon model excels at technical and coding vocabulary
- Streaming mode displays text 450ms faster than competitors
- $8/mo is cheaper than Wispr Flow ($15/mo)
- Strong fit for AI tool users (Cursor, Claude Code, ChatGPT, Gemini)
What does not
- Cloud-only, no offline mode at any tier
- 1,000-word lifetime free tier (not per-month)
- No annual discount ($8/mo monthly = $96/yr annual)
- Privacy Mode off by default, transcripts may be stored
What Is Aqua Voice

Aqua Voice is a system-wide AI dictation app built by Aqua Voice, Inc., a Y Combinator W24 startup based in San Francisco. The company was founded in 2023 by Finn Brown (CEO, dyslexic, origin story is his frustration with Dragon Professional) and Jack McIntire (CTO). They raised a $500K pre-seed with Y Combinator and Pioneer Fund leading, and report roughly $330K ARR as of September 2025.
The product positioning shifted from "voice-driven document editor" at YC launch in 2024 to "input layer for the AI agent era" in 2025-2026. Marketing now emphasizes compatibility with Claude Code, Claude Opus, ChatGPT, Cursor, and Gemini. The hold-to-talk shortcut works in any text field on Mac, Windows, and as of April 2026, iOS.
Notably, more than 50% of Aqua's user base is in Japan, driven by organic growth among programmers escaping the friction of Japanese keyboard input when working with AI agents.
Pricing in 2026
Aqua has three tiers plus a tiny free option. The pricing is uniform: $8/mo monthly equals $96/year annual, with no annual discount, which is unusual in the category.
| Tier | Price | Included |
|---|---|---|
| Free / Starter | $0 | 1,000 words lifetime (~8 min of speech), baseline model, 5 dictionary slots, no Avalon |
| Pro Monthly | $8/mo | Avalon model, unlimited dictation, 800 dictionary terms, replacements, all integrations |
| Pro Annual | $96/yr (= $8/mo) | Same as Pro Monthly, no annual discount |
| iOS Pro Annual (standalone) | $119/yr | App Store-only path, more expensive than cross-platform Pro |
| Team | $12/seat/mo | Centralized billing, org-wide Privacy Mode |
| Enterprise | Custom | SSO/SAML, zero data retention, dedicated support |
Students with a .edu email get 70% off annual plans, dropping the price to roughly $28.80/year. There is no time-bounded trial: the 1,000 free words is the evaluation window.
Compared with the rest of the category: Spokenly is free for local + BYOK ($4.99/mo for Pro on student discount), Superwhisper is $8.49/mo or $249.99 lifetime, Wispr Flow is $15/mo, MacWhisper is €59 lifetime. Aqua's monthly price is competitive, but the lack of annual discount and the lifetime-only free tier are unusual.
The Avalon Model
Avalon is Aqua's proprietary speech-to-text model, launched August 22, 2025, replacing their previous Whisper-based pipeline. The model was explicitly trained on human-computer interaction speech (prompts, code, email), not audiobook or meeting-style training data.
Aqua's published benchmarks
- Beats Whisper Large V3 on 7 of 8 OpenASR splits
- Beats ElevenLabs Scribe on 6 of 8 OpenASR splits
- 97.4% accuracy on AISpeak-10 (coding and AI terms) vs Whisper Large V3 at 65.1% and NVIDIA Canary 1B at 51.5%
- 3.2% WER on LibriSpeech-clean (Google Docs Voice Typing roughly 5.5%)
- Marketing claim: "99.1% accuracy out-of-the-box"
Latency claims: sub-50ms startup, 450ms to 1 second text insertion. Streaming mode displays words as the model transcribes, which feels faster than competitors who paste the final result in one chunk.
Caveat: every Avalon request goes to Aqua's cloud servers. There is no on-device version. For users who need the same technical-vocabulary accuracy offline, Spokenly's BYOK to GPT-4o Transcribe gives you cloud-quality on demand, and local Parakeet V3 plus Whisper Large V3 Turbo cover the offline case.
AI Editing and Context Awareness
Aqua's UX angle is natural-language editing. You can dictate "make this a list", "rephrase that", "add bullet points", or "redo the second sentence" mid-flow. No memorization of specific command syntax. The model rewrites in place. The trade-off is that all of this happens in Aqua's cloud, so the same task on Spokenly's local Parakeet model would stay on-device with a user-written prompt.
Context awareness is the second pillar. Aqua reads the active app or website and adjusts tone automatically: casual for Slack, formal for Gmail, technical for Cursor. Per-app custom instructions let you fine-tune the behavior. The "like it appears on screen" correction feature leverages screen context to fix small errors based on surrounding text.
Filler word removal happens automatically. Aqua can also fill in missing words or names from document context, which is helpful for technical writing but occasionally produces unwanted edits.
Spokenly offers the same AI text processing with explicit user-written prompts (GPT-4, Claude, Gemini, Grok, local Ollama models) and a free tier instead of $8/mo cloud-only.
Platforms and System Requirements
- macOS (Apple Silicon and Intel)
- Windows 10 and 11
- iOS 17+ (launched April 17, 2026 as a keyboard extension)
- No Android, no Linux, no native browser extension
- Current desktop version: v0.11.9 (pre-1.0 versioning, indicating young product)
iOS keyboard quirk worth noting: Apple's security model means the Aqua app briefly foregrounds before returning focus to the text field, adding a few seconds per session. Most third-party voice keyboards on iOS hit the same Apple constraint to some degree.
Privacy and Compliance
Aqua processes everything in the cloud. No on-device mode at any tier. The privacy posture is built around an opt-in Privacy Mode and SOC 2 audit, not local-first architecture.
What you should know
- Privacy Mode is OFF by default for individuals. Without it, transcripts may be stored on Aqua servers.
- Teams and Enterprise can enforce Privacy Mode org-wide.
- SOC 2 Type II audited by Advantage Partners, Vanta-managed trust center.
- No public HIPAA BAA. Unsuitable for healthcare or other regulated workflows.
- Privacy policy does not specify whether transcripts are used to train Avalon.
- Background audio sensitivity: Aqua transcribes nearby TVs, radios, and other voices, which is a problem in shared spaces.
For workflows that cannot tolerate cloud processing or training-data ambiguity, Spokenly's Local Only Mode runs Parakeet or Whisper entirely on your Mac with zero outbound network traffic. Same developer-focused use cases, different privacy posture.
What Users Are Saying
Aqua's public sentiment is concentrated on Product Hunt (5.0/5 from 14 reviews) and Japanese developer blogs. Reddit r/macapps and r/productivity coverage is thinner than for Wispr Flow or Superwhisper, which makes the rating less informative than the volume suggests.
Top praise points
- Streaming mode latency is consistently called the fastest
- Technical accuracy: rarely misses coding terms, model names, or proper nouns
- Natural-language editing without command syntax
- 9to5Mac reviewer dictated 352,000 words at 126 WPM over 105 days (April 2026 review)
- Beats Apple Dictation by 17:1 error margin in one Steve Jobs commencement speech test (Aug 2025)
Top complaints
- Background audio: transcribes nearby voices, TVs, music — bad in cafes and family homes
- No offline mode means flights, train rides, and bad Wi-Fi kill the workflow
- Privacy Mode opt-in default is uncomfortable for some users
- Free tier is 1,000 words lifetime, not per-month, so genuine evaluation requires paying
- Long-text paste failures: occasional clipboard recovery needed
- One Product Hunt reviewer deleted after 3-4 months and switched back to VoiceInk
Aqua Voice Alternatives
Best free alternative
Spokenly
Free with local Parakeet V3 (multilingual, 25 languages on Apple Silicon) and Whisper Large V3 Turbo. BYOK to GPT-4o Transcribe, Deepgram Nova, and Groq at zero markup. Custom AI prompts with Claude, GPT, Gemini, Grok, or local Ollama for context-aware rewriting. MCP server for Claude Code, Cursor, and Codex. Works system-wide on Mac, iPhone, and Windows.
Download Spokenly freeWispr Flow
Cloud-only cross-platform dictation with AI cleanup. $15/mo, supports Android, broader language coverage (100+) than Aqua's 49.
See the comparisonSuperwhisper
Mac power user favorite with local Whisper and Parakeet plus cloud BYOK. $249.99 lifetime tier avoids subscriptions entirely.
See the comparisonApple Dictation
Free, built into macOS and iOS. Limited on technical terms but on-device and zero setup.
See the comparisonWho Should Use It
Choose Aqua Voice if
- You dictate heavily inside AI tools (Cursor, Claude Code, ChatGPT, Gemini)
- Technical and coding vocabulary accuracy is your top priority
- Cloud-only architecture is acceptable
- You want natural-language text editing without command syntax
- You are a student (.edu) and want 70% off annual
- You write in Japanese with AI agents
Skip Aqua Voice if
- You need offline dictation
- Privacy and training-data clarity are non-negotiable
- You handle regulated audio that needs HIPAA BAA
- You work in noisy environments (cafes, family homes)
- You need Android support
- You dictate in languages outside Aqua's 49 supported
FAQ
Is Aqua Voice worth $8 per month?
If you write a lot inside AI tools like Cursor, Claude Code, ChatGPT, or Gemini and want technical-term accuracy, yes. Aqua's Avalon model is specifically tuned for prompt-style speech and code vocabulary. If you want offline mode, BYOK to multiple providers, or a free tier that actually works, Spokenly is a better fit.
Does Aqua Voice work offline?
No. Aqua is cloud-only. Every dictation request sends audio to Aqua's servers running the Avalon model. There is no on-device mode at any plan tier. If offline is a requirement, look at Spokenly with local Parakeet or Whisper, Superwhisper, or MacWhisper.
What is the Avalon model?
Avalon is Aqua's proprietary speech-to-text model, launched August 2025 to replace their Whisper-based pipeline. It is specifically trained on prompt-style speech, code, and email (not audiobooks or meetings). Aqua claims 97.4% accuracy on coding and AI terms versus Whisper Large V3 at 65.1%, and 3.2% WER on LibriSpeech-clean (Google Docs Voice Typing sits around 5.5%).
Is Aqua Voice better than Wispr Flow?
For technical accuracy and speed, Aqua wins. Avalon outperforms Wispr Flow's pipeline on coding and AI vocabulary, and Aqua's streaming mode shows text 450ms faster. For platform breadth (Android, broader language support, polished mobile experience), Wispr Flow wins. Aqua is also cheaper at $8/mo vs $15/mo.
Is Aqua Voice safe and private?
Aqua is cloud-only, so every word leaves your device. Privacy Mode is opt-in (off by default for individuals). With it off, transcripts may be stored on Aqua's servers. SOC 2 Type II is audited by Advantage Partners, and Aqua publishes a Vanta trust center. There is no public HIPAA BAA, so it is unsuitable for healthcare workflows. Aqua's privacy policy does not specify whether transcripts are used to train Avalon.
Does the free tier actually work?
Sort of. Aqua's free tier is 1,000 words total (lifetime), not per week or per month. After ~8 minutes of natural speech you hit the paywall. Compare with Spokenly's free tier (unlimited local models) or Wispr Flow's 2,000 words per week. Aqua's free tier is more of a sample than a usable plan.
Does Aqua Voice work on iPhone?
Yes, since April 17, 2026. Aqua's iOS keyboard extension is in the App Store. One Apple platform constraint to know: the app briefly foregrounds Aqua before returning focus to your text field, adding a few seconds per session compared with always-running competitors.
Does Aqua Voice work in Japan?
Yes, and more than 50% of Aqua's user base is in Japan. The company reports organic growth driven by programmers escaping Japanese keyboard input friction, especially in workflows around Claude Code and Claude Opus. Japanese language support is solid in the multilingual Avalon releases.
