TL;DR: Voice typing for Cursor is a perfect fit because almost everything you type into Cursor is natural language, and typing it is the bottleneck. With Infina you can speak thousands of words of instructions a day, keep Cursor's agent (and a terminal, and Claude Code) busy in parallel, and do more in less time. Infina runs the whole loop by voice: dictate, send, and switch apps, with on-device transcription by default. It's $99 once with a 7-day refund, so it pays for itself.
Infina is our product, so we are biased. The workflow below is still accurate, surface by surface, including where a free tool is enough.
Why voice typing for Cursor works better than in a normal editor
A classic editor is keystroke-dense: symbols, operators, precise syntax. Miserable to dictate. Cursor inverted that.
Per Cursor's own docs, its core surfaces take plain English:
- Chat: questions about the codebase, debugging conversations.
- Composer / Agent: multi-file instructions, like "add auth to these routes and update the tests."
- Inline edits: select code, describe the change you want.
That means the text you produce all day in Cursor is prose, not code. And prose is exactly what dictation is good at: you speak your intent, the model writes the syntax. Speaking is also about three times faster than typing, the core argument in dictate prompts to AI.
The more you lean on Cursor's AI, the smaller the share of your day that deserves a keyboard.
There's a second effect: spoken instructions tend to be longer and richer than typed ones, because talking is cheap. "Refactor this" becomes "refactor this to use the shared client, keep the public API the same, and don't touch the tests." Richer instructions get you better agent runs.
Dictating into chat and Composer
The mechanics are simple, which is the point. With Infina:
- Click into Cursor's chat panel or Composer input.
- Hold Option (⌥), speak, release. The text lands at the cursor.
- Press Enter to send.
Because Infina types at the OS level into whatever's focused, there's nothing Cursor-specific to install. Chat, Composer, inline-edit boxes, the built-in terminal, commit message fields: same gesture everywhere. The same is true if your editor of choice is VS Code or Windsurf; Infina doesn't care which app is focused.
Transcription runs on your Mac by default (Apple Silicon), works offline, and your audio never leaves the device.
Two habits worth building for Cursor specifically:
- Reference code by selection, not by voice. Don't dictate identifiers character by character. Select the code or @-mention the file with a click, then describe the change verbally.
- Dictate the whole instruction in one breath. Composer does better with one complete brief than three fragmented follow-ups. Speaking makes complete briefs cheap.
Iterating on agent output by voice
The real time sink in Cursor isn't the first prompt. It's the loop after it: the agent produces a diff, you read it, and you respond.
Typed, each response is a small chore, and the friction quietly pushes you toward accepting mediocre output. Spoken, the loop tightens. Eyes on the diff, hold Option, and say what you see:
- "The middleware change is right but revert what you did to the config file."
- "Good, now do the same pattern for the other two endpoints."
- "You misunderstood: the cache should be per-user, not global. Try again."
None of this needs punctuation or polish; Cursor's models parse conversational corrections fine. You review more, settle less, and ship faster. This style of working, describing changes to the agent by speaking, is vibe coding by voice.
The hands-free loop for long agent sessions
Push-to-talk still ties every instruction to the keyboard: hold the key, then press Enter, then Cmd-Tab when the agent starts a long run. During heavy sessions (agent grinding in Cursor, a build in the terminal, maybe a Claude Code session on the side) Infina's hands-free mode removes those last touches:
- Double-tap Cmd (⌘) to enable hands-free. Listening runs on-device; nothing is recorded or sent while it waits.
- Speak a sentence that starts with "type", then your instruction to Cursor. Infina types it in.
- Say "send": Enter is pressed for you.
- While the agent runs, say "open Terminal", check the build, queue a command there, and switch back. All by voice, from a few feet away.
This is how one person keeps multiple agents busy at once. It's the mode for reviewing on a second screen, standing, or pacing while the agent works.
One honest note: hands-free is our newest surface and labeled experimental, so it ships off by default and it's happiest in a reasonably quiet room. Push-to-talk always works as the fallback. The full concept is laid out in hands-free voice prompting.
When raw dictation is enough, and when it isn't
Base Infina deliberately outputs raw text: on-device transcription with fast rule-based cleanup, no LLM rewrite. For 90% of what you type into Cursor, that's the correct trade:
- Raw is fine: chat, Composer instructions, inline-edit requests, terminal-ish notes. Cursor's AI doesn't care about your commas, and raw means no cloud round-trip and no latency tax.
- Polish matters: text that humans read verbatim, like code comments, commit messages, PR descriptions, README prose. Dictating those raw means cleaning them up by hand.
If you dictate a lot of human-facing text, the clean path is Infina's optional $10/month cloud add-on: sharper cloud transcription plus LLM cleanup and more languages, on an app you already own. That is the exact job subscription tools charge $15/month forever for.
Honest alternatives for Cursor
- macOS built-in dictation: free, and it types into Cursor like any text field. Triggered from the keyboard, mixed accuracy on technical vocabulary, but the zero-cost way to test whether the habit sticks.
- Wispr Flow: has a Cursor extension and covers Mac, Windows, and phones. The trade-offs: it's cloud-only (no offline mode) and subscription-priced at $15/month forever, versus Infina's on-device default, one-time license, and $10/month cloud add-on for LLM cleanup when you want it. Full comparison: Wispr Flow vs Infina.
- Infina: raw, private, on-device dictation plus the piece nothing else in this list has: voice sending and voice app-switching for hands-free agent sessions. $99 one-time at the time of writing, 7-day no-questions refund, optional $10/month cloud add-on; details on pricing. If subscriptions are the specific thing you're avoiding, see dictation apps without a subscription.
For occasional dictation, free is fine. For hours of daily agent work, the hands-free loop is the difference, and one $99 purchase covers it for good.
FAQ
Can I dictate into Cursor's chat and Composer? Yes. System-level dictation tools type into any focused text field, so chat, Composer, inline-edit boxes, and Cursor's terminal all work with the same hold-Option gesture. No extension required.
Do I need an extension to talk to Cursor? Not with a system-wide tool like Infina or macOS dictation; they type wherever your cursor is. Wispr Flow offers a dedicated Cursor extension; that's their integration model, tied to their cloud subscription.
Does my dictation need punctuation for Cursor's AI? No. Chat and Composer handle conversational, unpunctuated speech well; that's why raw dictation is the right default for prompting. Punctuation matters when you dictate comments, commits, or docs that humans read.
Can I send a Cursor prompt without touching the keyboard? With Infina's hands-free mode, yes: speak a sentence that starts with "type", then say "send". It types the message and presses Enter, and "open [app]" moves you between windows by voice.
Is my code or audio sent to the cloud when I dictate? Not by default with Infina. Transcription runs entirely on your Mac and nothing is stored, so it works offline too; cloud processing exists only as the optional add-on. See on-device dictation for Mac.
What does Infina cost for Cursor users? Same as for everyone: $99 one-time (at the time of writing), no subscription, all 1.x updates included, 7-day money-back guarantee. The optional cloud add-on for polished output and more languages is $10/month.
The bottom line
Cursor already turned coding into describing changes in English; voice typing just moves that English through a faster pipe. Speak your instructions and you produce more of them, in less time, with richer detail.
Start free with macOS dictation to test the habit. Then pick by your real bottleneck: if it's polished prose everywhere, Infina's $10/month cloud add-on handles that on an app you own outright.
If it's long agent sessions where you want to instruct, send, and switch apps without leaving review mode (hands-free, on-device, no subscription), that's precisely the loop Infina was built for. $99 once, risk-free for 7 days.