Voice in.
Text out.
Instant.
Press a hotkey, speak naturally, and watch your words insert instantly in any app.
Local AI. No cloud required.
One-time purchase — use on 3 Macs, lifetime updates
Because paying a subscription for a Whisper wrapper is crazy.
How can I help you with your code today?
Hi everyone,
Push-to-Talk Dictation.
It's like a walkie-talkie for your Mac. Hold the shortcut, speak your mind, release to insert. Or tap once to toggle. Works in Xcode, Slack, Notes, browser — everywhere.
Hold to Speak
Sotto listens while you hold and transcribes when you release.
Instant Insert
No copying needed. Text appears directly in your active app.
Stop typing at the
speed of slow.
The average person types at 40 words per minute but speaks at 150. Your brain works faster than your fingers. Sotto closes that gap.
Don't let friction kill your flow state. Capture complex thoughts, detailed feedback, and long-form content in seconds, not minutes.
Speed up your
vibe coding.
Let's be honest: when you're typing, you're lazy. You type "fix this" and hope the AI reads your mind. And then you get garbage code back.
With Sotto, you can effortlessly ramble about the edge cases, the architecture, and the specific constraints. Better context in = better code out.
- Describe complex bugs in seconds
- Explain architectural decisions clearly
- Give detailed code review feedback
- Stop wrestling with context windows
Your voice, any app.
Sotto works as a system-wide layer. Talk anywhere, type everywhere.
Reply in seconds, not minutes.
File bugs while they're fresh.
Capture ideas at the speed of thought.
Figure out Q1 priorities
✓ Finalize pricing model
Works with Slack, Discord, Linear, Notion, email, and literally any app with a text field.
Voice-first. Privacy-first.
Built for professionals who value speed and privacy.
Push-to-Talk or Toggle
Hold to speak, or press once to start/stop. Fully customizable hotkeys that work system-wide.
Auto-Paste Anywhere
Transcripts paste directly into any app. No copy-paste needed. Just speak and it appears.
100% Local & Private
Default models run entirely on your Mac. Your audio never leaves your device.
Cloud Models Available
Connect OpenAI or Groq for maximum accuracy. Great for accents and technical terms.
Native Swift App
Built with Swift and SwiftUI for a true native macOS experience. Lightweight and fast.
Recording History
Every recording saved with timestamps. Search, replay, and re-transcribe anytime.
Drop audio files here
or click to browse
Transcribe any audio file.
Got voice memos, meeting recordings, or podcast episodes? Just drag and drop them into Sotto. We'll transcribe them using whichever model you choose.
Teach Sotto your words.
Add specialized terms, names, and jargon to your dictionary. Sotto uses these as hints for better accuracy on your unique vocabulary.
Technical Terms
Acronyms, frameworks, and industry jargon.
Names & Brands
People, companies, and products you mention often.
Foreign Words
Loanwords and phrases from other languages.
Your Vocabulary
12 wordsYour audio is always saved.
Every recording is saved locally on your Mac. Not happy with the transcription? Re-transcribe with a different model anytime. Switch from Tiny to OpenAI with one click.
Re-transcribe Anytime
Try a different model if the first result wasn't perfect.
Play Back Audio
Listen to your original recording whenever you need.
Search Everything
Full-text search across all your transcripts.
Pro tip: Start with Tiny for speed, then re-transcribe important recordings with OpenAI for maximum accuracy.
Remember to update the API docs...
Meeting notes: discussed Q1 roadmap...
Quick reminder about the deploy...
Choose your engine.
Run locally for privacy, or connect to the cloud for maximum accuracy. All models use the Neural Engine for fast inference.
WhisperKit Models
Private & Offline — Powered by Apple's Neural Engine
Tiny
Quick notes, drafts
Base (English)
Everyday use
Small (English)
Important documents
Large V3 Turbo
Best overall quality
Distil Large V3
Great speed/accuracy balance
NVIDIA Parakeet Models
State-of-the-art accuracy — Powered by Apple's Neural Engine
Parakeet v2 (English)
NVIDIA ParakeetHighest recall, English only
Parakeet v3 (Multilingual)
NVIDIA ParakeetMultiple languages supported
Cloud Models
Maximum Accuracy — Bring your own API key
OpenAI
gpt-4o-mini-transcribeIndustry-leading accuracy with OpenAI's latest speech models.
Groq
Whisper Large V3 TurboFast, accurate results powered by Groq's inference engine.
Speak any language.
Dictate in 90+ languages. Switch instantly with a click or assign hotkeys for your most-used languages.
Quick Language Switcher
Switch languages instantly from the recording bar. Just click the flag and pick your language.
Per-Language Hotkeys
Assign a unique hotkey to each language. Press once to start dictating in that language.
Auto-Detect Mode
Let the model automatically detect which language you're speaking. Great for multilingual conversations.
One hotkey per language
Assign a dedicated keyboard shortcut to each language you use. Press your hotkey and immediately start dictating in that language — no menu navigation needed.
- ✓Instant language switching with a single keypress
- ✓Works system-wide in any application
- ✓Set up as many languages as you need
Automatic text cleanup.
Toggle rules that apply to every transcription. Fix grammar, remove filler words, and polish your text automatically.
How it works
Enable the rules you want, and they'll automatically apply to every transcription. Rules combine together — enable "Fix Grammar" and "Remove Filler Words" and both will be applied. Works with AI Functions too.
Fix Grammar & Spelling
Correct grammar, spelling, and punctuation errors
"i was going too the store but i didnt have no money"
"I was going to the store, but I didn't have any money."
Remove Filler Words
Remove 'um', 'uh', 'like', 'you know', etc.
"So like, um, I was thinking that, you know, we should basically just, uh, go with option A"
"I was thinking that we should go with option A"
Smart Punctuation
Add proper sentence structure and punctuation
"the meeting is at 3pm we need to prepare the slides also dont forget to invite sarah"
"The meeting is at 3pm. We need to prepare the slides. Also, don't forget to invite Sarah."
Professional Tone
Ensure a polished tone for business communication
"hey can u send me that thing we talked about thanks"
"Hi, could you please send me the document we discussed? Thank you."
Create your own custom rules with any instructions you want.
Transform your voice.
Process your dictation with AI before pasting. Write emails, format code prompts, translate, summarize — all with a single click.
One-click AI processing
Select a function from the recording bar, speak naturally, and your text is automatically processed before pasting. No extra steps. Each function uses your chosen AI provider and model.
Professional Email
Clean up and professionalize email drafts
"hey can we push back the meeting i have another thing that came up also did you get the docs i sent"
"Hi, Would it be possible to reschedule our meeting? Something has come up that I need to attend to. Also, I wanted to confirm whether you received the documents I sent earlier. Thank you."
Vibe Coding
Transform ideas into LLM-ready coding prompts
"i want to make a button that like when you click it it shows a modal and then actually no make it a dropdown instead with like three options"
"Create a button component that opens a dropdown menu on click. The dropdown should display three selectable options. Style it to match the existing design system."
Summarize
Create concise summaries of your thoughts
"so basically the main takeaway from the meeting was that we need to focus on the mobile experience first because most of our users are on phones and then after that we can work on the desktop version and also john mentioned we should look into the analytics to see what features people actually use"
"Key takeaways: (1) Prioritize mobile experience due to majority mobile user base, (2) Desktop version comes after mobile, (3) Review analytics to identify most-used features."
Casual Rewrite
Make formal text more friendly and conversational
"I am writing to inform you that your request has been processed and you should expect to receive confirmation within 3-5 business days."
"Just wanted to let you know we've processed your request! You should get a confirmation in the next 3-5 business days."
Translate
Translate your speech to any language
"Thank you for your help with this project. I really appreciate it."
"Gracias por tu ayuda con este proyecto. Realmente lo aprecio."
Custom Functions
Create your own with any AI provider
Use OpenAI, Anthropic, Gemini, Groq, Mistral, or Grok with your own custom prompts.
Supported AI Providers
RIP Subscription Wrappers
Imagine paying a monthly fee for a Whisper wrapper. Bruh.
OpenWhisper&WisprFlow
Died of recurring billing syndrome
OpenWhisper
Died of charging $20/mo for a free model
WisprFlow
Fatal case of recurring billing syndrome
* No wrappers were harmed in the making of this landing page. Just their business models.
Pay once. Use forever.
No subscriptions. No recurring fees. No BS.
One payment, lifetime access on 3 devices.
One-time payment
Because paying a subscription for a Whisper wrapper is crazy.
Everything included
- Lifetime license — pay once, own forever
- Use on up to 3 Macs
- All future updates included
- Local Whisper models (Tiny to Medium)
- Cloud transcription (OpenAI, Groq)
- Import & transcribe any audio file
- Recording history with re-transcribe
- Custom vocabulary dictionary
- Auto-paste & auto-copy
Tired of subscriptions? Us too. That's why Sotto is a one-time purchase. No monthly fees, no annual renewals. Just pay once and it's yours forever.