Open source voice-to-text dictation app for macOS with on-device AI enhancement.
Install with Homebrew: brew install --cask fluidvoice
Manual download: latest release
Important
This project is free and open source under GPLv3. If FluidVoice is useful to you, please star the repository — it helps visibility and keeps development going.
- Insanely fast Parakeet — rebuilt Parakeet implementation with pretty much zero delay between speaking and seeing words on screen
- Fluid Intelligence — fully local AI model for on-device dictation enhancement. No cloud, no API keys, no data leaving your Mac
- Better Theming — adaptive light/dark theme with a compact toolbar switcher
- Refreshed Onboarding — language-first voice engine setup, real dictation tryout, and AI enhancement setup in one clean pass
Warning
Based on early feedback, Fluid Intelligence may cause you to unsubscribe from other dictation apps and save money. You've been warned.
FluidVoice is fully open source under GPLv3. Fluid Intelligence is a separate, privately maintained local AI runtime that powers advanced on-device dictation enhancement — smart formatting, context-aware capitalization, and post-processing — all running locally on your Mac.
The app works great on its own with any supported speech model and optional cloud AI providers. Fluid Intelligence adds a fully local, private AI layer for users who want on-device enhancement without sending data anywhere.
We're keeping Fluid Intelligence private for now so we can sustainably offer the core dictation experience for free. This may change in the future.
| Email Template | Flowers |
change_time_name.mp4 |
Email_template.mp4 |
| Change Time & Name | Emoji |
emoji.mp4 |
flowers.mp4 |
| Hyphens & Numbers | |
hyphen123.mp4 |
FluidVoice_1.5_cmd_mode.mp4
writemode_FluidVoice_1.5.mp4
- Fluid Intelligence — on-device AI enhancement for smart formatting, context-aware capitalization, and post-processing, all running locally on your Mac with zero data leaving your machine
- Command Mode — control your Mac by voice: launch apps, run shortcuts, trigger system actions, and automate workflows without touching the keyboard
- Write Mode — write or rewrite text directly in any text field across any app. Select text and rewrite it, or dictate new content inline
- Live Preview — real-time transcription overlay with notch support, so you see words appear as you speak
- Multiple Speech Models — Nemotron Speech 3.5, Parakeet Flash, Parakeet TDT v3 & v2, Cohere Transcribe, Apple Speech, and Whisper. Pick the model that fits your language and latency needs
- AI Enhancement — optional post-processing via OpenAI, Groq, custom providers, or local Fluid Intelligence for cleaner, more accurate transcripts
- Audio History — optional local recording history with budget controls and ZIP export, so you can review past dictations without cloud storage
- Today-Usage Stats — daily usage tracking at a glance with a stats header card and toolbar pill
- Adaptive Theming — light/dark theme that follows your system, with a compact toolbar switcher
- Global Hotkey — instant voice capture from anywhere, no app switching needed
- Smart Typing — direct insertion into any app via accessibility APIs for reliable, app-independent text entry
- Menu Bar Integration — quick access, status, and settings from the menu bar
- Auto-Updates — seamless updates with an optional beta channel for early previews
- Per-App Configuration — assign different prompt sets to different apps, so your dictation adapts to whatever you're working in. Fully optional
- Notch-Aware Overlay — transcription overlay that fits cleanly around the MacBook notch, or use a standard overlay if your Mac doesn't have one
- Local-First — your voice and text never leave your machine unless you opt in to a cloud AI provider
- Fastest Parakeet on Mac — one of the fastest native implementations of Parakeet on macOS, with near-instant transcription and minimal latency
- Configurable Overlay — choose from pill-shaped to large overlay sizes to show live preview, or keep it minimal. Everything is optional
- Everything is Optional — AI enhancement, Fluid Intelligence, audio history, analytics, and beta builds are all opt-in. The core dictation experience works out of the box with zero configuration beyond permissions and a hotkey
| Model | Best for | Language support | Download size | Hardware |
|---|---|---|---|---|
| Nemotron Speech 3.5 — Ultra Fast Low Latency | Streaming-capable multilingual dictation | ~40 languages | ~670 MB | Apple Silicon |
| Nemotron 3.5 Multilingual | Higher-accuracy multilingual dictation | ~40 languages | ~530 MB | Apple Silicon |
| Parakeet Flash (Beta) | Lowest-latency live English dictation | English | ~250 MB | Apple Silicon |
| Parakeet TDT v3 | Fast default multilingual dictation | 25 languages | ~500 MB | Apple Silicon |
| Parakeet TDT v2 | Fastest English-only dictation | English | ~500 MB | Apple Silicon |
| Cohere Transcribe | High-accuracy multilingual dictation | 14 languages | ~1.4 GB | Apple Silicon |
| Apple Speech | Zero-download native macOS speech | System languages | Built-in | Apple Silicon + Intel |
| Whisper Tiny / Base / Small / Medium / Large | Broad compatibility, including Intel Macs | 99 languages | ~75 MB to ~2.9 GB | Apple Silicon + Intel |
Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Greek, Hungarian, Italian, Latvian, Lithuanian, Maltese, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Spanish, Swedish, and Ukrainian.
English.
English, French, German, Italian, Spanish, Portuguese, Greek, Dutch, Polish, Mandarin, Japanese, Korean, Vietnamese, and Arabic.
System language support depends on the macOS speech recognition languages available on your machine.
Whisper supports up to 99 languages, depending on the model size you choose.
-
Install with Homebrew:
brew install --cask fluidvoice
Or download the latest release.
-
Grant permissions — FluidVoice will ask for microphone and accessibility access. Both are required for dictation and typing into other apps.
-
Set your hotkey — pick a global hotkey in settings that triggers voice capture from anywhere.
-
Go through onboarding — choose your voice model based on your language and latency needs. Models range from zero-download Apple Speech to high-accuracy Nemotron and Whisper.
-
(Optional) Enable Fluid Intelligence — download the local AI model during onboarding for on-device dictation enhancement. Everything runs locally, no data leaves your Mac.
-
(Optional) Bring your own AI provider — add an OpenAI, Groq, or custom provider API key for cloud-based enhancement. Keys are stored securely in macOS Keychain. Select "Always allow" for key access.
-
(Optional) Opt in to beta builds —
Settings → Automatic Updates → Beta Releasesfor early access to new features.
- macOS 15.0 (Sequoia) or later
- Apple Silicon Mac for all models
- Intel Macs supported via Whisper models (from 1.5.1+)
- ~1 GB disk space for a voice model
- ~3.5 GB disk space for the Fluid Intelligence model (optional)
- Microphone access
- Accessibility permissions for typing
git clone https://github.com/altic-dev/FluidVoice.git
cd FluidVoice
open Fluid.xcodeprojBuild and run in Xcode. All dependencies are managed via Swift Package Manager.
xcodebuild -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS' build CODE_SIGNING_ALLOWED=NOContributions are welcome! Please create an issue first to discuss major changes before submitting a pull request.
- Clone and open in Xcode as above.
- Signing:
FluidVoice → Signing & Capabilities → Automatically manage signing → pick your Team(Personal Team is fine). Stored inxcuserdata/(gitignored). - Build and run — SPM handles dependencies.
- (Optional) Pre-commit hook to prevent accidental team ID commits:
cp scripts/check-team-id.sh .git/hooks/pre-commit chmod +x .git/hooks/pre-commit
- One feature or fix per PR — keep changes focused and atomic
- Create an issue first so work is trackable before review
- Discuss non-trivial changes before opening a PR
- Follow the PR template
- Test thoroughly on your machine
- Never commit personal team IDs or API keys
- Check
git diffbefore committing
xcodebuild test -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS'CI uses unsigned builds:
xcodebuild test -project Fluid.xcodeproj -scheme Fluid -destination 'platform=macOS' CODE_SIGNING_REQUIRED=NO CODE_SIGNING_ALLOWED=NOFluidVoice is local-first. Your voice, audio, and transcribed text never leave your machine unless you explicitly opt in to a cloud AI provider.
Anonymous analytics are enabled by default to track app health and feature usage. You can disable at any time from Settings → Share Anonymous Analytics.
Collected:
- App version, build, macOS version
- Low-cardinality feature/config flags (e.g. app mode, major settings)
- Approximate usage ranges (not exact values)
- High-level success/error outcomes
Not Collected:
- Voice, raw audio, or transcribed text
- Selected text, prompts, or AI responses
- Terminal commands, window titles, file paths, clipboard, or typed content
- Any personal or private information
Join our Discord: https://discord.gg/VUPHaKSvYV
Follow development on X: @ALTIC_DEV
From 2026-02-23 onward, this project is licensed under the GNU General Public License, Version 3.0 (GPLv3).
Versions published before this date were licensed under Apache License 2.0.

