Can I use NVIDIA Parakeet on a Mac?

Yes. Dictato runs NVIDIA's Parakeet model natively on Apple Silicon (M1 and later) via the FluidAudio runtime. The model is optimized for the Apple Neural Engine, so latency stays around 80ms on a modern Mac with no CUDA, no Linux VM, and no cloud round-trip.

Is there an NVIDIA Parakeet Mac app?

Dictato ships NVIDIA Parakeet on macOS alongside WhisperKit, Apple SpeechAnalyzer, and Qwen3-ASR. You can pick Parakeet as your default engine in settings and dictate directly into any macOS app. Dictato is distributed directly from dicta.to (not through the Mac App Store), which is what lets it use the Accessibility API for system-wide text injection.

Do I need an NVIDIA GPU to run Parakeet on Mac?

No. Parakeet was published by NVIDIA, but the model runs on Apple Silicon via Core ML and the Apple Neural Engine. You need an Apple Silicon Mac (M1 / M2 / M3 / M4), macOS 14 or later, and about 2.3 GB of disk for the model. No discrete GPU required.

How fast is Parakeet on Apple Silicon?

About 80ms end-to-end on an M-series Mac, fast enough that text appears as you speak. That's 3-6x faster than Whisper, and faster than Apple's built-in dictation in benchmarks we ran on 13,000 recordings. The model predicts each token together with its duration, so the decoder can skip ahead over audio frames instead of stepping through every one, which is why latency stays so low on the Neural Engine.

Which languages does Parakeet support on Mac?

Parakeet v3 supports 25 major languages including English, Spanish, French, German, Italian, Portuguese, Japanese, Korean, and Chinese. For broader coverage you can switch to WhisperKit (99 languages) or Qwen3-ASR (30 languages with native language hints). Dictato ships all three and lets you switch in one click.

Is Parakeet better than Whisper on Mac?

For English dictation on Apple Silicon, yes: Parakeet is 3-6x faster and slightly more accurate. For rare languages or batch transcription of audio files, Whisper is still the better pick. The honest answer: Parakeet wins live dictation on M-series Macs, Whisper wins broad multilingual coverage. See the [full comparison](/blog/whisper-vs-parakeet-vs-apple-speech-engine).

How much does Dictato cost?

19.99€ once for a lifetime license, no subscription. Everything runs on your Mac, so after that one payment there is nothing left to pay: no server, no API key, no per-word billing. The 7-day free trial includes all four engines, so you can confirm Parakeet runs cleanly on your Mac before paying anything.

NVIDIA Parakeet on Mac: Real Dictation App, 80ms, Offline (2026)

You can run NVIDIA Parakeet locally on a Mac today, at about 80ms latency, with no Python and no cloud. It dictates faster than Apple’s own built-in dictation on the same hardware. Here’s how to get it running, and why it’s so fast on Apple Silicon.

Try Dictato free for 7 days →

What is NVIDIA Parakeet?

Parakeet is a family of speech recognition models published by NVIDIA. The version most Mac users care about is Parakeet-TDT (Token-and-Duration Transducer), a streaming-friendly architecture that predicts each token jointly with its duration, so the decoder can skip over audio frames instead of evaluating every one. The result is a model that hits real-time speech-to-text with a fraction of the latency of an autoregressive encoder-decoder model like Whisper.

The current production release at the time of writing is Parakeet v3. It supports 25 languages and the model file weighs around 2.3 GB. NVIDIA released it under an open license, which is why it’s free to embed in third-party apps.
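The token-and-duration idea can be illustrated with a toy greedy decoder. This is only a sketch under stated assumptions: `TDTStep`, `DecodeResult`, and `greedyTDTDecode` are hypothetical names, and the `predict` closure stands in for the real encoder and joint network, which are not shown. It is not Parakeet's or FluidAudio's actual code.

```swift
/// One decoder decision: a token (nil stands for "blank", emit nothing)
/// plus how many encoder frames to jump ahead. Hypothetical type.
struct TDTStep {
    let token: String?
    let duration: Int
}

struct DecodeResult {
    let tokens: [String]
    let jointCalls: Int   // how many times the stand-in network was evaluated
}

/// Greedy decoding loop: emit the predicted token, then advance by the
/// predicted duration instead of visiting every frame.
func greedyTDTDecode(
    frameCount: Int,
    maxSymbolsPerFrame: Int = 4,
    predict: (_ frame: Int, _ previousToken: String?) -> TDTStep
) -> DecodeResult {
    var tokens: [String] = []
    var frame = 0
    var jointCalls = 0
    var symbolsOnThisFrame = 0

    while frame < frameCount {
        let step = predict(frame, tokens.last)
        jointCalls += 1
        if let token = step.token {
            tokens.append(token)
            symbolsOnThisFrame += 1
        }
        // A duration of 0 means "stay on this frame and emit more".
        // Force progress on a blank, or after too many symbols on one frame.
        var advance = max(0, step.duration)
        if advance == 0 && (step.token == nil || symbolsOnThisFrame >= maxSymbolsPerFrame) {
            advance = 1
        }
        if advance > 0 {
            frame += advance
            symbolsOnThisFrame = 0
        }
    }
    return DecodeResult(tokens: tokens, jointCalls: jointCalls)
}

// Scripted predictions for a 12-frame utterance (made-up values).
let script: [Int: TDTStep] = [
    0: TDTStep(token: "hel", duration: 3),
    3: TDTStep(token: "lo", duration: 4),
    7: TDTStep(token: nil, duration: 2),
    9: TDTStep(token: "world", duration: 3),
]
let result = greedyTDTDecode(frameCount: 12) { frame, _ in
    script[frame] ?? TDTStep(token: nil, duration: 1)
}
print(result.tokens, result.jointCalls)
```

In this toy run the decoder covers 12 frames with 4 network evaluations, because each prediction also says how far to jump. A decoder that visits every frame would need at least 12.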

Why Parakeet runs fast on Apple Silicon

Parakeet is a Conformer-Transducer model. Two architectural choices make it land well on the Apple Neural Engine:

It’s streaming-native. It consumes audio in chunks and emits tokens as it goes. Whisper, by contrast, was designed for whole-utterance transcription: it expects a fixed audio window and processes it in one pass. Streaming lets text appear word-by-word as you speak.
It skips frames while decoding. The TDT head predicts token + duration jointly, so the decoder jumps ahead by the predicted duration instead of running a decoding step on every audio frame. The model can be quantized and converted to Core ML, then dispatched to the Neural Engine for the heavy parts.
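The chunk-by-chunk consumption described in the first point can be sketched as a small buffer that collects microphone samples and releases fixed-size chunks. `AudioChunker` is a hypothetical helper, not part of FluidAudio, and the 16 kHz sample rate and 100ms chunk size in the usage lines are assumptions for illustration only.

```swift
/// Accumulates incoming audio samples and hands out fixed-size chunks.
/// Hypothetical helper for illustration.
struct AudioChunker {
    let chunkSize: Int
    private var pending: [Float] = []

    init(chunkSize: Int) {
        precondition(chunkSize > 0, "chunkSize must be positive")
        self.chunkSize = chunkSize
    }

    /// Appends new samples and returns every complete chunk now available.
    mutating func append(_ samples: [Float]) -> [[Float]] {
        pending.append(contentsOf: samples)
        var chunks: [[Float]] = []
        var start = 0
        while pending.count - start >= chunkSize {
            chunks.append(Array(pending[start..<start + chunkSize]))
            start += chunkSize
        }
        pending.removeFirst(start)   // keep only the incomplete tail
        return chunks
    }

    /// Returns the leftover samples (shorter than one chunk) at end of speech.
    mutating func flush() -> [Float] {
        let rest = pending
        pending.removeAll()
        return rest
    }
}

// Assumed values: 16 kHz mono audio, 100ms chunks, 1,000-sample mic callbacks.
var chunker = AudioChunker(chunkSize: 1_600)
var readyChunks = 0
for _ in 0..<4 {
    let micBuffer = [Float](repeating: 0, count: 1_000)
    readyChunks += chunker.append(micBuffer).count   // each chunk would go to the model
}
let tail = chunker.flush()
print(readyChunks, tail.count)
```

Each complete chunk can be handed to the recognizer as soon as it exists, which is what lets text appear while you are still speaking. A fixed-window model would instead wait for the whole window to fill.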

The Mac runtime that makes this work is FluidAudio, an open-source Swift port of Parakeet optimized for the Apple Neural Engine. It’s what Dictato ships internally.

The end-to-end number on M-series chips: about 80ms from microphone to typed text. For context, that’s faster than a typical round-trip to a cloud STT API, and well below the threshold where users perceive lag.

How to run NVIDIA Parakeet on your Mac

There are two paths, depending on what you’re trying to do.

Path 1: Just dictate (recommended)

Install Dictato. It bundles Parakeet, the FluidAudio runtime, and a system-wide dictation overlay that types into any Mac app: Mail, Slack, Xcode, VS Code, Notion, a browser tab, anything that accepts text input.

1. Download from dicta.to/download
2. Open the app, grant microphone and accessibility permissions
3. Pick Parakeet in Engine Settings (or leave it on Auto)
4. Press your hotkey, speak, release

That’s the whole flow. There’s no Python to install and no model files to manage by hand. 7-day free trial, then 19.99€ once for a lifetime license. Everything runs on your Mac, so there’s nothing left to pay after that: no subscription, no API key, no per-word billing.


Path 2: Build it yourself

If you want to integrate Parakeet directly into your own Swift code, FluidAudio is the package to look at. It’s MIT-licensed and exposes a clean ParakeetEngine API. You’ll handle audio capture, the engine session, and text output yourself. The model isn’t the hard part; audio buffering and Apple’s accessibility/CGEvent APIs for text injection are. Plan on a weekend.
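To give a feel for the text-injection side, here is a minimal sketch that types a string into the focused app with synthetic keyboard events. It uses Apple's `AXIsProcessTrusted`, `CGEventSource`, and `CGEvent` APIs. The helper names `utf16Pieces` and `typeText` are hypothetical, and the 20-code-unit limit per event is a conservative assumption rather than a documented figure. It is not how Dictato or FluidAudio necessarily do it.

```swift
#if canImport(CoreGraphics) && canImport(ApplicationServices)
import CoreGraphics
import ApplicationServices
#endif

/// Splits text into pieces of at most `limit` UTF-16 code units without
/// cutting a Character (for example an emoji) in half.
/// The default limit of 20 is an assumption about how much one event carries.
func utf16Pieces(of text: String, limit: Int = 20) -> [String] {
    var pieces: [String] = []
    var current = ""
    for character in text {
        let candidate = current + String(character)
        if candidate.utf16.count > limit && !current.isEmpty {
            pieces.append(current)
            current = String(character)
        } else {
            current = candidate
        }
    }
    if !current.isEmpty { pieces.append(current) }
    return pieces
}

#if canImport(CoreGraphics) && canImport(ApplicationServices)
/// Types `text` into the focused app by posting key-down/key-up events that
/// carry a Unicode payload. Returns false if Accessibility access has not
/// been granted or an event could not be created.
func typeText(_ text: String) -> Bool {
    guard AXIsProcessTrusted() else { return false }
    let source = CGEventSource(stateID: .hidSystemState)
    for piece in utf16Pieces(of: text) {
        let units = Array(piece.utf16)
        for keyDown in [true, false] {
            guard let event = CGEvent(keyboardEventSource: source,
                                      virtualKey: 0,
                                      keyDown: keyDown) else { return false }
            event.keyboardSetUnicodeString(stringLength: units.count,
                                           unicodeString: units)
            event.post(tap: .cghidEventTap)
        }
    }
    return true
}
#endif

print(utf16Pieces(of: "hello world", limit: 5))
```

The splitting helper is plain Swift and can be tested anywhere. `typeText` only compiles on macOS and only works once the user has granted the app Accessibility permission in System Settings.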

Path 1: 30 seconds. Path 2: a weekend. If you landed here looking for a “Parakeet Mac app,” path 1 is the right one.

Parakeet vs the alternatives on Mac

A quick orientation if you’re choosing between Mac dictation engines:

| | Parakeet | WhisperKit | Apple SpeechAnalyzer | Qwen3-ASR |
| --- | --- | --- | --- | --- |
| Latency on Apple Silicon | ~80ms | 150-300ms | 150-400ms | 200-400ms |
| Languages | 25 | 99 | 20 | 30 |
| Best for | Live English dictation | Multilingual / rare languages | macOS 26 built-in flow | 30-language native hints |
| Model size | ~2.3 GB | ~600 MB | system | ~600 MB |
| Runs offline | yes | yes | yes (macOS 26) | yes |

No single engine wins everywhere. Parakeet wins live English dictation on M-series Macs. The deep-dive comparison with numbers is in Parakeet vs Whisper vs Apple Speech on Mac.

Hardware requirements

You need an Apple Silicon Mac (M1, M2, M3, or M4). Parakeet isn’t supported on Intel Macs, because it relies on the Neural Engine for its speed. macOS 14 (Sonoma) is the minimum; on macOS 26 you also get the Apple Intelligence proofread layer as a bonus.

Disk-wise, plan for about 2.5 GB free for the model and runtime. 8 GB of RAM is enough for dictation: the Neural Engine, not the CPU, carries the model, so background apps don’t fight Parakeet for cycles. The built-in mic is fine; AirPods and USB mics work too.

No discrete GPU, no CUDA, no internet connection at runtime.
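If you are building your own integration, the requirements above can be turned into a small preflight check. The sketch below is illustrative: `MacProfile`, `PreflightIssue`, `preflight`, and `currentMacProfile` are hypothetical names, the thresholds are the figures quoted in this section (macOS 14, about 2.5 GB counted as decimal gigabytes), and the architecture check reflects how the binary was compiled, so an Intel build running under Rosetta would report false.

```swift
import Foundation

/// Facts about the machine that matter for running the model locally.
struct MacProfile {
    let isAppleSilicon: Bool
    let macOSMajorVersion: Int
    let freeDiskBytes: Int64
}

enum PreflightIssue: Equatable {
    case needsAppleSilicon
    case needsNewerMacOS(minimum: Int)
    case needsMoreDisk(missingBytes: Int64)
}

/// Pure check: returns every requirement the machine fails to meet.
func preflight(_ mac: MacProfile,
               minimumMacOS: Int = 14,
               requiredBytes: Int64 = 2_500_000_000) -> [PreflightIssue] {
    var issues: [PreflightIssue] = []
    if !mac.isAppleSilicon {
        issues.append(.needsAppleSilicon)
    }
    if mac.macOSMajorVersion < minimumMacOS {
        issues.append(.needsNewerMacOS(minimum: minimumMacOS))
    }
    if mac.freeDiskBytes < requiredBytes {
        issues.append(.needsMoreDisk(missingBytes: requiredBytes - mac.freeDiskBytes))
    }
    return issues
}

/// Reads the values for the machine this code is running on.
func currentMacProfile() -> MacProfile {
    #if arch(arm64)
    let appleSilicon = true    // compiled for arm64
    #else
    let appleSilicon = false   // Intel build (possibly under Rosetta)
    #endif
    let version = ProcessInfo.processInfo.operatingSystemVersion
    let attributes = try? FileManager.default.attributesOfFileSystem(forPath: NSHomeDirectory())
    let free = (attributes?[.systemFreeSize] as? NSNumber)?.int64Value ?? 0
    return MacProfile(isAppleSilicon: appleSilicon,
                      macOSMajorVersion: version.majorVersion,
                      freeDiskBytes: free)
}

let issues = preflight(currentMacProfile())
print(issues.isEmpty ? "Ready for local dictation" : "Issues: \(issues)")
```

Keeping `preflight` separate from `currentMacProfile` means the rules can be unit-tested with made-up profiles, without depending on the machine running the tests.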

Privacy

Running Parakeet on your Mac means no audio leaves your machine. The microphone signal goes through the local model and lands in the active text field. That’s the whole path: no cloud round-trip, no transcript sitting on a server you don’t control, no API key tying usage back to an account.

This matters for healthcare, legal, and any work with regulated data. It also matters for everyday speed: the 80ms latency is only possible because there’s no network in the loop.

Common questions before you install

Will Parakeet drain my battery? Less than you’d expect. The Neural Engine is the most power-efficient compute path on Apple Silicon. A full hour of continuous dictation typically costs a few percent of battery on an M2 MacBook Air.

Can I switch engines per task? Yes. Dictato lets you pick a default and switch in one click: Parakeet for English drafts, WhisperKit for a Polish email, Apple SpeechAnalyzer for macOS 26-only flows. You can also leave it on Auto and Dictato picks a sensible default based on language and context.

Does it work in any Mac app? Yes. Dictato types via the Accessibility API, so it lands text in Mail, Messages, Notes, Slack, Discord, Xcode, VS Code, Cursor, Notion, Linear, Figma, browsers, anywhere the cursor blinks.

What if I’m on an Intel Mac? Parakeet won’t run with acceptable latency on Intel. Dictato falls back to WhisperKit there. If you’re on Intel and serious about local dictation, the real bottleneck is the Neural Engine, which only Apple Silicon has. Upgrading the Mac is the right move, not the app.

Ready to dictate with NVIDIA Parakeet on your Mac? Dictato ships Parakeet (plus WhisperKit, Apple SpeechAnalyzer, and Qwen3-ASR), runs 100% offline at 80ms latency, and types into any app. 7-day free trial, then 19.99€ once, yours for life. Everything runs on your Mac, so there’s no subscription and no AI bill to follow. Try it free →