Features

Everything you need for voice input on macOS

🔒

Privacy First

Zero telemetry, zero analytics, zero data collection. Pair with a self-hosted LLM (e.g. Ollama) and all your data — voice, text, vocabulary — stays entirely on your machine.

🎧

Multi-Engine ASR

Five backends — Apple Speech (zero setup), FunASR (Chinese), MLX-Whisper (99 languages, Apple GPU), Sherpa-ONNX, and Whisper API. Real-time streaming overlay during recording.

✨

AI Enhancement

LLM-powered proofreading, translation, and chain pipelines via any OpenAI-compatible API. Clipboard enhancement (Ctrl+Cmd+V) for selected text. Extended thinking visualization.

📚

Self-Improving Correction

Personal vocabulary learns your domain terms from correction history. Conversation history enables topic continuity. The more you use it, the more accurate it gets.

🔍

Launcher & Snippets

Alfred-style search panel — apps, files, clipboard history, bookmarks, calculator with unit conversion, and a command palette. Snippet auto-expansion as you type.

💻

Scripting & Automation

Python plugin system with leader keys, hotkeys, event listeners, persistent storage, and shell commands for macOS automation.

How It Works

Three simple steps — no window switching needed

Hold to Record

Hold fn and speak. A live overlay shows partial results in real time. Cmd to restart, Space to cancel, Z for recent history.

Release to Transcribe

Release the key — WenZi transcribes your speech and optionally enhances it with AI.

Text is Typed

Result is typed into the active app, or shown in a preview panel to edit, switch modes (⌘1–⌘9), and compare before confirming.

The Preview panel — review, edit, switch modes, and compare models before confirming

WenZi Preview panel showing ASR result, AI enhancement, mode switcher, and editable final result

ASR Backends

Choose the best engine for your language and hardware

Backend	Language	Speed	Accuracy	Streaming	Download
Apple Speech ⚠	Multiple	Fast	Good	Supported	None (built-in)
FunASR Default	Chinese	Fast	High	No	~945 MB
MLX-Whisper	99 languages	Medium	High	No	75 MB – 1.6 GB
Sherpa-ONNX	Multiple	Fast	High	Supported	Varies by model
Whisper API	Multiple	Depends on network	High	No	None (cloud)

	Standard	Lite
Local ASR	All 5 backends	Apple Speech (built-in)
Remote ASR	Whisper API	Whisper API
AI Enhancement	Full	Full
Launcher	Full	Full
Scripting	Full	Full
App Size	~945 MB	~64 MB
	⬇ Download	⬇ Download

Speech to Text, Instantly