Your voice controls
your Mac.

Koe is the first AI agent that sees your screen, understands context, and acts on your voice commands. Completely local. Zero cloud. Free.

macOS 13+ / Apple Silicon & Intel / v2.10.0

2,500+
Downloads
0
GitHub Stars
¥0
Forever Free
100% Local 🎤 Zoom/Teams 👥 Speaker ID 📝 AI Summary Notta alternative

Talk to your Mac. It listens.

Koe understands what you want and makes it happen. No menus. No clicking. Just speak.

"Reply to this email"
Koe reads your screen via OCR, drafts a contextual reply, and types it directly into the compose field.
"What's this error?"
Koe captures the error on your screen, analyzes it with the local LLM, and explains the fix via voice.
"Turn down the volume"
Natural language becomes system control. Volume, brightness, music playback, screen lock -- 10+ commands built in.

Screen AI Agent

The first voice assistant that actually sees what you see. Koe doesn't just transcribe -- it understands your screen and takes action.

1

Sees your screen

Real-time OCR captures every element on your display -- text, buttons, error messages, form fields.

2

Understands your intent

A local LLM processes your voice command alongside the screen context to determine the best action.

3

Acts on it

Types text, clicks buttons, controls system settings. All executed locally on your Mac.

How it works
Voice
+
Screen
LLM
Action
All processing happens on your Mac. No data ever leaves your device.

Your iPhone is now
a Mac remote.

Use your iPhone as a wireless voice controller, trackpad, and shortcut launcher for your Mac.

🎙

Voice from anywhere

Speak into your iPhone from across the room. Text appears on your Mac instantly.

AI action suggestions

Koe analyzes your Mac screen and suggests next actions. Tap to execute.

👆

Trackpad & shortcuts

Use your iPhone as a trackpad. Quick access to keyboard shortcuts like Cmd+Tab, Cmd+C/V.

🔐

PIN-secured connection

4-digit PIN on first connect. Auto-reconnects on the same Wi-Fi network after that.

iPhone
Tap to speak
Mac
Text appears here instantly...

Everything you need. Nothing you don't.

👁️

Screen AI Agent

Sees your screen via OCR, understands context with LLM, and takes action -- clicks, types, controls. All locally.

0.5s Recognition

Whisper + Metal GPU acceleration. Hybrid engine: Apple Speech for instant preview, Whisper for final accuracy.

🔒

Complete Privacy

100% local processing. Your voice, your screen, your data -- nothing ever leaves your Mac. Period.

📱

iPhone Remote

Control your Mac from your iPhone. Voice input, trackpad, shortcuts, and AI-powered action suggestions.

Smart Suggestions

AI analyzes your screen and suggests the next action. "Reply to email", "Fix this error", "Close this dialog".

🔊

Voice System Control

"Volume up", "Lock screen", "Play music" -- 10+ natural language commands for macOS system control.

🎙️

Always-on Listening

Say "Hey Koe" to activate hands-free. Custom wake word engine built with MFCC+DTW. Zero external deps.

👀

Real-time Preview

Text appears the moment you start speaking. Apple Speech for instant preview, auto-replaced by Whisper's final result.

📋

Meeting Transcription

Auto-records speech with timestamps. Saves as a text file when done. Perfect for meeting minutes.

Notta-level meeting transcription.
100% free. 100% local.

Start meeting mode with Option+Cmd+M. Koe captures Zoom/Teams audio, transcribes in real-time with speaker labels, and generates AI summaries with action items.

🎤

Zoom/Teams Audio Capture

ScreenCaptureKit grabs remote participant audio. No bot needed, no meeting link sharing required.

👥

Speaker Diarization

Automatic speaker labeling with tinydiarize. Know who said what, without any setup.

Real-time Transcription

Live transcription window with searchable text, speaker colors, and "mark important" voice commands.

📝

AI Summary + TODO

6 templates: general, 1-on-1, standup, sales, brainstorm, interview. Auto-extracts action items to Apple Reminders.

💬

Chat with Transcript

Ask questions about your meeting. "What did we decide about the budget?" AI answers from the transcript.

🔗

Slack / Notion / Calendar

Auto-post summaries to Slack. Create Notion pages. Auto-start recording when calendar meetings begin.

See how Koe compares to Notta View pricing

Why Koe wins

Side-by-side with the alternatives. Koe is the only voice tool that can see and act on your screen.

Feature Koe macOS Dictation Siri ChatGPT Voice
Screen awareness
Clicks buttons
Local processing
Free
iPhone remote
Open source

Up and running in 60 seconds

Install on Mac

Download the PKG and double-click. One click install to /Applications. Allow microphone access.

Connect iPhone

Same Wi-Fi network. Koe auto-detects your Mac. Enter the 4-digit PIN once -- done.

Say anything

Press Cmd+Opt+V or say "Hey Koe". Speak naturally. Koe understands and acts.

Built in the open.
MIT Licensed.

Every line of code is public. Fork it, modify it, use it commercially. Contributions welcome.

Start controlling your Mac
with your voice.

Free. Private. Open source. No account needed.

macOS 13+ / Apple Silicon & Intel / v2.10.0 / MIT License

💰 Need cloud AI? See Pro plans Switching from Notta? Share on X