Voice
Build at the speed of speech
A local Whisper sidecar transcribes you in the same room your code lives in. No round-trip, no consent screens, no waiting.
Now in private beta · macOS
A voice-driven, diffusion-native editor that reads the room — knows what you mean, anticipates the next change, and resolves the diff while you finish the sentence. No friction between intent and code.
Forked from VSCodium · Your keys · Your machine · Free in beta
Everything you expect from a modern AI IDE
What Cursor can't do
Voice
A local Whisper sidecar transcribes you in the same room your code lives in. No round-trip, no consent screens, no waiting.
Diffusion
The eri Engine denoises a working patch while you finish your sentence. Watch the preview resolve instead of staring at a streaming cursor.
Gaze
An optional eye tracker tells eri where on screen you're pointing. “move that button up” stops being a guessing game.
Live
Watch eri turn a single spoken request into a deployed page. The transcript runs locally, the eri Engine diffuses the diff, every beat below is real — slowed down so you can see it.
waiting for diff…
A live cross-section of the eri Engine. The waveform on the left is your voice; the lattice in the middle is the diffusion settling into code; the panel on the right is the preview your eyes have already started to want.
rendered at 60fps on the device you're holding · canvas 2d, hand-rolled physics, zero dependencies
The eri loop
Look at the part you want to change. Say it out loud. eri's diffusion engine streams the diff into the file, and the prototype updates under your gaze. No keyboard, no waiting for tokens to tick out left to right.
Prototype · live preview
One plan. Predictable cost. Cancel any time.
No card. 14 days. Local-first.
pricing.tsx — eri
""
gaze → voice → diffuse → render
The reason to switch
The four primitives that turn the loop on the left into a single gesture.
Voice transcription
Local Whisper, sub-second
Diffusion preview
See the diff resolve
Gaze targeting
Look at what you mean
Iteration history
Branchable, time-travel
Plus everything Cursor ships — Tab completion, ⌘K, Composer, Agent mode, @-mentions, MCP. The loop above is the part that's only here.
Every shortcut you have muscle memory for.
Plus the muscle eri brings on its own — predicting your next move, learning your patterns, remembering you across sessions. Your settings.json, keybindings, and extensions carry over on first launch.
Why eri
Voice in, diffusion out, eyes on the file, every turn rewindable. Pick a tab.
Hold a hotkey, say what you want, release. eri turns intent into edits without the keyboard tax. Whisper runs on your machine, so the audio never leaves the room.
Inside eri
Eri is a VSCodium fork with the Cursor feel — composer, palette, voice, gaze, diffusion, agents. Same keymap, same extensions, new loop.
Voice composer
Hold the side key, talk. eri reads the room — fusing voice, gaze, and context into a single turn before you finish the sentence. Intent-first, not syntax-first. The composer never blocks, edits stack.
eri Engine diffusion
The diff arrives whole, not streamed token by token. Small refactors land before your eyes finish moving.
Palette
Gaze
eri knows which line you mean — before you name it.
Cloud agents
Telemetry
voice turns / day
240+
median edit
1.8s
p95 ship time
34s
audio off-device
0 bytes
Model routing
Eri ships with a small router that knows the strengths of every model and a manual override that fits in a single spoken phrase. Bring your own keys for every provider — eri never proxies the call.
eri uses multiple frontier models behind the eri Engine API.
routing now
eri Engine
default for in-editor turns
// in-house · diffusion
diffusion edits
The whole diff at once. The eri Engine denoises a multi-file change in the time a tokenizer would still be warming up.
median edit
1.8s
// anthropic · 1M ctx
deep reasoning
Ask it to read the whole repo and propose the refactor that nobody dares to write. Opus is eri's slow lane when correctness matters more than speed.
context window
1M
// anthropic · polish
polish lane
Catch the lint, fix the imports, write the commit message. Sonnet runs alongside the eri Engine so the diff lands clean without a second prompt.
polish pass
≤ 4s
// openai · planner
long-form planning
When the request is a whole document — a spec, a migration plan, a long test suite — GPT-5 plans it out and eri executes the parts.
planning window
200k
// google · vision-in
multimodal
Show it the Figma, the screenshot, the design doc. Gemini reads the picture and eri writes the component to match.
modality
image-in
BYOK
Your keys, your billing. Eri binds each provider to a Keychain Access entry scoped to the editor process — the renderer never reads the secret, the sidecar signs requests in a sandboxed worker.
Voice override
Say “use Opus for this one” mid-turn. A local whisper.cpp pass spots the hot keyword use X before the sentence finishes; the next turn falls back to your default.
Local fallback
Lose the network and eri swaps the route to a 4-bit Llama-3.2-3B running on the same machine. The diff queues against the cloud lane and replays the moment your uplink returns — the loop never stops.
You speak in fragments. eri reads your gaze, your last few turns, and the file you're actually in — then compiles a prompt the model can act on, and routes it to whichever brain handles that kind of work best.
▌
Side by side
We timed the same task — “add a pricing toggle and ship it” — on eri and on a popular typing-only editor. Both running the same frontier model. The difference is voice, diffusion, and one tool that owns the whole loop.
voice + diffusion
elapsed
0:28
autocomplete + chat
elapsed
9:40
The wedge
Cursor is the pro IDE. v0, Bolt and Lovable are prompt-to-app for browsers. eri sits between them: voice in, diffusion out, native and yours.
Who it's for
Ship the v1 before the coffee cools. Voice your idea, watch eri scaffold the routes, the auth, and the landing — then put it on Vercel without leaving the conversation.
Run your whole stack from one window. The eri Engine diffuses the small edits, Opus handles the long refactors, cloud agents grind the migrations while you sleep.
Weekends are short. eri remembers your stack, your patterns, and your taste — so Sunday afternoon starts where Saturday morning left off.
Pricing
Free
$0/mo
Local voice loop, your own keys, single project.
Adaptive Eri
Most picked$20/mo
eri Engine included, cloud agents, unlimited projects.
Frontier Eri
$60/mo
Every frontier model, longer cloud runs, priority routing.
Field reports · v0.4 alpha
Notes from the alpha. Names trimmed to initials at the writers' request — we'll loosen up once we're out of invite-only.
First hour with eri, I caught myself talking to the editor like a pair. By hour three I'd shipped a webhook handler I'd been putting off for weeks.
The eri Engine is the first diffusion engine that actually feels like it edits the file instead of regenerating it. The diffs are surgical.
I dictated a refactor across six files while making coffee. Came back, reviewed the diff, accepted four, kept thinking. This is the loop.
It's quiet in a way Copilot never was. eri waits until I finish a thought, then it moves fast — like having a senior pair who actually shuts up.
The gaze targeting was the part I was skeptical about. Two days in I realized I'd stopped reaching for the trackpad to point at things.
Cursor-familiar shortcuts, voice on top, diffusion under the hood. Took twenty minutes to feel native. Now my keyboard is mostly for hotkeys.
Voice-first sounds gimmicky until you've done a four-hour session without RSI flaring. That alone made the switch worth it for me.
I write CLI tools mostly. eri's diff preview is the first AI surface that doesn't make me anxious about accepting changes.
Speak the rough idea, glance at the diff, keep going. The thought-to-merged-code latency is the lowest I've ever felt.
Stop translating thought into keystrokes. eri anticipates the next change, resolves the diff under your gaze, and learns the way you build — session after session.