The AI that never
phones home.

Halfmoon is full-featured AI chat that runs entirely in your browser. Open-source models on your own device — no cloud, no account, nothing to leak.

Start chatting Free · No sign-up · Works offline
0
servers involved
3
tiers — Lite, Core & Full
100%
runs on your device

Privacy you don't have to take on faith.

Private by physics

Not by promise. The model runs on your GPU, inside your browser. Your words never cross the network, so there's nothing to intercept, store, or subpoena.

Offline is a feature

Add it to your home screen like an app. Once a model is downloaded, Halfmoon works on a plane, in a tunnel, off the grid.

Three sizes of mind

Halfmoon comes as Lite, Core, and Full — the same assistant at phone, laptop, and workstation scale. Pick your phase below.

One Halfmoon, three phases.

Halfmoon Lite

Built on Google Gemma 3 1B · ~0.7 GB

The quick thinker. Instant answers, brainstorming, and everyday questions — light enough to run happily on your phone.

Halfmoon Core

Built on Qwen3.5 4B · ~3.9 GB

The daily driver. Writing, summarizing, and real back-and-forth conversation with nuance — the smartest model of its size. Made for laptops.

Halfmoon Full

Built on Qwen3.5 9B · ~6.4 GB

The deep end. Complex reasoning, long-form writing, and code. Wants a desktop-class GPU — and rewards it.

Every tier speaks as Halfmoon — same personality, different horsepower. Swap tiers anytime; each downloads once and stays on your device.

Three steps. Zero accounts.

01

Pick your tier

Lite for phones, Core for laptops, Full for big GPUs. Swap anytime.

02

Wait once

The model downloads to your browser's cache — a first-time-only wait.

03

Chat forever

Streaming replies, code, markdown, history. All local, all free.

The fine print, unfined.

Is it really free?

Yes. Your device does the computing, so there's nothing for us to meter. No subscription, no token limits, no ads.

How is this private, exactly?

Halfmoon uses WebGPU to run the language model inside your browser tab. Prompts and responses are processed on your hardware and stored only in your browser. There is no backend — the page is just files.

What do I need?

A browser with WebGPU — Chrome or Edge 113+, Safari 26+ (including iPhones on iOS 26+), or Firefox 141+ — and 0.7–6.5 GB of free space depending on the tier. Recent phones run Halfmoon Lite comfortably.

Do I have to install anything?

No. It's a web page. But if you tap “Add to Home Screen,” it behaves like a native app — full-screen, with an icon, working offline.

Chat like nobody's watching.

Because nobody is.