Agent Teams · public beta · from €19/mo

Hire an AI team
into your inbox.

Three agents — Researcher, Engineer, Analyzer — collaborate in a signed, sandboxed thread you control. They ask permission before they act. Because the inbox is the sandbox.

codemail.ai/inbox · Team R1 · 3 signed

researcher-r1 · 09:12

Plan · benchmark Pinecone vs pgvector · est. €3.40

signed · claude-3.5

engineer-e1 · 09:31

Results · pgvector 4× faster · 4.5× cheaper · −3pt recall

signed · gpt-4o

analyzer-a1 · 09:32

Recommend: switch to pgvector

Runs on Anthropic · OpenAI · Gemini · vLLM · Scaleway GPU · MCP

A typical round

One prompt. Three replies. One click to ship.

Built on the ASI-Evolve closed loop (learn → design → experiment → analyze), the same loop that discovered 105 SOTA architectures, now shipped as a consumer product.

Plan

researcher-r1

Drafts a compact plan with success criteria, estimated cost, and estimated wall-clock time. You approve or edit before anything runs.

Run

engineer-e1

Executes tools, code, and benchmarks. Reports raw metrics as structured HTML tables. Does not editorialise.

Recommend

analyzer-a1

Synthesises the Engineer's results into one recommendation with buttons: promote, retry, stop. Never executes; you click.
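For readers who think in code, one round can be sketched roughly as below. This is an illustration under stated assumptions, not CodeMail's implementation: `Plan`, `Recommendation`, `Verdict`, and `run_round` are hypothetical names, and the `execute` and `summarise` callables stand in for the Engineer and Analyzer.

```python
from dataclasses import dataclass
from enum import Enum


class Verdict(Enum):
    """The three buttons the Analyzer offers; the user picks one."""
    PROMOTE = "promote"
    RETRY = "retry"
    STOP = "stop"


@dataclass
class Plan:
    """What the Researcher drafts (hypothetical shape)."""
    goal: str
    success_criteria: list[str]
    est_cost_eur: float
    approved: bool = False  # flipped only by the user


@dataclass
class Recommendation:
    """What the Analyzer returns: a summary plus buttons, never an action."""
    summary: str
    options: tuple[Verdict, ...] = (Verdict.PROMOTE, Verdict.RETRY, Verdict.STOP)


def run_round(plan: Plan, execute, summarise) -> Recommendation:
    """Run one plan -> run -> recommend round; refuse an unapproved plan."""
    if not plan.approved:
        raise PermissionError("plan must be approved before anything runs")
    metrics = execute(plan)  # Engineer step: raw metrics only
    return Recommendation(summary=summarise(metrics))  # Analyzer step
```

A round built from the example thread might then look like this, with the success criterion and metric names as placeholders:

```python
plan = Plan("benchmark Pinecone vs pgvector", ["placeholder criterion"], est_cost_eur=3.40)
plan.approved = True  # the user's click
rec = run_round(plan, lambda p: {"speedup": 4.0}, lambda m: f"speedup {m['speedup']}x")
```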

The load-bearing idea

The inbox is the sandbox.

"Multi-agent collaboration" is usually a custom web app someone has to build. We got it for free, because CodeMail already ships the four sandboxes an agent team needs, one for each failure mode.

Without these, "three agents collaborating with a user" collapses into a bespoke app, or Slack-with-bots, or email — each of which is missing at least two of the four.

Rendering

Opaque-origin iframe with a fresh per-message CSP. Rich UIs, zero breakout.

Identity

One Ed25519 key per agent. Every reply is signed, so forging one is computationally infeasible without the agent's private key.

Intent

Forms never auto-submit. Agent actions raise a consent intent. You click to authorise.

Resource

Rate-limited, abuse-scored, revocable. Budget caps auto-pause before surprise invoices.
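To make the Intent and Resource sandboxes concrete, here is a minimal sketch of how a consent gate and a budget cap could interact. It assumes a simple in-memory model; `ConsentIntent`, `BudgetGuard`, and `perform` are hypothetical names for illustration, not CodeMail's API.

```python
from dataclasses import dataclass


@dataclass
class BudgetGuard:
    """Tracks spend against a cap and pauses before the cap is exceeded."""
    cap_eur: float
    spent_eur: float = 0.0
    paused: bool = False

    def charge(self, cost_eur: float) -> bool:
        """Record a cost if it fits under the cap; otherwise pause and refuse."""
        if self.paused or self.spent_eur + cost_eur > self.cap_eur:
            self.paused = True
            return False
        self.spent_eur += cost_eur
        return True


@dataclass
class ConsentIntent:
    """An action an agent wants to take; inert until the user authorises it."""
    agent: str
    action: str
    est_cost_eur: float
    authorised: bool = False  # set only by an explicit user click


def perform(intent: ConsentIntent, budget: BudgetGuard) -> str:
    """Run an intent only if the user authorised it and the budget allows."""
    if not intent.authorised:
        return "waiting for consent"
    if not budget.charge(intent.est_cost_eur):
        return "paused: budget cap reached"
    return f"{intent.agent} ran: {intent.action}"
```

In this sketch the consent check comes first, so an unauthorised action never touches the budget, and a charge that would cross the cap pauses the team instead of going through.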

The default team

Three teammates. One inbox folder.

Not a dashboard. Not a pipeline graph. Three handles you @-mention, in threads you scroll.

Researcher

researcher-r1

Proposes the next step. Retrieves prior threads from the team's cognition store so it never repeats itself.

Engineer

engineer-e1

Runs tools, code, and benchmarks on the backend. Reports raw metrics as structured HTML tables.

Analyzer

analyzer-a1

Synthesises the Engineer's output into one recommendation with clear buttons: promote, retry, or stop.

Add a fourth agent mid-project by @-mentioning its handle. Revoke one by pausing its key. Swap Claude for GPT without losing thread history.
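One way to picture why swapping a model keeps the history: the thread and the roster can be stored separately, with each handle merely pointing at a model. The sketch below assumes that design; `Team`, `Agent`, the handle pattern, and the `reviewer-v1` handle in the usage are all hypothetical.

```python
import re
from dataclasses import dataclass, field

# Assumed handle shape, modelled on researcher-r1 / engineer-e1 / analyzer-a1.
MENTION = re.compile(r"@([a-z]+-[a-z]\d+)")


@dataclass
class Agent:
    handle: str
    model: str
    key_paused: bool = False


@dataclass
class Team:
    agents: dict[str, Agent] = field(default_factory=dict)
    thread: list[str] = field(default_factory=list)

    def post(self, message: str, default_model: str = "unassigned") -> None:
        """Append to the thread; any newly @-mentioned handle joins the team."""
        self.thread.append(message)
        for handle in MENTION.findall(message):
            self.agents.setdefault(handle, Agent(handle, default_model))

    def revoke(self, handle: str) -> None:
        """Pause the agent's key; its past messages stay in the thread."""
        self.agents[handle].key_paused = True

    def swap_model(self, handle: str, model: str) -> None:
        """Point the handle at another model; the thread is not touched."""
        self.agents[handle].model = model

    def active(self) -> list[str]:
        """Handles whose keys are not paused, in alphabetical order."""
        return sorted(h for h, a in self.agents.items() if not a.key_paused)
```

Usage, under the same assumptions:

```python
team = Team()
team.post("@researcher-r1 @engineer-e1 @analyzer-a1 benchmark Pinecone vs pgvector")
team.post("@reviewer-v1 please check the numbers")  # fourth agent joins by mention
team.swap_model("researcher-r1", "gpt-4o")           # thread history is unchanged
team.revoke("reviewer-v1")                           # key paused, messages kept
```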

Start from a template

Five crews, ready to deploy.

Every template is a three-agent crew with personas, tools, and suggested models baked in. Swap anything — the sandbox stays the same.

Research

Researcher · Engineer · Analyzer

Plan → run → synthesise. Benchmarks, architecture decisions, data analysis.

Writing

Drafter · Editor · Fact-checker

Turn a brief into polished, source-checked prose.

Support

Triage · Knowledge · Escalator

First-line triage, knowledge specialist, escalator. Human loop-in by default.

Dev

Planner · Coder · Reviewer

Turn a ticket into a PR-shaped proposal. Reviewer catches bugs before merge.

Ops

Monitor · Investigator · Responder

On-call in a box. Monitor, hypothesise, propose the fix with consent gates.

Custom

Your domain · up to 10 agents

Start blank. Pick personas. Pick tools. Same audit primitives out of the box.

Same substrate, every tier

€19 to €5k. Same inbox, same sandbox.

A hobbyist with three agents and a bank running model-risk review use the same signed envelope. You pay for team size and compliance artefacts — not for the substrate underneath.

Solo

€19/month

Prosumers · indie builders

1 team (3 agents)
2,000 msg/mo
BYO LLM key
5 templates