mirror of https://github.com/multica-ai/multica.git synced 2026-06-17 03:38:32 +02:00

Bohan Jiang 13f74e651a feat(agents): remove custom_env from agent resources, add audited env endpoint (MUL-2600) (#3209)

* feat(agents): remove custom_env from agent resources, add audited env endpoint (MUL-2600)

The agent resource shape (list / get / create / update / archive /
restore responses + WebSocket events) no longer carries `custom_env`
values. Reads/writes of env now flow exclusively through a dedicated
`/api/agents/{id}/env` endpoint that is owner/admin-only, rejects
agent-actor sessions, applies a "****" sentinel preserve guard on
PUT, and writes a persistent audit row per reveal/update.

Why
- `multica agent list --output json` historically returned plaintext
`custom_env` for owner/admin callers (the redaction gate gave only
members the masked map). Any agent token running on the workspace
inherits its owner's role, so it could read every other agent's
secrets just by listing.
- Patching list/get redaction alone (PR #3175 direction) left
symmetric leaks via mutation responses, WS events, the "reveal"
path itself (no actor-aware auth), and a `****` overwrite footgun
on UpdateAgent.

What changed
- Backend: drop `custom_env` from AgentResponse; add coarse
`has_custom_env` + `custom_env_key_count`. Strip env handling from
UpdateAgent (silently ignored if sent). Keep CreateAgent's
custom_env acceptance.
- Backend: new GET/PUT `/api/agents/{id}/env` handlers in
`internal/handler/agent_env.go`:
- resolveActor → 403 for agent actors (closes the lateral-movement
path).
- Owner/admin role gate via existing helper.
- PUT honours value == "****" as "preserve existing value".
- Both write to `activity_log` with `agent_env_revealed` /
`agent_env_updated` actions. Audit details record key names only,
never values.
- Daemon claim path (`ClaimAgentTask`) unchanged — `TaskAgentData`
still carries plaintext env for runtime injection.
- SQL: new `UpdateAgentCustomEnv` query; sqlc regenerated (v1.31.1).
- CLI: new `multica agent env get|set` subcommands. `--custom-env*`
flags removed from `multica agent update`; the no-fields error
now points to the new path.
- Frontend: drop env fields from `Agent` + `UpdateAgentRequest`; add
`getAgentEnv` / `updateAgentEnv` client methods; rewrite env-tab
to show "N variables configured" + explicit "Reveal & edit"
button, fetching values only on intentional reveal.
- Locales: parity-safe additions to en + zh-Hans.
- Docs: agents-create.{mdx,zh.mdx} reflect the new threat model and
endpoint.
- Mobile: schema drops `custom_env` / `custom_env_redacted`, adds
metadata fields.
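
The `****` preserve rule on PUT can be illustrated with a minimal Go sketch. The name `mergeEnvSketch`, its signature, and the choice to treat the PUT body as a full replacement are assumptions made for illustration; the actual `mergeAgentEnv` is not shown in this message and may behave differently, in particular for a sentinel sent under a key that has no stored value.

```go
package main

import "fmt"

// maskSentinel is the placeholder that PUT treats as "preserve existing value".
const maskSentinel = "****"

// mergeEnvSketch is a hypothetical helper, not the project's mergeAgentEnv.
// It returns the env map to persist, given the stored map and the PUT body:
//   - keys absent from incoming are dropped (assumption: PUT replaces the map),
//   - keys whose incoming value is the sentinel keep their stored value,
//   - a sentinel for a key with no stored value is skipped (assumption).
func mergeEnvSketch(existing, incoming map[string]string) map[string]string {
	merged := make(map[string]string, len(incoming))
	for key, value := range incoming {
		if value == maskSentinel {
			if old, ok := existing[key]; ok {
				merged[key] = old
			}
			continue
		}
		merged[key] = value
	}
	return merged
}

func main() {
	stored := map[string]string{"API_KEY": "s3cret", "REGION": "eu"}
	body := map[string]string{"API_KEY": "****", "REGION": "us", "NEW": "1"}
	fmt.Println(mergeEnvSketch(stored, body))
}
```

The point of the guard is that a client which only ever saw the masked map can send it back unchanged without overwriting real secrets with the literal string `****`.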

Tests
- Handler tests pin the new invariants: no env in list/get
responses, owner reveal happy-path + audit row, agent-actor 403,
`****` sentinel preserves real values, UpdateAgent silently
ignores `custom_env`, pure `mergeAgentEnv` cases.
- CLI tests pivot to the new flag surface: `agent update` MUST NOT
expose the env flags; `agent env set` MUST expose
--custom-env-stdin/--custom-env-file.
- Frontend test fixtures updated; pnpm typecheck / test / lint
pass cleanly.

This is a breaking API change. Scripts that read `custom_env` from
`/api/agents` must migrate to `GET /api/agents/{id}/env`.

Co-authored-by: multica-agent <github@multica.ai>

* fix(agents): close actor-spoofing + audit fail-closed in env endpoints (MUL-2600)

Addresses Elon's review of #3209:

* Mint a task-scoped `mat_` token per claim, bound to (agent, task,
workspace, owner). Daemon injects it into the agent process in place
of its own credential. Auth middleware authoritatively rebuilds
X-User-ID / X-Agent-ID / X-Task-ID from the token row and sets
X-Actor-Source=task_token; that header is server-set only — incoming
values are stripped before any auth branch runs. resolveActor honors
the header so an agent that strips X-Agent-ID / X-Task-ID still
resolves as actor=agent.
* GetAgentEnv / UpdateAgentEnv are now fail-closed on audit-log
failures: GET refuses to return plaintext, PUT persists inside the
same tx as the audit row so they commit/roll back together.
* PUT /api/agents/{id} returns 400 when the body carries custom_env
instead of silently dropping it — directs callers to the audited env
endpoint.
* Agent actors never see mcp_config, even when the underlying member
is owner/admin; mutation broadcasts go through a redaction shim so
WS subscribers don't pick it up either.
* Fix backend test that asserted dense JSON (jsonb::text renders
whitespace) and frontend test that assumed a unique "Test User"
match.
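
The header handling described in the first bullet can be sketched in Go as follows. This is an illustration under stated assumptions, not the project's middleware: `authSketch`, `resolveActorSketch`, `actorFor`, the in-memory `tokens` table, and the use of an `Authorization: Bearer` header to carry the `mat_` token are all hypothetical. Only the header names and the `task_token` value come from the text above, and other auth branches are not modelled.

```go
package main

import (
	"fmt"
	"net/http"
	"net/http/httptest"
	"strings"
)

// tokenBinding is a hypothetical stand-in for the row a task token maps to.
type tokenBinding struct {
	UserID, AgentID, TaskID string
}

// tokens is a hypothetical in-memory token table used only by this sketch.
var tokens = map[string]tokenBinding{
	"mat_demo": {UserID: "user-1", AgentID: "agent-1", TaskID: "task-1"},
}

// authSketch strips the server-only header first, then rebuilds the identity
// headers from the token binding when a known task token is presented.
func authSketch(next http.Handler) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		r.Header.Del("X-Actor-Source") // never trust a client-supplied value

		token := strings.TrimPrefix(r.Header.Get("Authorization"), "Bearer ")
		if binding, ok := tokens[token]; ok {
			r.Header.Set("X-User-ID", binding.UserID)
			r.Header.Set("X-Agent-ID", binding.AgentID)
			r.Header.Set("X-Task-ID", binding.TaskID)
			r.Header.Set("X-Actor-Source", "task_token")
		}
		next.ServeHTTP(w, r)
	})
}

// resolveActorSketch treats any task-token request as an agent actor,
// regardless of which other headers the client kept or removed.
func resolveActorSketch(r *http.Request) string {
	if r.Header.Get("X-Actor-Source") == "task_token" {
		return "agent"
	}
	return "member"
}

// actorFor runs one request through the middleware and reports the actor.
func actorFor(authorization, spoofedSource string) string {
	req := httptest.NewRequest(http.MethodGet, "/api/agents/agent-2/env", nil)
	if authorization != "" {
		req.Header.Set("Authorization", authorization)
	}
	if spoofedSource != "" {
		req.Header.Set("X-Actor-Source", spoofedSource)
	}
	actor := ""
	handler := authSketch(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		actor = resolveActorSketch(r)
	}))
	handler.ServeHTTP(httptest.NewRecorder(), req)
	return actor
}

func main() {
	fmt.Println(actorFor("Bearer mat_demo", "")) // task token resolves as agent
	fmt.Println(actorFor("", "task_token"))      // spoofed header is stripped
}
```

Because the actor decision depends only on a header the server sets after stripping, an agent process cannot change its own classification by adding or removing headers.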

Co-authored-by: multica-agent <github@multica.ai>

* fix(agents): close residual MUL-2600 gaps from review (MUL-2600)

Migration 108 FK now correctly references agent_task_queue(id) instead
of the non-existent agent_task table; the previous name blocked CI
backend migrations.

Task-token-authenticated requests can no longer be re-routed to a
different workspace by passing workspace_slug / workspace_id /
?workspace_id / a URL workspace param. ResolveWorkspaceIDFromRequest
and resolveWorkspaceUUID both short-circuit on X-Actor-Source=task_token
and return only the token-bound X-Workspace-ID; buildMiddleware adds a
defence-in-depth 403 if any URL-resolved workspace disagrees with the
token binding.
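
A rough Go sketch of the two layers described above: a resolver that ignores client input for task-token requests, plus a separate mismatch check that would map to a 403. The type `requestSketch` and the functions `resolveWorkspaceSketch` and `checkBindingSketch` are hypothetical names for illustration; the real `ResolveWorkspaceIDFromRequest`, `resolveWorkspaceUUID`, and `buildMiddleware` operate on HTTP requests and are not reproduced here.

```go
package main

import (
	"errors"
	"fmt"
)

// requestSketch holds only the inputs relevant to workspace resolution.
type requestSketch struct {
	ActorSource       string // server-set X-Actor-Source
	BoundWorkspaceID  string // server-set X-Workspace-ID from the token row
	ClientWorkspaceID string // slug / id / query / URL value from the caller
}

// errWorkspaceMismatch stands in for the defence-in-depth 403.
var errWorkspaceMismatch = errors.New("workspace does not match token binding")

// resolveWorkspaceSketch short-circuits for task tokens: the client-supplied
// value is never consulted, only the token-bound workspace is returned.
func resolveWorkspaceSketch(req requestSketch) string {
	if req.ActorSource == "task_token" {
		return req.BoundWorkspaceID
	}
	return req.ClientWorkspaceID
}

// checkBindingSketch rejects a task-token request whose client-supplied
// workspace disagrees with the binding, even though the resolver ignores it.
func checkBindingSketch(req requestSketch) error {
	if req.ActorSource == "task_token" &&
		req.ClientWorkspaceID != "" &&
		req.ClientWorkspaceID != req.BoundWorkspaceID {
		return errWorkspaceMismatch
	}
	return nil
}

func main() {
	req := requestSketch{
		ActorSource:       "task_token",
		BoundWorkspaceID:  "ws-a",
		ClientWorkspaceID: "ws-b",
	}
	fmt.Println(resolveWorkspaceSketch(req))
	fmt.Println(checkBindingSketch(req))
}
```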

mcp_config no longer leaks back to agent actors through UpdateAgent /
CreateAgent / ArchiveAgent / RestoreAgent HTTP responses — the same
redactAgentResponseForActor helper that GetAgent/ListAgents use is now
applied to mutation responses too. WS broadcasts were already redacted
via broadcastAgentResponse.

FailTask and every TaskService cancel path (CancelTask /
CancelTasksForIssue / CancelTasksForAgent / CancelTasksByTriggerComment
/ BroadcastCancelledTasks) now eagerly call DeleteTaskTokensByTask so the
mat_ token's 24h window doesn't outlive a terminated task. Failure is
non-fatal: the FK cascade and expiry remain durable guards.

Doc-only: clarify that PUT /api/agents/{id} now hard-rejects bodies
that carry custom_env (was previously "silently ignores").

Tests:
- middleware: TestResolveWorkspaceIDFromRequest gains a task_token
case asserting client-supplied slug/id/query cannot override the
bound workspace.
- handler: TestUpdateAgent_RedactsMcpConfigForAgentActor and
TestUpdateAgent_KeepsMcpConfigForMemberActor pin the mutation-
response redaction contract per actor type.

Co-authored-by: multica-agent <github@multica.ai>

* fix(agents): match redacted mcp_config as JSON null, not Go nil (MUL-2600)

`AgentResponse.McpConfig` is `json.RawMessage` without `omitempty`, so
the redacted response serialises as `"mcp_config": null`. On decode,
`json.RawMessage` keeps the literal bytes `null` rather than collapsing
to Go nil, which made the assertion fire on a non-leak.

The product contract (field always present, distinguished from "no
config" via `mcp_config_redacted`) is intentional, so adjust the test
to check for "no secret-bearing content" instead of weakening the
contract via `omitempty`.

Co-authored-by: multica-agent <github@multica.ai>

---------

Co-authored-by: multica-agent <github@multica.ai>

2026-05-25 18:42:48 +08:00

.agents/skills/web-design-guidelines

add web design guidelines skill (#2832)

2026-05-19 12:09:41 +08:00

.github

fix(selfhost): derive local port URLs from env (MUL-2506) (#2939)

2026-05-24 13:05:53 +08:00

apps

feat(agents): remove custom_env from agent resources, add audited env endpoint (MUL-2600) (#3209)

2026-05-25 18:42:48 +08:00

deploy/helm/multica

feat(self-host): add Helm chart for Kubernetes deployment (#2377)

2026-05-25 12:53:01 +08:00

docker

feat(deploy): add full-stack Docker Compose for self-hosting

2026-04-10 15:11:18 +08:00

docs

feat(landing): add Contact Sales page and inquiry endpoint (MUL-2493) (#2988)

2026-05-22 13:22:36 +08:00

e2e

[codex] Simplify onboarding runtime bootstrap (#2836)

2026-05-19 09:52:35 +02:00

packages

feat(agents): remove custom_env from agent resources, add audited env endpoint (MUL-2600) (#3209)

2026-05-25 18:42:48 +08:00

scripts

fix(selfhost): derive local port URLs from env (MUL-2506) (#2939)

2026-05-24 13:05:53 +08:00

server

feat(agents): remove custom_env from agent resources, add audited env endpoint (MUL-2600) (#3209)

2026-05-25 18:42:48 +08:00

.dockerignore

feat(deploy): add full-stack Docker Compose for self-hosting

2026-04-10 15:11:18 +08:00

.env.example

fix(selfhost): derive local port URLs from env (MUL-2506) (#2939)

2026-05-24 13:05:53 +08:00

.gitattributes

fix(docker): fix self-hosting Docker build failures

2026-04-11 00:33:18 +10:00

.gitignore

feat(mobile): Multica for iOS — first version (#2337 )

2026-05-22 19:14:55 +08:00

.goreleaser.yml

feat(desktop): support macOS cross-platform packaging (#1262)

2026-04-20 23:33:41 -07:00

.npmrc

refactor: restructure to monorepo architecture

2026-02-10 18:00:23 +08:00

.vercelignore

Tighten Vercel ignore rules (#1513)

2026-04-22 19:00:35 +08:00

AGENTS.md

refactor: migrate workspace list from Zustand to React Query

2026-04-13 13:38:02 +08:00

CLAUDE.md

feat(mobile): Multica for iOS — first version (#2337 )

2026-05-22 19:14:55 +08:00

CLI_AND_DAEMON.md

docs(runtime): tighten issue-metadata write bar (MUL-2507) (#3004)

2026-05-21 17:20:43 +08:00

CLI_INSTALL.md

docs: include GitHub Copilot CLI in root agent listings (#1983)

2026-05-03 10:59:09 +08:00

CONTRIBUTING.md

fix: gate dev verification code behind explicit env (#1773)

2026-04-28 15:14:07 +08:00

docker-compose.selfhost.build.yml

feat(selfhost): ship public GHCR deployment flow

2026-04-22 16:58:42 +08:00

docker-compose.selfhost.yml

fix(selfhost): derive local port URLs from env (MUL-2506) (#2939)

2026-05-24 13:05:53 +08:00

docker-compose.yml

fix(security): bind all services to loopback in docker-compose files (#2759)

2026-05-18 16:14:41 +08:00

Dockerfile

fix(timezone): harden hourly-rollup rollout against straight-through migrate MUL-2488 (#2998)

2026-05-21 16:26:42 +08:00

Dockerfile.web

feat(selfhost): ship public GHCR deployment flow

2026-04-22 16:58:42 +08:00

LICENSE

chore(license): refine commercial restriction to target SaaS/resale only

2026-04-08 23:47:05 +08:00

Makefile

fix(selfhost): derive local port URLs from env (MUL-2506) (#2939)

2026-05-24 13:05:53 +08:00

package.json

feat(mobile): Multica for iOS — first version (#2337 )

2026-05-22 19:14:55 +08:00

playwright.config.ts

test(web): cover issue creation flow regressions

2026-04-10 15:52:43 +08:00

pnpm-lock.yaml

feat(web): add use-cases content pipeline with welcome page (MUL-2349) (#2795)

2026-05-25 10:05:17 +08:00

pnpm-workspace.yaml

perf: virtualize issue detail timeline + seed test scaffolding (#2413)

2026-05-11 18:56:27 +08:00

README.md

docs(readme): add Autopilots to features list (#3155)

2026-05-24 16:53:53 +08:00

README.zh-CN.md

docs: clarify Chinese README license label (#3189)

2026-05-25 12:47:33 +08:00

SELF_HOSTING_ADVANCED.md

docs: finish /projects link fix + tidy AWS_ENDPOINT_URL description (#2996)

2026-05-21 15:35:39 +08:00

SELF_HOSTING_AI.md

fix: gate dev verification code behind explicit env (#1773)

2026-04-28 15:14:07 +08:00

SELF_HOSTING.md

feat(self-host): add Helm chart for Kubernetes deployment (#2377)

2026-05-25 12:53:01 +08:00

skills-lock.json

add web design guidelines skill (#2832)

2026-05-19 12:09:41 +08:00

turbo.json

fix(ci): restore frontend checks (#1878)

2026-04-29 14:49:42 +02:00

README.md

Multica

Your next 10 hires won't be human.

The open-source managed agents platform.
Turn coding agents into real teammates — assign tasks, track progress, compound skills.

What is Multica?

Multica turns coding agents into real teammates. Assign issues to an agent like you'd assign to a colleague — they'll pick up the work, write code, report blockers, and update statuses autonomously.

No more copy-pasting prompts. No more babysitting runs. Your agents show up on the board, participate in conversations, and compound reusable skills over time. Think of it as open-source infrastructure for managed agents — vendor-neutral, self-hosted, and designed for human + AI teams. Works with Claude Code, Codex, GitHub Copilot CLI, OpenClaw, OpenCode, Hermes, Gemini, Pi, Cursor Agent, Kimi, and Kiro CLI.

For larger teams, Squads add a stable routing layer: assign work to a group led by an agent, and the leader delegates to the right member.

Why "Multica"?

Multica — Multiplexed Information and Computing Agent.

The name is a nod to Multics, the pioneering operating system of the 1960s that introduced time-sharing — letting multiple users share a single machine as if each had it to themselves. Unix was born as a deliberate simplification of Multics: one user, one task, one elegant philosophy.

We think the same inflection is happening again. For decades, software teams have been single-threaded — one engineer, one task, one context switch at a time. AI agents change that equation. Multica brings time-sharing back, but for an era where the "users" multiplexing the system are both humans and autonomous agents.

In Multica, agents are first-class teammates. They get assigned issues, report progress, raise blockers, and ship code — just like their human colleagues. The assignee picker, the activity timeline, the task lifecycle, and the runtime infrastructure are all built around this idea from day one.

Like Multics before it, the bet is on multiplexing: a small team shouldn't feel small. With the right system, two engineers and a fleet of agents can move like twenty.

Features

Multica manages the full agent lifecycle: from task assignment to execution monitoring to skill reuse.

Agents as Teammates — assign to an agent like you'd assign to a colleague. They have profiles, show up on the board, post comments, create issues, and report blockers proactively.
Squads — group agents (and humans) under a leader agent and assign work to the squad. The leader decides who should pick it up, so routing stays stable as the team grows. @FrontendTeam instead of @alice-or-bob-or-carol.
Autonomous Execution — set it and forget it. Full task lifecycle management (enqueue, claim, start, complete/fail) with real-time progress streaming via WebSocket.
Autopilots — schedule recurring work for agents. Cron triggers, webhooks, or manual runs — each autopilot creates the issue and routes it to an agent automatically, so daily standups, weekly reports, and periodic audits run themselves.
Reusable Skills — every solution becomes a reusable skill for the whole team. Deployments, migrations, code reviews — skills compound your team's capabilities over time.
Unified Runtimes — one dashboard for all your compute. Local daemons and cloud runtimes, auto-detection of available CLIs, real-time monitoring.
Multi-Workspace — organize work across teams with workspace-level isolation. Each workspace has its own agents, issues, and settings.
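
To make the task lifecycle mentioned under Autonomous Execution (enqueue, claim, start, complete/fail) concrete, here is a minimal Go sketch of it as a state machine. The state names, the `nextStates` table, and `canTransition` are assumptions for illustration only; Multica's actual task states and allowed transitions (including cancellation) are not specified in this README.

```go
package main

import "fmt"

// taskState is a hypothetical enumeration of lifecycle states.
type taskState string

const (
	stateQueued    taskState = "queued"    // after enqueue
	stateClaimed   taskState = "claimed"   // a runtime picked the task up
	stateRunning   taskState = "running"   // after start
	stateCompleted taskState = "completed" // terminal
	stateFailed    taskState = "failed"    // terminal
)

// nextStates lists the transitions this sketch allows; terminal states
// have no entry and therefore no outgoing transitions.
var nextStates = map[taskState][]taskState{
	stateQueued:  {stateClaimed},
	stateClaimed: {stateRunning, stateFailed},
	stateRunning: {stateCompleted, stateFailed},
}

// canTransition reports whether moving from one state to another is allowed.
func canTransition(from, to taskState) bool {
	for _, candidate := range nextStates[from] {
		if candidate == to {
			return true
		}
	}
	return false
}

func main() {
	fmt.Println(canTransition(stateQueued, stateClaimed))    // true
	fmt.Println(canTransition(stateCompleted, stateRunning)) // false
}
```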

Quick Install

macOS / Linux (Homebrew - recommended)

brew install multica-ai/tap/multica

Use `brew upgrade multica-ai/tap/multica` to keep the CLI current.

macOS / Linux (install script)

curl -fsSL https://raw.githubusercontent.com/multica-ai/multica/main/scripts/install.sh | bash

Use this if Homebrew is not available. The script installs the Multica CLI on macOS and Linux: it uses Homebrew when Homebrew is on PATH and otherwise downloads the binary directly.

Windows (PowerShell)

irm https://raw.githubusercontent.com/multica-ai/multica/main/scripts/install.ps1 | iex

Then configure, authenticate, and start the daemon in one command:

multica setup          # Connect to Multica Cloud, log in, start daemon

Self-hosting? Add `--with-server` to deploy a full Multica server on your machine:

curl -fsSL https://raw.githubusercontent.com/multica-ai/multica/main/scripts/install.sh | bash -s -- --with-server
multica setup self-host

This pulls the official Multica images from GHCR (latest stable by default) and requires Docker. See the Self-Hosting Guide for details. If the selected GHCR tag has not been published yet, fall back to `make selfhost-build` from a checkout.

Getting Started

1. Set up and start the daemon

multica setup           # Configure, authenticate, and start the daemon

The daemon runs in the background and auto-detects agent CLIs (claude, codex, copilot, openclaw, opencode, hermes, gemini, pi, cursor-agent, kimi, kiro-cli) on your PATH.
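
As an illustration of what PATH-based auto-detection can look like, the following Go sketch checks each CLI name from the list above with the standard library's `exec.LookPath`. The helpers `detectWith` and `lookupFrom` are hypothetical and exist only for this example; the daemon's real detection logic is not described here and may check more than PATH presence.

```go
package main

import (
	"fmt"
	"os/exec"
)

// agentCLIs are the binary names listed in the paragraph above.
var agentCLIs = []string{
	"claude", "codex", "copilot", "openclaw", "opencode", "hermes",
	"gemini", "pi", "cursor-agent", "kimi", "kiro-cli",
}

// detectWith returns the names for which lookup succeeds, in input order.
// The lookup function is injectable so the logic can be tested without
// depending on what is installed on the current machine.
func detectWith(names []string, lookup func(string) (string, error)) []string {
	found := []string{}
	for _, name := range names {
		if _, err := lookup(name); err == nil {
			found = append(found, name)
		}
	}
	return found
}

// lookupFrom builds a fake lookup backed by a fixed name-to-path table.
func lookupFrom(available map[string]string) func(string) (string, error) {
	return func(name string) (string, error) {
		if path, ok := available[name]; ok {
			return path, nil
		}
		return "", exec.ErrNotFound
	}
}

func main() {
	// Real detection against the current PATH; output depends on the machine.
	fmt.Println(detectWith(agentCLIs, exec.LookPath))
}
```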

2. Verify your runtime

Open your workspace in the Multica web app. Navigate to Settings → Runtimes — you should see your machine listed as an active Runtime.

What is a Runtime? A Runtime is a compute environment that can execute agent tasks. It can be your local machine (via the daemon) or a cloud instance. Each runtime reports which agent CLIs are available, so Multica knows where to route work.

3. Create an agent

Go to Settings → Agents and click New Agent. Pick the runtime you just connected and choose a provider (Claude Code, Codex, GitHub Copilot CLI, OpenClaw, OpenCode, Hermes, Gemini, Pi, Cursor Agent, Kimi, or Kiro CLI). Give your agent a name — this is how it will appear on the board, in comments, and in assignments.

4. Assign your first task

Create an issue from the board (or via multica issue create), then assign it to your new agent. The agent will automatically pick up the task, execute it on your runtime, and report progress — just like a human teammate.

CLI

The multica CLI connects your local machine to Multica — authenticate, manage workspaces, and run the agent daemon.

| Command | Description |
| --- | --- |
| `multica login` | Authenticate (opens browser) |
| `multica daemon start` | Start the local agent runtime |
| `multica daemon status` | Check daemon status |
| `multica setup` | One-command setup for Multica Cloud (configure + login + start daemon) |
| `multica setup self-host` | Same, but for self-hosted deployments |
| `multica workspace list` | List your workspaces (current is marked with `*`) |
| `multica workspace switch <id\|slug>` | Switch the default workspace for this profile |
| `multica issue list` | List issues in your workspace |
| `multica issue create` | Create a new issue |
| `multica update` | Update to the latest version |

See the CLI and Daemon Guide for the full command reference.

Architecture

┌──────────────┐     ┌──────────────┐     ┌──────────────────┐
│   Next.js    │────>│  Go Backend  │────>│   PostgreSQL     │
│   Frontend   │<────│  (Chi + WS)  │<────│   (pgvector)     │
└──────────────┘     └──────┬───────┘     └──────────────────┘
                            │
                     ┌──────┴───────┐
                     │ Agent Daemon │  runs on your machine
                     └──────────────┘  (Claude Code, Codex, GitHub Copilot CLI,
                                        OpenCode, OpenClaw, Hermes, Gemini,
                                        Pi, Cursor Agent, Kimi, Kiro CLI)

| Layer | Stack |
| --- | --- |
| Frontend | Next.js 16 (App Router) |
| Backend | Go (Chi router, sqlc, gorilla/websocket) |
| Database | PostgreSQL 17 with pgvector |
| Agent Runtime | Local daemon executing Claude Code, Codex, GitHub Copilot CLI, OpenClaw, OpenCode, Hermes, Gemini, Pi, Cursor Agent, Kimi, or Kiro CLI |

Development

For contributors working on the Multica codebase, see the Contributing Guide.

Prerequisites: Node.js v20+, pnpm v10.28+, Go v1.26+, Docker

make dev

`make dev` auto-detects your environment (main checkout or worktree), creates the env file, installs dependencies, sets up the database, runs migrations, and starts all services.

See CONTRIBUTING.md for the full development workflow, worktree support, testing, and troubleshooting.

An iOS mobile client lives in apps/mobile/ — see its README for how to build it onto your own iPhone.

Languages

Go 45.8%, TypeScript 45%, MDX 7.6%, PLpgSQL 0.5%, CSS 0.4%, Other 0.6%