multica

mirror of https://github.com/multica-ai/multica.git synced 2026-07-05 13:29:44 +02:00

Author	SHA1	Message	Date
Bohan Jiang	ce98b1c9ef	fix(squad): skip leader on agent reply to explicit member @-mention (MUL-2624) (#3217 ) When a user explicitly @-mentions an agent on an issue assigned to a squad, the existing rule already suppresses the squad leader on the mention comment itself — the user is routing deliberately, the mentioned agent owns the next step. The leader was still woken on the agent's reply, though, so it would re-@ the user every time the agent answered. Extend the suppression to the second leg of that explicit exchange: when an agent reply lands as a child of a member comment that carried a routing @mention (agent/member/squad/all — issue cross-refs still ignored), the leader stays out. The CreateComment handler already pins agent parent_id == task.TriggerCommentID, so this fires exactly when the agent's reply is provably tied to the upstream routing comment. Top-level agent comments and agent-to-agent threads continue to wake the leader so coordination keeps working everywhere else. Co-authored-by: multica-agent <github@multica.ai>	2026-05-25 17:12:33 +08:00
Wes	cfc652aa5f	fix(daemon): close stdin pipe in Pi adapter to deliver EOF (#2188 ) (#3118 ) Pi reads its prompt from argv (positional, see buildPiArgs) and never expects interactive input, so the Pi backend previously left cmd.Stdin nil. Under systemd, the resulting /dev/null character device has been observed not to satisfy Pi's readable-side wait, leaving runs stuck in "working" forever (#2188). Attach an explicit StdinPipe and close it immediately after Start so the child sees an EOF on a FIFO, matching the pattern already used by the Claude, Codex, Hermes, Kiro, and Kimi backends. The fix is defensive on the daemon side because Pi is mid-refactor and is not accepting issues upstream; once Pi itself stops blocking on stdin, this close is still correct (a closed pipe is a no-op for a process that does not read it). Test asserts the structural invariant: a shell-stub `pi` inspects /proc/self/fd/0 and only emits a valid event stream when stdin is a FIFO. If a future change drops the StdinPipe and stdin reverts to /dev/null (char device), the stub exits non-zero and the test fails.	2026-05-25 15:29:09 +08:00
Bohan Jiang	cd71b0fe05	fix(daemon): disable Codex native auto-memory in per-task config.toml (#3202 ) Codex CLI's auto-memory subsystem writes summaries to `$CODEX_HOME/memories/raw_memories.md` and `state_*.sqlite`, then reads them back on the next turn. The daemon never cleared these files across Reuse(), and Codex CLI may also pull from user-level `~/.codex/memories/` entirely outside the per-task isolation. Either path leaks unrelated context into new Multica tasks — multica#3130 saw `D:\Project\MoHaYu\ WowChat` Raw Memories injected into a brand-new issue's first turn. Write a daemon-managed block into the per-task `config.toml` that sets `features.memories = false`, `memories.generate_memories = false`, and `memories.use_memories = false`. Codex then neither writes nor reads its memory subsystem regardless of where the residual files live. The user's global `~/.codex/config.toml` is never touched. Pattern mirrors `ensureCodexMultiAgentConfig`: idempotent managed-block upsert, two TOML layout variants (root dotted-key vs. inside a `[features]` / `[memories]` table) to satisfy strict toml-rs parsing, and a `MULTICA_CODEX_MEMORY` env-var escape hatch. MUL-2598 Co-authored-by: multica-agent <github@multica.ai>	2026-05-25 15:17:38 +08:00
LinYushen	8e9df90d32	feat: include repo description in agent brief (#3203 ) Add Description field to RepoData structs so that workspace repo descriptions (set via the settings UI) are preserved through normalization and rendered in the agent brief as: - <url> — <description> When no description is set, the existing format is unchanged. Closes MUL-2610 Co-authored-by: multica-agent <github@multica.ai>	2026-05-25 15:16:22 +08:00
Bohan Jiang	be54e79f38	fix: upgrade github.com/jackc/pgx/v5 to 5.9.2 (#3192 ) Remediates two pgx security advisories in a single bump: - CVE-2026-33816 (fixed in 5.9.0) — pgproto3 memory-safety DoS from malformed messages sent by a malicious server. - GHSA-j88v-2chj-qfwx / CVE-2026-41889 (fixed in 5.9.2) — SQL injection via placeholder confusion with dollar-quoted literals under QueryExecModeSimpleProtocol. Not reachable in this codebase (no simple-protocol callers), but pinned to clear future scanner runs. No source changes needed: pgx 5.9.x adds no breaking APIs over 5.8.x (adds PG protocol 3.2 support, SCRAM-SHA-256-PLUS, OAuth, plus pgtype/pgconn bug fixes). Minimum Go bumped to 1.25 in 5.9.0; repo already on 1.26.1. MUL-2597 Co-authored-by: multica-agent <github@multica.ai>	2026-05-25 14:06:56 +08:00
Naiyuan Qing	6261ea45fd	Improve board and squad hover cards (#3188 )	2026-05-25 12:58:39 +08:00
Naiyuan Qing	5f1f08e466	feat(web): add use-cases content pipeline with welcome page (MUL-2349) (#2795 ) * feat(web): add use-cases content pipeline with welcome page (MUL-2349) Wire fumadocs-mdx into apps/web with an independent collection rooted at content/use-cases/. Add the first page at /use-cases/welcome (header + H1 + prose + screenshot + footer) using the about-page visual shell. - source.config.ts + lib/use-cases-source.ts (separate from apps/docs) - features/landing/components/mdx/screenshot.tsx wraps next/image - public/use-cases/welcome/screenshot-1.png placeholder (55KB) - next.config.ts wraps NextConfig with createMDX() - .gitignore + eslint ignore .source/ Co-authored-by: multica-agent <github@multica.ai> * feat(web): bilingual db-boy use case with cookie locale (MUL-2349) Extends the use-cases pipeline into the first real article. - ZH + EN MDX (auto-data-analysis.{zh,en}.mdx) sharing three real screenshots; sensitive fields on db-boy-profile.png (RDS host, DB name, password) are blurred in-place. - Cookie-based locale: /use-cases/<slug> reads multica-locale server-side via lib/use-cases-i18n.ts (mirrors LandingLayout's cookie + Accept-Language fallback). Same URL serves either language; no [lang] segment so all other landing routes stay unchanged. - Frontmatter schema (source.config.ts): z.looseObject with declared hero_image / updated_at (required) / category (optional); a preprocess converts YAML-auto-parsed Date back to a YYYY-MM-DD string. - MDX components factory createMdxComponents(locale) routes the secondary CTA to /docs/zh (ZH) or /docs (EN); internal MDX links use <Link> for SPA nav; full-width and half-width colons both trigger [CTA: ...] / [占位图: ...] markers; 副 and Secondary both work as the secondary CTA prefix. - Index page localizes hero / subtitle / card CTA / metadata; sort fallback uses an epoch placeholder so undefined-order disappears. - Landing header + footer surface use-cases entry in both locales. - Detail route: sticky header, right-rail TOC with anchor jumps, scroll-mt-[100px] on H2/H3 so anchor jumps don't slip under the sticky header. - Drop welcome demo page. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(web): resolve code review blockers on use-cases PR - Add `use-cases` to reserved_slugs.json + regenerate TS (P1: prevent future workspace slug collision) - Fix dead links in both MDX files: /features/* → /docs/* (P2) - Remove duplicate brand suffix in page title metadata (nit) Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com> Co-authored-by: multica-agent <github@multica.ai> * fix(web): align usecases locale routing * chore: refresh web mdx lockfile * fix(web): type mdx next config adapter * fix(web): wrap settings route page --------- Co-authored-by: multica-agent <github@multica.ai> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-25 10:05:17 +08:00
Bohan Jiang	a55c03a0b3	fix(agent): inject Workspace Context into agent brief (MUL-2542) (#3078 ) * fix(agent): inject Workspace Context into agent brief (MUL-2542) The per-workspace `workspace.context` field (Settings → General) was stored in the DB but never reached the agent prompt. Plumb it from the workspace row through the claim response, the daemon's Task struct and TaskContextForEnv, and render it as `## Workspace Context` in the meta brief above `## Available Commands`. Heading is skipped when the field is empty so workspaces that haven't set a context don't see a bare header. Applies to every task kind — issue, comment, chat, autopilot, quick-create — so the shared system prompt is consistent regardless of trigger source. Co-authored-by: multica-agent <github@multica.ai> * chore(server): gofmt files touched by workspace-context injection Run gofmt on the files that buildWorkspaceContext injection touched. Cleans up composite-literal alignment in execenv task context and struct-tag alignment in Task / AgentTaskResponse / RegisterRequest. No behavior change. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai> Co-authored-by: J <agent-j@multica.ai>	2026-05-22 17:23:27 +08:00
Bohan Jiang	0bb51ccd0e	feat(issues): mention parent assignee in child-done system comment (MUL-2538) (#3065 ) * feat(issues): mention parent assignee in child-done system comment (MUL-2538) Per Bohan's product call on MUL-2538 ("方案 C"), the platform's child-done system comment now @mentions the parent assignee — member, squad, or agent — and the platform fires the matching side effect explicitly: - agent → mention task via TaskService.EnqueueTaskForMention - squad → leader task via TaskService.EnqueueTaskForSquadLeader - member → 'mentioned' inbox row + EventInboxNew broadcast The generic comment listener still short-circuits on author_type='system' (see notification_listeners.go) so smuggled mention links in the child title can never light up unrelated members; the parent assignee mention is the only side effect, and it is fired from the handler with explicit guards rather than the listener path. Guards retained / added: - Comment-fire gates from prior PR unchanged (status transition, parent state, no parent). - Loop guard: skip trigger when child and parent share the same assignee (same agent / same squad / same member). The comment + mention still render so the timeline tells the full story; the second task does not fire. - Idempotency: HasPendingTaskForIssueAndAgent dedupes rapid-fire enqueues for the same parent (back-to-back child completions). - Readiness: archived agents / missing runtimes are silently skipped. Tests: - TestChildDoneMentionsParentAssignee_{Agent,Member,Squad} verify the mention link + the matching trigger / inbox row. - TestChildDoneSelfTriggerGuard_SameAgent asserts that an agent assigned to both the child and the parent gets the comment + mention but no second task — the documented loop break. - TestChildDoneNotifiesParent updated: when the parent has no assignee (its existing fixture), no routing mention should appear; the assigned branches are exercised by the new cases above. Co-authored-by: multica-agent <github@multica.ai> * feat(issues): skip child-done parent notification for human assignees (MUL-2538) Humans read their own timeline manually — an automated system comment is pure noise for member-assigned parents, and there is no agent task to trigger. Skipping the notification entirely also removes the mention question (no comment → no mention → no inbox row). The agent / squad / unassigned branches stay unchanged. Co-authored-by: multica-agent <github@multica.ai> * fix(issues): close cross-squad shared-leader loop in child-done dispatch (MUL-2538) Elon's review of PR #3065 flagged that triggerChildDoneAgent and triggerChildDoneSquad only compared the child's direct assignee, so a child-done event could still wake the same agent when: - parent assigned to agent A, child assigned to a squad whose leader is A; - parent and child assigned to two different squads sharing the same leader agent. Replace the per-side checks with a single effectiveChildAgentOwner helper that reduces the child to "the agent that would actually act on it" (the agent assignee, or the squad's leader) and lets both trigger paths compare apples to apples. Add coverage for both newly-blocked cases, and tighten the documented side-effect semantics (squad triggers leader only — no member fan-out; notification_preference is not consulted, downstream agent_task / inbox pipeline still respects mutes). Also fix the member-skip test fixture to write user_id, matching the production invariant that issue.assignee_id for assignee_type='member' references user_id (validateAssigneePair, server/internal/handler/issue.go). Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 16:05:26 +08:00
Dmitry	5bc77f2953	fix(pi): strip leaked tool markup safely (#2956 )	2026-05-22 15:46:10 +08:00
Bohan Jiang	a6f19380b2	test(agent): use ForkLock helper to fix ETXTBSY flake in thinking tests (#3062 ) Two thinking tests wrote fake CLI scripts via os.WriteFile and immediately execed them. Under t.Parallel() with the rest of pkg/agent, a sibling test's concurrent fork can inherit our still-open write fd, so Linux returns ETXTBSY at exec time (Go #22315). CI hit this on main as "TestRunCodexDebugModels_ArgvSeenByBinary: fork/exec ...: text file busy". Switch both call sites to the existing writeTestExecutable helper, which holds syscall.ForkLock across OpenFile→Write→Close so no concurrent fork can inherit the write fd. Same pattern the rest of the package already uses (kimi, kiro, codex, claude tests).	2026-05-22 14:53:56 +08:00
Bohan Jiang	c967ae0e0e	feat(issues): platform-owned parent notify on child done (MUL-2538) (#3055 ) * feat(issues): platform-owned parent notify on child done (MUL-2538) When a child issue transitions from a non-done status into `done` and has an open parent, the server now posts a top-level platform-generated comment on the parent itself. Replaces the agent-prompt rule shipped in PR #2918, which produced self-mention loops, planner ping-pong, and accidental `MUL-` prefix hardcoding because the agent did not always know the workspace prefix. - Migration 107 widens `comment.author_type` to allow `system`; the zero UUID is used as the sentinel `author_id` (the column stays NOT NULL, callers branch on `author_type === 'system'`). - `Handler.notifyParentOfChildDone` fires from both `UpdateIssue` and `BatchUpdateIssues`. Guards: prev status != done, new status == done, parent set, parent not in `done`/`cancelled`. Bypasses the CreateComment HTTP path so the assignee on_comment trigger and the mention-trigger paths do not fire — the comment content carries only the safe issue mention for the child, no `mention://agent/...` / `mention://member/...` / `mention://squad/...` links. - `runtime_config.go` downgrades the Parent/Sub-issue Protocol rule 1 to an explicit "do NOT post one yourself" guardrail; rule 2 (sub-issue creation `--status todo` vs `backlog`) is unchanged. - New handler test exercises the happy path, idempotency, reopen+done, parent done/cancelled guards, and the no-parent case. Runtime-config tests reassert the new wording and the banned strings from the prior revision. Co-authored-by: multica-agent <github@multica.ai> * fix(issues): isolate system comments + wire GH merge path (MUL-2538) Addresses the two must-fix items from the PR #3055 second review: 1. The platform-generated `comment:created` event (author_type='system') was running through the generic comment listeners, which (a) tried to subscribe the zero-UUID author and (b) parsed @mentions from the body for inbox notifications. Both subscriber_listeners and notification_listeners now early-return on author_type='system' so the event becomes a pure WS broadcast for the timeline — no inbox rows, no transcluded-mention attack surface. 2. advanceIssueToDone (the GitHub merge auto-done path) only published issue:updated and skipped notifyParentOfChildDone, so a child closed via merged PR — the dominant completion path — left the parent silent. The helper is now invoked on the same prev/updated pair, with the existing guards (transition + parent state) protecting double-fire. Tests: - New cmd/server/notification_listeners_test: TestNotification_SystemCommentSkipsInboxAndMentions (parent subscribers and smuggled @mention targets stay quiet), TestSubscriberSystemCommentDoesNotSubscribe (zero-UUID never reaches AddIssueSubscriber). - New internal/handler/github_test: TestWebhook_MergedPR_ChildWithParent_NotifiesParent fires a real pull_request closed-merged webhook against a child and asserts the parent receives exactly one safe system comment with the workspace's real identifier (no `mention://agent\|member\|squad` links). Co-authored-by: multica-agent <github@multica.ai> * fix(runtime): drop parent-notification guidance from agent brief (MUL-2538) Per Bohan's product call on PR #3055: the platform now owns the child-done parent notification, so the runtime brief should not mention the parent-comment path at all — not as an instruction, not as a "do not do it" guardrail. The previous revision kept rule 1 of the Parent / Sub-issue Protocol as a "Do NOT post your own parent-notification comment." sentence; that still puts the concept in front of the agent every run, which is exactly what we are trying to avoid. What changes: - Delete the "Parent / Sub-issue Protocol" preamble and rule 1 from buildMetaSkillContent. The remaining content — the `--status todo` vs `--status backlog` rule for creating sub-issues — now lives in a dedicated `## Sub-issue Creation` section, since the parent/child framing it previously sat under is gone. - The system comment on the parent stays exactly as in `366f6e2`: the agent simply does not need to know about it. Tests: - runtime_config_test.go is rewritten around the new section name and the wider "no parent-notification guidance" canary; the banned list now covers both the original PR #2918 wording and the intermediate "do NOT post one" wording. System comment UI: the frontend already renders `author_type === "system"` with author name "Multica" (`useActorName`) and the MulticaIcon avatar (`ActorAvatar` via `isSystem`), matching Bohan's "looks like a normal comment, author is multica + multica logo" requirement — no frontend changes needed. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 14:51:43 +08:00
Tom Qiao	1c91c2a3b2	security(db): scope DELETE/UpdateIssueStatus by workspace_id (defense-in-depth) (#3027 ) * fix(security): scope DELETE/UpdateIssueStatus by workspace_id Add workspace_id to the WHERE clause of DeleteIssue, DeleteComment, DeleteProject, DeleteSkill, DeleteChatSession, and UpdateIssueStatus as SQL-layer defense-in-depth. Handler loaders (loadIssueForUser / loadSkillForUser / etc.) already enforce workspace membership today, so this is not patching a known live vuln. But the tenant invariant is currently a handler-layer guarantee — a future loader bypass or a new caller skipping the loader would be silently catastrophic. Making workspace_id part of the SQL identity collapses the trust surface to the schema itself: forging a sibling-workspace UUID becomes ErrNoRows instead of a cross-tenant write. Reference: incident #1661 (util.ParseUUID silent zero UUID returning 204 on a DELETE that matched zero rows) — same class of failure, prevented at a different layer. Scope: - 5 DELETE queries: issue, comment, project, skill, chat_session - 1 simple UPDATE: UpdateIssueStatus (2 narg, no SET ordering risk) - All callers updated (handlers, service, runtime sweeper fallback) Multi-narg UPDATE queries (UpdateIssue, UpdateProject, UpdateSkill, UpdateComment, UpdateChatSession) are deferred to a follow-up to keep this change reviewable: each needs its narg pinning shifted and per-caller verification. sqlc was regenerated by hand (no local sqlc toolchain); CI's backend job is the authoritative compile check. test(security): add workspace_scope_guard regression test Locks in the SQL-layer tenant guard added in this PR. For each of the 6 scoped queries (DeleteIssue, DeleteComment, DeleteProject, DeleteSkill, DeleteChatSession, UpdateIssueStatus), creates the resource in workspace A, invokes the query with a foreign workspace UUID, and asserts the row is untouched (0 rows affected with no error for :exec; pgx.ErrNoRows for :one). A future refactor that drops the workspace_id arg from any of these queries will now fail loudly instead of silently regressing. Includes a sanity sub-test that the in-workspace path still mutates, so a buggy guard that returns no-op for every call would not pass. Co-Authored-By: Claude Opus 4 <noreply@anthropic.com> --------- Co-authored-by: Tom Qiao <tomqiaozc@users.noreply.github.com> Co-authored-by: Claude Opus 4 <noreply@anthropic.com>	2026-05-22 14:39:47 +08:00
Naiyuan Qing	fedd0f1694	feat(issues): live agent activity chip + per-issue indicator + filter (#3058 ) * feat(server): broadcast task:running event The dispatched → running transition was silent: only task:queued, task:dispatch, task:cancelled, task:completed and task:failed broadcast over WS. Any UI that distinguishes "queued" from "running" (e.g. the new issue-card agent activity indicator) would lag by up to the 30s agentTaskSnapshot staleTime on the most user-visible transition. StartTask now broadcasts task:running so the workspace snapshot invalidates immediately, keeping the agent activity UI live. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(issues): live agent activity chip + per-issue indicator + filter Surfaces "which agents are working on what, right now" in the Issues and My Issues views, with a one-click filter to narrow the list to issues that have a running agent task. Two visual surfaces: - Workspace chip in the header (left of Filter). Shows the brand-tinted avatar stack of agents currently running on visible issues. Click toggles a page-scoped filter; idle state renders a static "0 working" button with a hover-card placeholder. When the filter is active the chip pins to brand fill across hover and popover states (the Button outline variant otherwise repaints back to neutral). A muted "Viewing only working agents" hint sits to the left of the chip whenever the filter is on, so users notice the active state without having to hover. - Per-issue indicator on every board card and list row (top-right of the identifier line). Renders the avatar stack of agents in running or queued state on that issue, full-opacity ring at brand/70 when ≥1 is running, half-opacity stack when only queued. Returns null when nothing is in flight. Both surfaces open the same hover-card body that lists each active task with the agent avatar, status dot (composed via the existing availability + workload tokens), and a live-ticking duration. Adds a new "All" scope to /my-issues that unions assignee, creator, and involves_user_id via three parallel fetches deduped on the client — no backend changes for this part. The chip's count and the quick-filter both use the page's currently visible issue ids so they stay in sync with the active scope. State is per-user (Zustand + localStorage) and the agentRunningFilter is intentionally omitted from partialize — running state changes second-to-second and a stored toggle would land users in an unexplained empty list. WS task:running, already added in the preceding commit, drives real-time updates without polling. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(issues): swap indicator ring pulse for shimmer text label Earlier iterations layered a brand ring with various opacity-pulse cadences around the per-issue avatar stack. Every tuning attempt was either invisible (transparent ring + faded pulse) or oppressive (a visible ring that flashed on a dense board). Moves the "alive" signal onto a small text label and reuses chat's existing `animate-chat-text-shimmer` utility — a soft light sweep across the glyphs that already powers the ChatGPT-style "thinking" cue in task-status-pill. Indicator now reads as a 12 px avatar stack + 10 px label: - Running → full-opacity avatars + shimmering localized "Working" - Queued → half-opacity avatars + muted static "Queued" - Idle → render nothing (unchanged) Avatars and the surrounding card stay completely still; only the few glyphs animate. The label is i18n-driven via the existing `status_running` / `status_queued` keys, so no locale changes are required. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-22 14:20:42 +08:00
Kagura	eefc6cebaa	feat(server): add workspace-level always_redact_env setting (MUL-2495) (#2367 ) * feat(server): add workspace-level always_redact_env setting When a workspace opts into always_redact_env (via workspace settings JSON), all agent GET/LIST responses will have custom_env values masked and mcp_config nulled regardless of the caller's role. This provides a stricter security posture for single-tenant self-hosts or environments where screen-sharing or pairing makes plaintext secrets a risk. The setting is opt-in and defaults to false (preserving existing behavior). Owners can still write secrets via the update path; they just cannot read them back through the API when this setting is enabled. Closes #2352 * fix(server): fail-closed on GetWorkspace, add HTTP tests, distinguish redaction reason Address review feedback on #2367: 1. GetWorkspace failure now returns 500 instead of silently defaulting to alwaysRedact=false (fail-open → fail-closed). 2. Add HTTP-level regression tests for always_redact_env: - GetAgent with flag on → owner sees redacted env - ListAgents with flag on → owner sees redacted env - GetAgent with default settings → owner sees plaintext env 3. Add custom_env_redacted_reason field ('policy' \| 'role') to distinguish workspace-policy redaction from role-based redaction. UI now only sets readOnly when reason is 'role', allowing owners to edit env even when always_redact_env is enabled. 4. Write-back footgun tracked in #2999. Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> * fix(test): clear workspace settings before DefaultNoRedactForOwner Guard against test-order leakage: if a preceding test enabled always_redact_env on the shared workspace and its cleanup didn't run (e.g. due to -shuffle or parallel execution), this test would incorrectly see policy-level redaction. Explicitly reset settings to NULL before assertions. Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> * fix(ui): make EnvTab read-only when env is redacted by any policy Previously the readOnly guard only checked for 'role' redaction, leaving the tab editable under 'policy' redaction. This meant a user could save the form with '***' placeholder values, permanently overwriting the actual secrets. Use the boolean custom_env_redacted flag instead so the tab is locked regardless of the redaction reason. Fixes the regression flagged in the third-pass review. Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> fix: reset workspace settings to empty JSON instead of NULL Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * style: gofmt AgentResponse struct alignment Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> --------- Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-05-22 14:10:09 +08:00
Bohan Jiang	46a29b1ebb	fix(squads): warn leader against double-triggering an agent (#3053 ) Squad coordinators were both @mentioning an agent in the parent issue and creating a todo child issue assigned to the same agent, causing the agent to be triggered twice in parallel (mention dispatch + assignment dispatch). The server has no cross-issue dedupe for this case — and adding one would make @mention semantics context-dependent and unpredictable. Fix is at the prompt level: tell the squad leader that a `todo` child issue with an agent assignee already fires that agent, so they must pick exactly one delegation path for any given piece of work — comment-based @mention or todo child-issue assignment, never both. Adds a focused regression test that locks in the new rule via narrow substring checks (so harmless rewording stays free). Fixes #3033 Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 13:48:21 +08:00
Bohan Jiang	7984606eed	feat(landing): add Contact Sales page and inquiry endpoint (MUL-2493) (#2988 ) * feat(landing): add Contact Sales page and inquiry endpoint (MUL-2493) Adds a public `/contact-sales` marketing page with a needs-discovery form modelled on the design reference attached to MUL-2493 — first/last name, business email (with free-provider rejection), company name + size, country/region, intended use case, and a free-text goals field, plus the two consent checkboxes from the reference. Submissions hit a new public `POST /api/contact-sales` endpoint with per-IP rate limiting (Redis-backed via the existing RateLimit middleware, configurable through `RATE_LIMIT_CONTACT_SALES`) and a per-email hourly cap so a single business address can't be used as a flood channel after one valid pass. The inquiry is stored in a new `contact_sales_inquiry` table; analytics fires a `contact_sales_submitted` PostHog event with only the closed-enum dimensions (size, country, use case) — the free-text goals stay in the DB and are never broadcast. The page is linked from the landing header (md+) and the footer's Company column, in both English and Simplified Chinese. The reserved-slug list is updated so a workspace named `contact-sales` can't shadow the route. Co-authored-by: multica-agent <github@multica.ai> * fix(landing): canonicalize business email and tighten contact-sales form (MUL-2493) - Parse the submitted email with net/mail and run the free-email block-list against the canonical addr.Address, so a display-name form like `Ada <ada@gmail.com>` can no longer slip past the gate (the raw string had domain `gmail.com>`, which wasn't blocked). Adds regression tests covering the display-name bypass and the canonicalization helper. - Drop noValidate from the contact-sales form so the browser's native required / email / select checks fire before submit; the JS-side free-email warning still runs as a UX guard. - Update success copy ("respond within three business days") in EN and ZH plus the page metadata. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 13:22:36 +08:00
Bohan Jiang	424f67f7cb	fix(security): normalize MIME type in isInlineContentType (#3050 ) isInlineContentType is the security boundary that decides whether an uploaded file is served with Content-Disposition: inline (renderable in the document origin) or attachment. The SVG carve-out added in #3023 to block stored-XSS via uploaded .svg only matched the exact literal "image/svg+xml", so callers that supply "IMAGE/SVG+XML", "image/svg+xml; charset=utf-8", or whitespace-padded variants would still see disposition=inline. MIME type matching is case-insensitive per RFC 2045 §5.1 and may carry parameters, so the safe thing is to normalize at the boundary instead of trusting every caller. Today both call sites (S3.Upload and LocalStorage.Serve) happen to feed in the exact literal because the upload handler overrides .svg to "image/svg+xml" before storage sees it, so this is defense-in-depth rather than a live regression. Hardens the helper so any future caller (including one that ever trusts a client-supplied Content-Type) stays behind the same guard. Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 13:01:52 +08:00
Tom Qiao	295df8d928	fix(security): force attachment disposition for SVG uploads (#3023 ) SVG files are XML and can carry <script>, <foreignObject>, or onload= attributes that execute in the document's origin when rendered inline. The upload handler maps .svg to image/svg+xml, and storage backends (local + S3) previously set Content-Disposition: inline based on the image/ prefix in isInlineContentType. A workspace member could upload a crafted SVG, share its attachment URL in an issue or comment, and any teammate who clicks the link would execute attacker-controlled JS in the application's first-party origin (reading auth cookies, posting to authenticated endpoints). Exclude image/svg+xml from isInlineContentType so both storage paths serve SVG with Content-Disposition: attachment. Test coverage: - New util_test.go covers the inline/attachment matrix including SVG. - Existing local_test.go ContentDisposition table gains an SVG case. Co-authored-by: Tom Qiao <tomqiaozc@users.noreply.github.com>	2026-05-22 12:51:43 +08:00
LinYushen	5bacfd9742	MUL-2526 feat: add member(user_id, workspace_id) index + upgrade sqlc to v1.31.1 (#3046 ) - Add migration 106: CREATE INDEX CONCURRENTLY on member(user_id, workspace_id) - Rewrite ListWorkspaces to drive from member table with explicit fields - Regenerate all sqlc code with v1.31.1 (intentional version upgrade) Co-authored-by: multica-agent <github@multica.ai>	2026-05-22 12:26:56 +08:00
Tom Qiao	b9602adabe	fix(handler): validate skill id UUID at request boundary (#3025 ) loadSkillForUser was passing chi.URLParam(r, "id") directly into parseUUID, the panic-on-invalid helper reserved for trusted UUID round-trips. A malformed `/api/skills/{notuuid}` request panicked in util.MustParseUUID; chi's middleware.Recoverer turned it into a 500 instead of a 400. This violates the documented convention (CLAUDE.md → "Backend Handler UUID Parsing Convention"): pure-UUID request inputs must use parseUUIDOrBadRequest, which writes a 400 and short-circuits. Switch loadSkillForUser to parseUUIDOrBadRequest. Behaviour for valid UUIDs is unchanged; malformed input now returns 400 with a clear "invalid skill id" message. Test: - TestGetSkill_MalformedUUIDReturns400 asserts GET /api/skills/not-a-uuid returns 400. Co-authored-by: Tom Qiao <tomqiaozc@users.noreply.github.com>	2026-05-22 12:22:07 +08:00
Naiyuan Qing	fbd965e5bf	feat(onboarding): v3 — thin server, frontend-orchestrated welcome (#3008 ) * feat(onboarding): Multica Helper as general workspace assistant + blocking modal Reshape Multica Helper from an onboarding-only guide into the workspace's general-purpose AI assistant. The agent's permanent identity (injected as `## Agent Identity` into every task's CLAUDE.md / AGENTS.md / GEMINI.md via execenv.InjectRuntimeConfig) is rewritten to three sections that don't overlap with what the brief already provides: - Who I am (built-in workspace assistant, not onboarding-only) - What Multica is + docs/source/issues URLs as knowledge sources - What I can do (CLI = manifest, `multica --help` is the source of truth) - Tone (concise, like a colleague, match user's language) Bootstrap moves out of the in-flow Step 4. Runtime step now exits the onboarding shell with no bootstrap call; a blocking OnboardingHelperModal mounts inside the workspace layout (web + desktop) and gates purely on `me.onboarded_at == null`. The user picks one of three starter prompts (intro / assign / second_agent) and the modal calls BootstrapOnboardingRuntime with a new optional `starter_prompt` field that becomes the seeded onboarding issue's description. Side effects required to make `onboarded_at == null` an honest signal: - CreateWorkspace no longer marks onboarded (was atomic with CreateMember). The "member exists ⟹ onboarded_at != null" invariant is intentionally broken; guards (useDashboardGuard / desktop App.tsx) already tolerate this — comments updated to reflect the new contract. - AcceptInvitation still marks (invitee skips the modal in someone else's workspace). Code comment added warning future removers. - resolvePostAuthDestination flips to workspace-presence-first: a user with a workspace lands in it regardless of `onboarded_at`, so the modal can pick up an interrupted setup on relogin. Other backend changes: - `onboardingAssistantDescription` rewritten ("Built-in workspace assistant…") - `onboardingAssistantInstructions` rewritten to the 3-section identity - `bootstrapOnboardingRuntimeRequest.StarterPrompt` (optional, 2 KiB rune cap, empty-falls-back-to onboardingIssueDescription) Frontend changes: - Delete `packages/views/onboarding/steps/step-teammate.tsx` (no longer a persisted step) - `ONBOARDING_STEP_ORDER` and `OnboardingStep` type drop `"teammate"` - `handleRuntimeNext` exits via `onComplete(workspace, undefined)` — no bootstrap, `onboarded_at` stays NULL so the modal fires - Runtime step next-button copy → "Start exploring" / "开始探索" - New `packages/views/workspace/onboarding-helper-modal.tsx`: Base UI Dialog, dismissible=false, three localized cards, mutation invalidates agents + issues queries then navigates to the seeded issue - Mounted in both `apps/web/app/[workspaceSlug]/layout.tsx` and `apps/desktop/src/renderer/src/components/workspace-route-layout.tsx` Tests: - Backend: TestBootstrapOnboardingRuntime_{With,No}StarterPrompt and TestCreateWorkspace_DoesNotMarkOnboarded - Frontend: onboarding-helper-modal.test.tsx covers all four gating conditions, three-card behavior, mutation pending state, and the "no close button" invariant Compatibility: - Already-onboarded users: zero impact (modal can't fire) - Invitees: AcceptInvitation still marks → modal can't fire - Skip-runtime path: BootstrapOnboardingNoRuntime still marks → modal can't fire - Old desktop / web clients: legacy teammate-step path keeps working (bootstrap accepts missing starter_prompt) — the new modal only fires on the new frontend bundle - Avatar SVG kept (asterisk variant) — no migration of existing Helper agents, only newly-created Helpers pick up the new instructions/description Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(desktop): suppress OnboardingHelperModal while a WindowOverlay is open On desktop, App.tsx auto-creates a tab pointing at the user's first workspace as soon as workspaces.length flips from 0 → 1 (during onboarding Step 2). The new tab mounts WorkspaceRouteLayout under the overlay, which mounts OnboardingHelperModal. The modal's Portal renders to document.body — appearing AFTER the WindowOverlay in DOM order, so its z-50 wins and the modal floats in front of the still-active onboarding Step 3 (runtime). Suppress the modal whenever any WindowOverlay is active. When the overlay closes (onComplete fires after the user finishes onboarding), the modal re-evaluates `me.onboarded_at == null` and pops on its own. Web is unaffected (onboarding flow lives at /onboarding, not under /[workspaceSlug]/, so WorkspaceRouteLayout never mounts during the onboarding flow). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * docs(onboarding): add v2 refactor plan Captures the design + 8-step implementation order for collapsing the onboarding state machine: single mark-onboarded entry point, persisted Step 3 user choice, dumb Modal, single install-runtime seed call site. Includes old-user compatibility analysis (4 existing gates) and per-PR risk/rollback. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(db): persist Step 3 runtime choice on user record (MUL-onboarding-v2) Adds onboarding_runtime_id UUID NULL + onboarding_runtime_skipped BOOLEAN columns to "user" and the CHECK constraint enforcing the 3-state machine (unset / picked-runtime / explicit-skip; the fourth combination is forbidden). ON DELETE SET NULL on the FK so a deleted runtime degrades to "unset" rather than dangling. PatchUserOnboarding gains the two narg fields plus CASE expressions that collapse the runtime/skipped pair atomically — a follow-up PATCH that flips one side now clears the other in the same statement, instead of preserving it via per-field COALESCE and tripping the CHECK constraint. Backwards compatible for existing users: both new fields default to (NULL, false), which is the "unset" leaf of the state machine, and four upstream gates on me.onboarded_at != null already short-circuit the new fields' readers for everyone who's already onboarded. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * refactor(server): collapse onboarding side effects to service layer Introduces OnboardingService.MarkComplete and WorkspaceContentService.{Ensure,Seed}InstallRuntimeIssue as the single authorities for the two onboarding side effects that used to be duplicated across four handlers: - MarkUserOnboarded + claim starter_content_state + optional install-runtime fallback seed: was inline in BootstrapOnboardingRuntime, BootstrapOnboardingNoRuntime, AcceptInvitation, and CompleteOnboarding. - install-runtime issue seeding: was inline in CreateWorkspace and AcceptInvitation as a "no runtime yet" fallback. After this refactor: - MarkUserOnboarded is called from exactly one place (the service). - install-runtime issue is seeded from exactly one place (the service). - CreateWorkspace deliberately does not seed — the new /ensure-onboarding-content endpoint (also added here) lets the workspace-entry init component request the seed on first mount, so workspaces created but never opened don't accumulate stale issues. - The PatchOnboarding handler now accepts the new runtime_id / runtime_skipped fields and rejects (uuid, skipped=true) up front. - UserResponse exposes the two new persisted fields so the frontend can read them off `me` without an extra round-trip. Handler-side tests added: TestPatchOnboarding_RuntimeChoiceSwitch (the explicit cross-request switch path that the original COALESCE design would have 500'd on) + TestPatchOnboarding_PreserveUntouched. Old handler-local file no_runtime_issue.go is deleted; its content moved to service/workspace_content.go with the helpers exported. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(core): API + types for persisted onboarding runtime choice User type / Zod schema gain onboarding_runtime_id (string \| null) and onboarding_runtime_skipped (boolean); EMPTY_USER + test fixture updated to match. api.patchOnboarding accepts the new optional fields and the new api.ensureOnboardingContent endpoint is wired so the workspace shell can request the fallback seed. Two new store helpers — recordOnboardingRuntimeChoice(runtimeId) and recordOnboardingRuntimeSkipped() — replace the prior pattern of Step 3 calling bootstrap directly. They PATCH the user's choice, sync the auth store, and return. Mutually exclusive on the server side via the CHECK constraint; the client just ships one intent at a time. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(workspace): WorkspaceOnboardingInit single decision point + dumb Modal Replaces OnboardingHelperModal's self-gating render path with a 4-branch dispatcher that runs once on workspace-shell mount: branch 0 me.onboarded_at != null → ensure install-runtime issue fallback, render nothing branch 1 me.onboarding_runtime_skipped → SkipBootstrapping component: loading veil → bootstrap → navigate. On failure shows a Retry UI instead of silently freezing the veil branch 2 me.onboarding_runtime_id → render Modal with the runtime id from `me` (no internal list query) branch 3 (none of the above) → useEffect navigate back to /onboarding so the user walks Step 3 again The Modal itself is now a dumb component — receives `workspace` and `runtimeId` as props, no internal gates, no runtimeListOptions query. Tests rewritten to cover the props-driven render + pick-card paths; the prior gating tests move into the new workspace-onboarding-init.test.tsx alongside the M2 retry-on-failure behaviour. Mounted in both apps/web/app/[workspaceSlug]/layout.tsx and the desktop workspace-route-layout. Desktop keeps its `!overlayActive` suppression guard so the init doesn't portal-jump in front of an active WindowOverlay. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(onboarding): Step 3 records user choice instead of calling bootstrap handleRuntimeNext now PATCHes the user's pick (recordOnboardingRuntime {Choice,Skipped}) and navigates straight into the workspace shell. The workspace-entry WorkspaceOnboardingInit reads the persisted choice off `me` and runs the appropriate branch — Step 3 is pure intent capture with zero side effects on its own. PATCH must succeed before navigation: if it fails the user stays on Step 3 with a toast, because navigating with no persisted intent would land them in WorkspaceOnboardingInit's branch 3 "no decision yet" rescue and trigger a redirect loop back to /onboarding. The prior asymmetry (Connect deferred bootstrap to the workspace, Skip ran bootstrap inline) is gone — both paths defer to the workspace shell now. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(onboarding): v3 — thin server, frontend-orchestrated welcome Collapse v2's persisted runtime-choice fields + 4-branch dispatcher + OnboardingService/WorkspaceContentService stack down to a single rule: `onboarded_at` is the only state field, layout hard-gates on it, and the welcome experience after Step 3 is owned entirely by the frontend. V3 flow - Step 3 button: await POST /api/me/onboarding/complete (mark only) + park a transient signal in `useWelcomeStore` + navigate - Workspace layout: hard gate `onboarded_at == null` -> /onboarding - `<WelcomeAfterOnboarding />` reads the welcome-store signal: - runtime path: find-or-create Multica Helper via generic createAgent with bilingual instructions from `templates/helper-instructions.ts`, blocking modal with 3 starter cards, pick -> createIssue + navigate - skip path: provision install-runtime (in_progress) -> agent-guide (todo, body embeds install-runtime mention chip) -> follow-up comment on install-runtime mentioning agent-guide; then pop celebration modal with 🎉 emoji pop animation, 2 read-only preview cards, single [Got it] CTA that navigates to install-runtime Server cleanup - Drop OnboardingService, WorkspaceContentService, v2 runtime-choice columns/CHECK on user, EnsureOnboardingContent endpoint - CompleteOnboarding/AcceptInvitation call qtx.MarkUserOnboarded directly (no service indirection) - BootstrapOnboardingRuntime / BootstrapOnboardingNoRuntime kept as a deprecation shim in onboarding_shim.go for desktop < v3 during the rollout window — handlers inlined to qtx.* calls, no service layer Localization - Persisted strings (issue titles/bodies, Helper instructions/ description, comment prefix) live as TS const `{en, zh}` maps in `packages/views/onboarding/templates/` — i18n bundle staleness can no longer write raw key paths into DB - UI-rendered strings (modal copy, status chips, buttons) stay in `packages/views/locales/{en,zh-Hans}/onboarding.json` - Language picked from live `i18n.language` (not `me.language`, which is null for new users until they pick a preference) Race protection - Module-level promise dedupe (`findOrCreateHelper`, `seedIssueDeduped`, `postCommentDeduped`) so React StrictMode double-mount can't fire two parallel API calls that the server would then 409 Cross-references between the two skip-path issues render via Multica's mention-chip protocol `[<identifier>](mention://issue/<uuid>)` so they match the styled IssueChip pills used elsewhere. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * feat(onboarding): welcome-after-onboarding modal redesign + cross-user safety Welcome modal polish (the post-Step-3 surface this branch already introduced): Runtime path - Helper avatar replaces the bouncy 🎉 hero; tone-down animation to fade. New copy: "Hi, welcome to Multica / I'm your first Agent assistant" + capability hint sentence so users discover assignment + chat from the first screen. - Cards changed from "click = submit" to multi-select with the existing border-primary + ring selection pattern used by compact-runtime-row; bottom CTA "Assign N tasks to me →" appears only with N>0. - New starter cards: intro / tour / welcome_page (the last one tells Helper to paste an HTML welcome page into the issue comment — works on any runtime regardless of fs access). - Success state added between createIssue and navigation: 🎉 + "All set!" + "Sit tight ☕ — your {agentName} is on it" + inbox/chat hints, single [Got it] button. - Title/prompt for starter cards now live in TS const HELPER_STARTER_PROMPTS (persisted to DB — must not depend on i18n bundle being loaded); subtitle stays in onboarding.json. Skip path - Body restructured into three independent ```md blocks (Name / Description / Instructions) so each picks up the markdown renderer's per-block copy button — no manual extraction. - ZH body now embeds the ZH Helper Description + Instructions (was Chinese-around-English-block). - Follow-up comment uses Multica's mention-chip protocol [identifier](mention://issue/uuid) so it renders as the styled IssueChip pill. - Issue titles bilingual with "Step 1 / Step 2" prefix. Cross-user / cross-workspace safety (code review feedback) - web onLogout + desktop handleDaemonLogout now call useWelcomeStore.reset() so user B logging into the same browser doesn't inherit user A's signal. - WelcomeAfterOnboarding gates on currentWorkspace.id === signal.workspaceId — prevents firing the modal in workspace B when the signal was parked for workspace A (desktop multi-tab, back/forward, deep-link). - Module-level promise dedupes (pendingHelperSetup, pendingIssueSeed, pendingCommentSeed) for the three API calls so React 18+ StrictMode dev double-mount can't race-create duplicates. Other small fixes carried in this commit - Helper instructions / agent description / starter card titles all read i18n.language (not me.language, which is null for new users who haven't picked a UI language preference yet). - Reverted welcome-emoji-pop animation to a small fade for the runtime avatar (kept the bouncy variant for the skip 🎉 hero where the celebration is the whole point). - Removed the duplicate 🎉 from the skip modal title (kept the hero one only). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix(views): i18n hardcoded "Close" in welcome FullScreenError CI lint (i18next/no-literal-string) blocked on a literal "Close" string inside `FullScreenError` — surfaced as a nit in the original code review but missed in the merge. Add `error_close` to onboarding.json (EN: "Close" / ZH: "关闭") and thread it through as a `closeLabel` prop, matching the existing `retryLabel` plumbing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-21 19:00:26 +08:00
Bohan Jiang	ae530ef057	docs(runtime): tighten issue-metadata write bar (MUL-2507) (#3004 ) The previous wording invited agents to pin too much: any opened PR, external link, or "fact future agents will want one-glance access to" was framed as worth writing, with no explicit upper bound. In practice this caused metadata bags to accumulate single-run details and description-summary noise instead of the small set of repeatedly-read values the feature was designed for. Rework the agent runtime brief and the CLI docs to lead with the bar: write a key only when it is materially important AND likely to be re-read by future runs on the same issue. "Most runs write zero new keys" is now stated as the expected case, and the workflow exit step is rewritten to mirror the same gate. Recommended-key list, safety boundaries, and stale-key cleanup are preserved so the locked-in test anchors still pass. Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 17:20:43 +08:00
LinYushen	e288eff2c5	feat: server auto-generates PAT for cloud runtime bootstrap (#3002 ) When bootstrap is enabled and no PAT is available from the request header or Authorization bearer token, the server now generates a new PAT automatically and forwards it to the cloud service. This removes the need for the frontend to pass X-User-PAT — the server handles it entirely.	2026-05-21 17:07:44 +08:00
YOMXXX	29c2a5d18f	fix(daemon): reclaim stale dispatched claims (MUL-2485) (#2872 ) * fix(daemon): reclaim stale dispatched claims * fix(daemon): widen stale claim reclaim window	2026-05-21 17:06:55 +08:00
Bohan Jiang	0c767c0052	feat(issues): per-issue metadata KV (MUL-2017) (#2845 ) * feat(issues): per-issue metadata KV (MUL-2017) Adds a small JSONB KV map to every issue for agent pipeline state (attempts, PR number, pipeline status, ...). Keys match a narrow regex, values are primitives (string / number / bool), capped at 50 keys per issue and 8KB per blob. Defense-in-depth via two CHECK constraints (object shape + size). All mutations are single-key atomic (jsonb_set / `- key`). `UpdateIssue` intentionally does NOT touch metadata: a whole-blob overwrite would race with concurrent agent writes. GET /api/issues/:id/metadata PUT /api/issues/:id/metadata/:key body: { "value": <primitive> } DELETE /api/issues/:id/metadata/:key Containment filter on list: GET /api/issues?metadata=<json-object> uses PG `@>` against a `jsonb_path_ops` GIN index. Mirrored across ListIssues, CountIssues, ListOpenIssues, and the hand-rolled ListGroupedIssues SQL so CLI/API and UI grouped views stay consistent. CLI: multica issue metadata {list,get,set,delete} multica issue list --metadata key=value (repeatable, AND) set has --type to override the default value-sniffing Co-authored-by: multica-agent <github@multica.ai> * fix(issues): metadata test bugs + wire realtime + read-only display (MUL-2017) - Fix two failing handler tests blocking backend CI: - reset decode target after delete so map merge does not mask removal - url.PathEscape the key segment so spaces no longer panic NewRequest - Wire issue_metadata:changed end to end so the detail / list / my-issues caches stay in sync with set/delete events (other tabs, CLI writes). - Add a read-only Metadata strip to the issue detail sidebar; hidden when the issue has no keys so it stays quiet in the common case. Co-authored-by: multica-agent <github@multica.ai> * feat(runtime): teach agents to read/write issue metadata (MUL-2017) Add an `## Issue Metadata` section to the runtime brief plus a `metadata list` step on entry and a `metadata set`/`delete` step on exit. Section only emits when the task carries an issue id (comment- or assignment-triggered); chat / quick-create / run-only autopilot stay clean so they don't fire failing CLI calls. Co-authored-by: multica-agent <github@multica.ai> * fix(issues): bump metadata migration to 105 and drop attempts as example (MUL-2017) main is now at 104_drop_runtime_timezone; the migrator picks LatestVersion() by sorted filename, so a slot before the tail would let DBs that have already run 099–104 think they're up-to-date while the issue.metadata column is missing — runtime would then fail with column does not exist. Renumbering to 105 puts the migration at the tail and forces it to run. Also drop attempts as a positive example across docs/code comments and test fixtures — the runtime instruction prompt already lists it under "What NOT to pin" (runtime bookkeeping). Replace with pr_number, which is in the recommended-keys set, so docs/tests speak the same language as the prompt. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 16:35:45 +08:00
Bohan Jiang	9a5d8a52f3	fix(timezone): harden hourly-rollup rollout against straight-through migrate MUL-2488 (#2998 ) * fix(timezone): harden hourly-rollup rollout against straight-through migrate MUL-2488 PR #2968 introduced the new task_usage_hourly rollup but assumed operators would stop migrate between 102 and 103 to run the one-shot cmd/backfill_task_usage_hourly. Two pieces made that unsafe in practice: 1. The Dockerfile only shipped server / multica / migrate, so a deployed container has no backfill binary to run between phases. 2. cmd/migrate has no per-version stop, and entrypoint.sh runs `migrate up` to the latest version, so 103 silently drops the legacy daily rollups even when nobody ran the backfill — leaving usage dashboards at zero despite source data being intact in task_usage. Changes: - Build cmd/backfill_task_usage_hourly into the runtime image alongside the other binaries so operators can `docker exec` the backfill instead of needing a source checkout. - Add a fail-closed plpgsql guard at the top of migration 103 that aborts the migration when task_usage has rows but task_usage_hourly is empty. Fresh databases (no task_usage rows) are exempt because the new triggers from 102 will populate the hourly table on the first event. Already-applied databases are unaffected — schema_migrations tracks by version only, so 103 is not re-run. Co-authored-by: multica-agent <github@multica.ai> * fix(timezone): use watermark coverage for hourly-rollup guard The previous check only required `task_usage_hourly` to be non-empty, which an interrupted backfill or a manual `rollup_task_usage_hourly_window` call both satisfy. The completion signal we actually trust is `task_usage_hourly_rollup_state.watermark_at` — backfill only stamps it to `now() - 5 min` after every monthly slice succeeded, and the cron worker only advances it on a real tick. Default after migration 101 is `1970-01-01`, so an unrun or partial backfill is trivially detected. Also corrects the comment about fresh-install behavior: the triggers in 102 only enqueue dirty keys for agent_task_queue / issue / task_usage DELETE — they do not write hourly rows. INSERT/UPDATE flows through the `updated_at` watermark window of `rollup_task_usage_hourly()`, which only runs once the operator registers it as a pg_cron job. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 16:26:42 +08:00
YYClaw	614dfae884	MUL-2488 feat(timezone): Scheduling / Viewing two-layer timezone architecture (#2968 ) * docs(timezone): add scheduling/viewing timezone architecture RFC * feat(db): replace daily rollups with task_usage_hourly, add user.timezone Migrations 100-104: add "user".timezone (Viewing tz), build the UTC hourly task_usage_hourly rollup with its pipeline, drop the legacy task_usage_daily / task_usage_dashboard_daily pipelines, and drop the agent_runtime.timezone column. Report queries now slice day boundaries at read time by the caller-supplied @tz instead of materialising in a fixed tz. Regenerate sqlc. * feat(server): add task_usage_hourly backfill command Replace the two legacy backfill commands (daily / dashboard_daily) with a single backfill_task_usage_hourly that loads historical task_usage into the new UTC hourly rollup, sliced per workspace. * refactor(server): resolve viewing timezone in report handlers Report handlers resolve the Viewing tz per request (?tz query param, then user.timezone, then UTC) and pass it to the hourly-rollup queries. Drop the UseDailyRollup feature flags and the old raw-scan/daily-rollup dual paths, remove the /api/usage endpoints, and stop the daemon from reporting and the runtime handler from accepting host timezone. * refactor(core): switch report queries to viewing timezone API client and dashboard/runtime queries send ?tz with each report request, the user schema/types carry the new timezone field, and the runtime timezone field/mutation is removed. * feat(views): add viewing timezone preference and UI Add the useViewingTimezone hook and a Timezone setting in Preferences; report charts and the dashboard week boundary follow the viewer tz. Remove the runtime detail timezone editor and its locale strings. * fix(test): update fixtures and stabilize tests for timezone refactor The timezone architecture refactor changed several types without updating dependent test code: - RuntimeDevice no longer has a timezone field — drop it from the create-agent-dialog runtime fixture. - User now requires a timezone field — add it to the apps/web mockUser fixture. - The PreferencesTab timezone tests asserted on the async save handler (PATCH then store update) with a bare expect, racing the mutation's settle callback, and timed out querying the Select's ~600-option IANA list on a loaded CI runner. Wrap the assertions in waitFor and extend the timeout for those three tests. * docs(timezone): document self-host migration order and trigger invariant Add a SELF-HOST UPGRADE ORDER runbook to the backfill command's package comment: applying migrations 100-104 in a single migrate-up drops the legacy daily rollups before the hourly backfill runs, leaving dashboards empty until cron catches up. Add an INVARIANT comment on trg_atq_dirty_hourly noting that agent_id must be added to the trigger's OF list if it ever becomes mutable, otherwise dirty buckets for the old agent_id are silently missed. * style(runtimes): drop trailing blank line in runtime-detail	2026-05-21 15:33:47 +08:00
Multica Eve	41cb91abd9	feat: add cloud runtime fleet proxy API (MUL-2453) (#2986 ) * feat: add cloud runtime fleet proxy API Co-authored-by: multica-agent <github@multica.ai> * test: cover cloud runtime handler nits Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: Eve <eve@multica-ai.local> Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 15:06:10 +08:00
Bohan Jiang	7f9e4e829d	feat(comments): thread-internal --tail pagination + reply cursor (MUL-2421) (#2846 ) * feat(comments): thread-internal pagination via --tail + reply cursor (MUL-2421) Long threads inside a single issue still forced agents to read every reply once they used --thread, even after MUL-2387 fixed cross-thread noise. This adds reply-level paging so a 200-reply thread can be navigated tail-first without dragging the whole conversation into prompt context. - New SQL query ListThreadCommentsForIssuePaged: same recursive root walk as the legacy thread query, but caps reply count and supports an (created_at, id) composite cursor. Root is unconditional — even tail=0 emits it so the reader keeps the "what is this thread about" context. - Handler ListComments: parses `tail` (non-negative, ThreadTailSet flag preserves the tail=0 intent), threads it through to the paged query, and re-uses X-Multica-Next-Before / X-Multica-Next-Before-Id for the reply cursor. Cursor's meaning is now context-dependent: thread cursor under --recent, reply cursor under --thread + --tail. - CLI: new --tail flag (only valid with --thread; mutually exclusive with --recent), reply-cursor semantics for --before / --before-id when paired with --thread + --tail, stderr label flips to "Next reply cursor" so an operator copy-pasting the cursor knows which scope it scrolls. - Tests cover the new contract: tail=N keeps newest N + root, tail=0 is root-only, anchor on a nested reply still walks up, reply cursor scrolls older replies page-by-page, since combined with tail filters after the cut, and the negative-flag-combination matrix. Out of scope: prompt template update to hint at `--thread <id> --tail 30` on long threads — separate follow-up per the issue. Co-authored-by: multica-agent <github@multica.ai> * fix(comments): only emit reply cursor when older reply exists (MUL-2421) The thread-tail path emitted `X-Multica-Next-Before` whenever the page filled to exactly the requested reply count, even when there was nothing older to scroll to. So `--thread <root> --tail 3` on a thread with exactly 3 replies sent a cursor that, when followed, returned just the root — a wasted round-trip that surfaced as a phantom "older replies" affordance in the agent prompt. Switch to a `reply_limit + 1` probe: ask the SQL for one extra row, trim the oldest overflow before responding, and only emit the cursor when an older reply actually existed. The exact-boundary case (replyCount == tail with no overflow) now returns no cursor. Also documents `--thread/--tail/--recent/--before` and the cursor semantics in CLI_AND_DAEMON.md, which was the second must-fix in the MUL-2421 review. Co-authored-by: multica-agent <github@multica.ai> * fix(comments): suppress reply cursor when --since covers older replies (MUL-2421) In the thread + tail + since path the server still emitted a reply cursor whenever there was an older reply on disk, regardless of `since`. If the oldest retained reply on the page was already `<= since`, every older reply was guaranteed to be filtered out too, so the next page only ever returned the root — wasting round-trips until the agent walked the whole pre-`since` history. Mirror the recent + since suppression: when `replies[0].CreatedAt <= since`, drop the cursor. Test covers the exact case from Elon's review: tail=2 overflow, body keeps a fresher reply, but the cursor target (oldest retained reply) is already past `since` — header must be empty. Co-authored-by: multica-agent <github@multica.ai> * feat(prompt): default comment-trigger reads to --thread --tail 30 (MUL-2421) Comment-triggered agents previously defaulted the trigger-thread read to the unbounded `--thread <id> --output json`, which dumps the full thread into the prompt — exactly the kind of context bloat MUL-2387 fixed at the cross-thread layer but never bounded inside a single thread. Use the new `--tail` flag landed earlier in this PR (server + CLI) as the default for both the per-turn prompt and the runtime-config Workflow: - `--thread <trigger-id> --tail 30 --output json` is the new default. Root is always included so "what is this about" context survives. - If 30 replies aren't enough, the prompt now spells out the reply cursor: re-feed the stderr `Next reply cursor: --before <ts> --before-id <reply-id>` pair back to walk older replies. - `--recent 20` stays as the cross-thread background fallback, with an explicit callout that the same `--before` / `--before-id` flags walk threads (not replies) in that mode. - Available Commands core line now surfaces `--tail N` and both stderr cursor labels so non-workflow callers also discover the flag. - `--since` callouts reflect the post-MUL-2421 combinable mode names (`--thread --tail` / `--recent`). Tests (`prompt_test.go`, `execenv_test.go`) pin the new defaults and add a regression guard against the unbounded `--thread` recipe sneaking back in. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 13:43:15 +08:00
YOMXXX	83e90c9530	fix(ws): log auth frame write failures (#2946 )	2026-05-21 13:33:12 +08:00
Bohan Jiang	ef6a944063	fix(cli): accept slug + short UUID prefix in workspace get/update/member (#2972 ) * fix(cli): accept slug + short UUID prefix in workspace get/update/member (MUL-2385) `workspace list` shows the 8-char short UUID prefix, name, and slug by default; `workspace get`/`update`/`member list` only accepted full UUIDs. That broke the natural list -> get flow: every value the user could copy from list output was rejected. They had to either rerun list with `--full-id` or parse the JSON output -- both implementation-detail level operations. Extend `resolveWorkspaceByIDOrSlug` with a short UUID prefix fallback (>=4 hex chars, ambiguous matches return all candidates), introduce `resolveWorkspaceRef`/`resolveWorkspaceArg` helpers that fetch the caller's accessible workspaces and resolve UUID/slug/prefix in one call, and wire them into get/update/member list (switch already used the same list-then-resolve pattern). Full UUIDs short-circuit the extra `/api/workspaces` round trip; access control remains on the downstream endpoint. Also add a one-line tip after `workspace list` table output pointing users at get/update/switch with the same identifier columns, and broaden the command Use strings to `<id\|slug\|prefix>` so help reflects the new behavior. Refs https://github.com/multica-ai/multica/issues/2750 Co-authored-by: multica-agent <github@multica.ai> * chore(cli): include prefix hint in workspace list footer Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-21 13:08:44 +08:00
YOMXXX	ed2957ddf8	fix(claude): record result model usage (#2899 )	2026-05-21 13:00:12 +08:00
iYuan	2f1f90c11a	fix(agent): retry codex semantic inactivity fresh (#2593 )	2026-05-20 20:03:39 +08:00
Bohan Jiang	8d4f4caf4a	MUL-2338 fix(comments): allow agent self-mention to enqueue cross-issue handoff (#2928 ) * fix(comments): allow agent self-mention to enqueue cross-issue handoff The @mention path in CreateComment unconditionally skipped any self-mention. That dropped the child→parent handoff between issues assigned to the same agent: the child run posted `@J` on the parent issue, the guard tripped, and the parent's J was never woken — the chain silently broke. Drop the self-trigger `continue` in the agent mention branch. Runtime ready / private-agent gate / HasPendingTaskForIssueAndAgent dedup all remain, so a same-issue self-mention while a queued or dispatched task exists is still deduped; a running task no longer pre-empts a new follow-up (the existing queue coalescing handles that). Three regression tests: - cross-issue self-mention enqueues a task on the target issue - same-issue self-mention while running queues a follow-up - same-issue self-mention with a pre-existing queued/dispatched task is deduped MUL-2338 Co-authored-by: multica-agent <github@multica.ai> * test(handler): assign per-workspace issue number in self-mention fixture The fixture inserts two issues in the same test workspace; without an explicit number both default to 0 and the second insert violates uq_issue_workspace_number, taking the backend CI job down on PR #2928. Mirror the workspace-counter advancement pattern from issue_scheduled_test.go so each fixture issue gets a unique number. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 17:18:41 +08:00
YOMXXX	34f16e2c7a	fix(opencode): deny interactive questions in daemon mode (#2878 ) * fix(opencode): deny interactive questions in daemon mode * fix(opencode): avoid permission env ordering bypass	2026-05-20 17:17:31 +08:00
Bohan Jiang	aeb284cbeb	feat(runtime): teach agents the parent/sub-issue protocol (MUL-2338) (#2918 ) * feat(runtime): teach agents the parent/sub-issue protocol (MUL-2338) Adds a Parent / Sub-issue Protocol section to the runtime brief built by `buildMetaSkillContent`, emitted whenever the agent is running on a real Multica issue (assignment- or comment-triggered). Two behaviors are now documented for every issue-bound agent: - A. When wrapping up a child issue, post the final result and switch to `in_review` on this issue first, then post a single top-level comment on the parent. Mention the parent assignee only when it is another agent on a still-open parent — never self-mention, never @ member / squad, never re-trigger a `done` / `cancelled` parent. - B. When creating sub-issues, choose `--status backlog` for sub-issues that must wait and `--status todo` for the one to start immediately; promote with `multica issue status <id> todo` when its turn comes. The signal is explicitly framed as best-effort — no server-side state sync, no claim of a guaranteed handshake. The section is skipped for chat, quick-create, and run-only autopilot runs, which have no parent/child semantics. Tests in runtime_config_test.go assert that the section is present in both issue workflows, absent in the three non-issue modes, and that the wording does not introduce a non-existent `multica issue list --parent` command or promise a reliable handshake. Co-authored-by: multica-agent <github@multica.ai> * fix(runtime): split Step A of parent/sub-issue protocol by trigger type (MUL-2338) Comment-triggered runs were inheriting an unconditional `multica issue status <this-issue-id> in_review` from Step A, which conflicts with the comment-triggered workflow rule "Do NOT change the issue status unless the comment explicitly asks for it" (Elon's blocking review on PR #2918). Step A now branches on trigger type: - Assignment-triggered: keep "post final results + flip in_review". - Comment-triggered: complete the reply per the existing workflow rule, only flip status when the triggering comment asked for it, and gate the parent-notification steps on actually closing out child work. Tests lock the boundary: comment-triggered briefs must not contain the unconditional in_review command, must echo the existing status guardrail inside Step A, and must spell out the "closing out" gate. Assignment-triggered briefs still carry the unconditional flip. Co-authored-by: multica-agent <github@multica.ai> * fix(runtime): simplify parent/sub-issue mention rule to always @ parent assignee (MUL-2338) Per Bohan's directive on PR #2918: the per-case mention table (same agent / member / squad / closed parent) is overkill prompt complexity. Replace it with a single rule: always @mention the parent's assignee using the URL that matches assignee_type. The platform's existing run dedup handles re-triggers, and a single rule is easier for agents to follow predictably. Preserves the existing comment-triggered boundary (Step A still does NOT add an unconditional in_review flip on comment-triggered runs). Co-authored-by: multica-agent <github@multica.ai> * refactor(runtime): compress parent/sub-issue protocol to 3-rule convention (MUL-2338) Drop the spec-flavored A/B sub-headings and per-case mention table; keep three numbered rules (close out child, notify parent, pick backlog vs todo) plus a one-line best-effort preamble. The comment-triggered branch still re-asserts the "do not change status unless asked" guardrail and gates parent notification on actually closing out child work; the assignment-triggered branch still flips to `in_review`. Section is now 7 lines instead of 29. A new TestParentSubIssueProtocolIsCompact guards the ≤10-line ceiling so this stays a convention, not a spec. Co-authored-by: multica-agent <github@multica.ai> * fix(runtime): make sub-issue creation rule unconditional in parent/sub-issue protocol (MUL-2338) Elon's review on PR #2918: the preamble previously gated all three rules on the current issue having `parent_issue_id`, but rule 3 (creating sub-issues) needs to reach top-level parents that have no parent themselves — that is exactly where the `todo` vs `backlog` decision matters most. Move the gate from the preamble onto rules 1 and 2 per-rule; rule 3 now applies to any issue-bound run. Section stays at 7 newlines (≤10). Co-authored-by: multica-agent <github@multica.ai> * refactor(runtime): unify parent/sub-issue protocol as mechanism description (MUL-2338) Drop the if/else split between assignment- and comment-triggered runs in the Parent / Sub-issue Protocol section: both runs now read the same two-rule description of how the parent/child mechanism works. The comment-triggered workflow rule "Do NOT change the issue status unless the comment explicitly asks for it" naturally short-circuits the parent notification (no status flip → not closing out the child → skip), so the protocol no longer needs to branch on TriggerCommentID. Tests collapse the two trigger-specific cases into one parameterized test, and the assignment vs comment status-flip invariants are now anchored on the real workflow command (with substituted issue id) instead of the protocol's removed `<this-issue-id>` placeholder. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 16:20:33 +08:00
Angular	1f978bf1ec	feat(autopilot): link created issues to projects (#2908 ) * feat(autopilot): link created issues to projects * test(autopilot): cover project flag	2026-05-20 15:37:23 +08:00
Bohan Jiang	b7082a01f1	fix(issues): retry button targets the row's agent (MUL-2457) (#2921 ) * fix(issues): retry button targets the row's agent, not the assignee (MUL-2457) The execution log retry button used to re-fire the issue's current assignee instead of the agent that actually ran the clicked row. After a reassignment, or for squad workers / @-mention agents, the rerun landed on the wrong agent. POST /api/issues/{id}/rerun now accepts an optional task_id: when set, the rerun targets that task's agent (and reuses its leader/worker role). An empty body keeps the assignee-driven CLI/API contract. The execution-log retry button passes task.id, so per-row retry always fires the correct agent. enqueueMentionTask gained a forceFreshSession parameter so the new mention-path rerun keeps the same fresh-session contract as the assignee path. Co-authored-by: multica-agent <github@multica.ai> * fix(issues): inherit trigger provenance + fix cross-issue test (MUL-2457) Address review feedback on PR #2921: 1. RerunIssue now inherits TriggerCommentID from the source task when sourceTaskID is valid. Without this, a per-row rerun of a comment- or mention-triggered task degrades into a generic issue run because the daemon's buildCommentPrompt path keys on TriggerCommentID. The inherited summary is rebuilt naturally inside the enqueue helpers (buildCommentTriggerSummary derives it from the comment ID). 2. The new cross-issue rejection test inserted a second issue without `number`, hitting uq_issue_workspace_number on a same-workspace collision with the fixture's issue. Both inserts now claim the next available per-workspace number (MAX(number)+1) — matching the pattern used by notification_listeners_test. Added TestRerunIssueInheritsTriggerCommentFromSourceTask to lock the trigger provenance contract. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 15:30:03 +08:00
Angular	314e91fa6d	fix(chat): guard optimistic task message ids (#2901 )	2026-05-20 15:18:42 +08:00
Bohan Jiang	2bec2221d2	feat(agent): per-agent thinking_level for claude + codex (MUL-2339) (#2865 ) * feat(agent): persist thinking_level per agent (MUL-2339) Adds a nullable `thinking_level` column to the `agent` table so the backend can route a runtime-native reasoning/effort token (e.g. Claude's `xhigh`, Codex's `minimal`) through to the agent CLI on every dispatch. The column is intentionally TEXT rather than an enum — Claude and Codex publish overlapping but distinct vocabularies and we want the persisted value to round-trip exactly through whichever CLI receives it. NULL is the "use runtime default" sentinel that every downstream consumer reads as "do not inject --effort / reasoning_effort". This commit is just the storage layer (migration + sqlc); subsequent commits wire it through the API, daemon, and agent backends. Co-authored-by: multica-agent <github@multica.ai> * feat(agent-backend): inject reasoning effort for claude + codex (MUL-2339) Extends ExecOptions with a runtime-native ThinkingLevel string and wires it into the Claude and Codex backends. Discovery is driven by the local CLI so the daemon advertises whatever the host install supports rather than a hand-maintained list that goes stale. Per Elon's PR1 review: - Claude: parses `claude --help` to learn the `--effort` superset and projects through a per-model allow-list (xhigh is Opus-only; max is session-only on the smaller models). Falls back to a conservative static list when the binary is missing or help drift hides the line. - Codex: drives `codex debug models --output json` so per-model reasoning subsets and the documented default come directly from the CLI. The older config-error probe trick is gone — the JSON path is stable and doesn't pollute stderr with an intentional misconfig. - Cache key includes (provider, executablePath, cliVersion) so a CLI upgrade invalidates entries that referenced the older help / catalog. Per Trump's PR1 constraint, all three Codex injection points (thread/start.config, thread/resume.config, turn/start.effort) flow through one helper (`applyCodexReasoningEffort`) so they cannot drift independently. The shared `codexReasoningCases` fixture in `thinking_test.go` asserts the same value→{shape, key} contract at each site for every level the runtimes know about. Claude's `--effort` is also added to `claudeBlockedArgs` so a user custom_args entry can't silently outvote the daemon-injected value. Co-authored-by: multica-agent <github@multica.ai> * feat(api): wire thinking_level through API + daemon contract (MUL-2339) End-to-end plumbing for the per-agent reasoning/effort setting: - AgentResponse / TaskAgentData now carry `thinking_level`; the daemon's claim response includes it and the daemon's executor passes it through to agent.ExecOptions, where the Claude and Codex backends already know what to do with it. - ModelEntry on the runtime-models wire format gains a `thinking` block carrying `supported_levels` + `default_level` per model so the UI can render a runtime-aware picker without the server having to know about the local CLI install. `handleModelList` projects the agent-package catalog (including the new Thinking field) into the wire shape. - CreateAgent / UpdateAgent gate the field with a synchronous provider enum check (claude / codex only today). UpdateAgent is tri-state: field omitted = no change, "" = explicit clear (new `ClearAgentThinkingLevel` query, mirrors the existing mcp_config null pattern), non-empty = validate then set. Per Trump's PR1 review, the API NEVER auto-clears on a runtime/model swap and ALWAYS returns 400 on an unknown literal value — same shape across CreateAgent, UpdateAgent, and combined patches that move runtime + level in one request. Per-model combination failures (e.g. `xhigh` against a model that only supports up to `high`) surface as a daemon-side task error, not a silent server-side rewrite. TS types follow the same shape: `Agent.thinking_level`, `CreateAgentRequest`/`UpdateAgentRequest` add the field, `RuntimeModel` grows a `thinking` block. Older backends omit the field, which the front-end treats as "no picker for this model" — installed desktop builds keep working. Co-authored-by: multica-agent <github@multica.ai> * fix(agent): correct codex debug models argv + pin via runner test (MUL-2339) `codex debug models --output json` is rejected by codex-cli 0.131.0 — the subcommand emits JSON on stdout by default and has no `--output` flag. Drop the flag and add `--bundled` to skip the network refresh discovery doesn't need. Move the argv to a package-level var and add a test that runs a fake `codex` to assert the binary actually receives exactly `debug models --bundled`, so the contract can't silently drift on the next refactor. Also teach ValidateThinkingLevel to resolve an empty model to the provider's default model entry. Without this, every default-model task with a persisted thinking_level would be misjudged "unknown model" by the daemon guard. Co-authored-by: multica-agent <github@multica.ai> * fix(api): reject runtime switch that would leave invalid thinking_level (MUL-2339) A PATCH that changed `runtime_id` without touching `thinking_level` used to silently keep the existing value, so a Claude agent storing `max` could land on a Codex runtime where `max` is not a recognised token at all, and the daemon would receive a literal-invalid level. Hold the same "always 400 on literal-invalid, never silent coerce" rule on this implicit path. When runtime_id changes and the existing value is not in the new provider's enum, return 400 with the recovery options (clear via `thinking_level=""` or re-set in the same PATCH). Add coverage for both the kept-when-still-valid and the rejected cases, plus the two recovery paths (clear and replace). Co-authored-by: multica-agent <github@multica.ai> * fix(daemon): guard runTask with per-model thinking_level validator (MUL-2339) ValidateThinkingLevel existed but had no call site — `task.Agent. ThinkingLevel` flowed straight into ExecOptions, so `xhigh` configured on a non-Opus Claude model, or API-side stale values that escaped the provider enum gate, would be injected anyway. Run the validator before building ExecOptions. Invalid combinations log a warning and drop the level instead of failing the task: the agent still runs, just at the runtime's default reasoning effort. Discovery errors fail open (keep the level, let the CLI surface any objection) so a transient `claude --help` failure can't strand work. Empty model is forwarded as-is; the validator resolves it to the provider's default model internally per the cross-package contract. Co-authored-by: multica-agent <github@multica.ai> * chore(agent): drop stale `--output json` comments + unused scanner (MUL-2339) Codex CLI's `debug models` subcommand emits JSON without an `--output` flag, and `parseCodexDebugModels` never read from the bufio.Scanner. Sync the comments with the actual invocation and remove the dead init. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 12:30:10 +08:00
Jiayuan Zhang	fc8528d64d	feat(autopilot): support assigning to a squad (MUL-2429) (#2888 ) * feat(autopilot): support assigning autopilot to a squad (MUL-2429) Path A (Squad-as-Leader) from the RFC: when an autopilot's assignee is a squad, dispatch resolves to squad.leader_id and executes against the leader's runtime — semantics match a human manually assigning the issue to that squad, no fan-out. Backend scope only; frontend picker change is a follow-up PR. Changes: - 096_autopilot_squad_assignee migration: drop agent FK on autopilot.assignee_id, add assignee_type column (default 'agent'), add autopilot_run.squad_id attribution column. - service.AgentReadiness: single source of truth for archived / runtime-bound / runtime-online checks. Shared by autopilot admission gate, run_only dispatch, and isSquadLeaderReady. - service.resolveAutopilotLeader: translates assignee_type/id to the agent that actually runs the work. - dispatchCreateIssue: stamps issue with assignee_type='squad' for squad autopilots and enqueues via EnqueueTaskForSquadLeader. - dispatchRunOnly: belt-and-braces readiness re-check after resolving squad → leader so a leader that went offline between admission and dispatch produces a clean failure instead of a doomed task. - handler.CreateAutopilot / UpdateAutopilot: accept assignee_type with squad/agent existence + leader-archived validation. Backward-compatible default of "agent" preserves the contract for older clients. - Analytics: AutopilotRunStarted/Completed/Failed events carry assignee_type and squad_id; PostHog can now group autopilot runs by squad without joining back to the autopilot row. Co-authored-by: multica-agent <github@multica.ai> * fix(autopilot): reject archived squads, route post-admission skips, cleanup dangling-agent autopilots (MUL-2429) Addresses three review findings on PR #2888: 1. Archived squad handling: validateAutopilotAssignee now rejects squads with archived_at set; resolveAutopilotLeader returns errSquadArchived so the admission gate fails closed; DeleteSquad now mirrors the issue transfer for autopilot rows (TransferSquadAutopilotsToLeader) so surviving autopilots flip to assignee_type='agent' (leader) instead of dangling at the archived squad. 2. dispatchRunOnly post-admission readiness: introduces errDispatchSkipped sentinel, recognised by DispatchAutopilot via handleDispatchSkip so the run is recorded as `skipped` (not `failed`). Manual triggers no longer 500 when the leader's runtime goes offline between admission and task creation. New TestManualTriggerDoesNotErrorOnPostAdmissionSkip locks the behaviour in. 3. Dangling agent assignee after migration 096 dropped the FK: shouldSkipDispatch now distinguishes pgx.ErrNoRows / errSquadArchived (hard skip — retrying won't help) from transient DB errors (fail-open). DeleteAgentRuntime pauses autopilots that target agents about to be hard-deleted (ListArchivedAgentIDsByRuntime + PauseAutopilotsByAgentAssignees) so the breakage surfaces as a paused row in the UI instead of a quiet skip-burning loop. Unit tests cover the sentinel unwrap contract and errSquadArchived errors.Is behaviour. Integration test TestAutopilotDispatchSkipsWhenRuntimeOffline re-verified against a fresh DB with migration 096 applied. Co-authored-by: multica-agent <github@multica.ai> * fix(autopilot): bump last_run_at on post-admission skip (MUL-2429) Match recordSkippedRun (pre-flight skip) and the success path so the scheduler / "last seen" UI both reflect that this tick evaluated the trigger, even when the post-admission readiness gate caught a late regression. Addresses Emacs review caveat #1 on PR #2888. Co-authored-by: multica-agent <github@multica.ai> * feat(autopilot): mixed agent/squad assignee picker in dialog (MUL-2429) End-to-end UI for assigning an autopilot to a squad. Closes the PR #2888 backend gap: the squad-as-assignee feature was already wired in Go (Path A, RFC §4) but the desktop dialog never offered the choice. - core/types/autopilot: add `AutopilotAssigneeType`, surface `assignee_type` on `Autopilot` + Create/Update request payloads. - views/autopilots/pickers/agent-picker: switch to a polymorphic AssigneeSelection (`{type, id}`); render agents and squads as two grouped sections with shared pinyin search. - views/autopilots/autopilot-dialog: maintain `assigneeType` state, send it on create/update, render the trigger avatar / hover dot with `assignee.type`. - views/autopilots/autopilots-page + autopilot-detail-page: render the assignee row using `autopilot.assignee_type` so squad-typed autopilots show the squad avatar + name, not a broken agent lookup. - locales: add `agents_group` / `squads_group` / `select_assignee` keys (en + zh-Hans), keep legacy `select_agent` for callers that still reference it. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: Lambda <lambda@multica.ai> Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 05:30:13 +02:00
Jiayuan Zhang	e48f6a84d6	feat(github): expose read-only installation list to workspace members (MUL-2413) (#2886 ) * feat(github): expose read-only installation list to workspace members (MUL-2413) Relax `GET /api/workspaces/{id}/github/installations` from owner/admin-only to any workspace member so the Settings → Integrations tab no longer renders blank for non-admins (the original symptom of MUL-2413). The handler now reads the caller's role from the workspace middleware: - owner / admin keep the full row including the numeric `installation_id` (the connect / disconnect handle) and receive `can_manage: true`. - every other role (member / guest) receives rows with `installation_id` omitted and `can_manage: false`, giving them visibility into "is GitHub wired up?" without the management handle. `GET /github/connect` and `DELETE /github/installations/{id}` stay under the admin/owner middleware group — this PR only relaxes the read path. Tests: `TestListGitHubInstallations_RoleGating` exercises admin, owner, member, and guest paths against the real DB-backed handler fixture and asserts the field stripping + `can_manage` contract. Refs: MUL-2413 Co-authored-by: multica-agent <github@multica.ai> * fix(github): redact installation_id from realtime broadcasts (MUL-2413) GET /github/installations strips the numeric installation_id for non-admin members, but the github_installation:created / uninstall / suspend WS events were still publishing it, so the same handle was reachable from any workspace client subscribed to the workspace scope. Broadcast both payload variants without it — the frontend uses these events only to invalidate the installations query, so admins re-query the list endpoint to recover the management handle. Also adds a router-level test that mounts the production middleware split (member-visible list vs. owner/admin connect+delete) so a future routing change can't silently widen the write surface. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: Lambda <lambda@multica.ai> Co-authored-by: multica-agent <github@multica.ai>	2026-05-20 04:17:45 +02:00
Jiayuan Zhang	2ad1cd8ff8	feat(profile): user profile description injected into agent brief (MUL-2406) ## Summary Adds per-user `profile_description` so coding agents have cheap, durable context about who is asking. v1 per the brief Xeon locked in on [MUL-2406](mention://issue/63a7247c-4f6a-42cf-90d1-7c746e77158a): - DB — `user.profile_description TEXT NOT NULL DEFAULT ''` (migration 096). 2000-rune cap enforced server-side. No nullable / privacy state to manage. - API — `PATCH /api/me` accepts the field; `UserResponse` always emits it. Client wraps `updateMe` in a lenient `UserSchema` + `EMPTY_USER` fallback per CLAUDE.md API Response Compatibility. - UI — Settings → Account gains an "About you" textarea with live `n/2000` counter, `maxLength` guard, and a localized too-long error (EN + zh-Hans). - CLI — `multica user profile get` / `multica user profile update` with `--description / --description-stdin / --description-file / --clear`, mirroring the existing `issue comment add` input-mode menu. - Daemon injection — claim handler resolves the runtime owner and stamps `requesting_user_name` + `requesting_user_profile_description` on the task. `buildMetaSkillContent` emits `## Requesting User` between `## Agent Identity` and `## Available Commands`, blockquoted and framed as background context. The block is omitted entirely when the description is empty (no token cost when unused). Brief is written once per task via `CLAUDE.md` / `AGENTS.md`, not the per-turn prompt — same path the agent already reads for identity, so no extra per-turn cost. ## Test plan - [x] `go build ./...`, `go vet ./...`, `go test ./internal/cli/ ./internal/daemon/ ./internal/daemon/execenv/ ./cmd/multica/` - [x] New brief tests: `TestBuildMetaSkillContentEmitsRequestingUser`, `TestBuildMetaSkillContentOmitsRequestingUserWhenEmpty` - [x] `pnpm typecheck`, `pnpm lint`, `pnpm test` (74 files, 644 tests pass) - [ ] Handler DB tests (`TestUpdateMe*`) require a migrated test DB — not runnable in this sandbox - [ ] Manual: open Settings → Account, set a description, confirm the next daemon-run agent's `CLAUDE.md` shows `## Requesting User`	2026-05-19 19:51:28 +02:00
Jiayuan Zhang	591e47842d	refactor(onboarding): remove starter-content kit; unify install-runtime issue across mark-onboarded paths (MUL-2438) (#2884 ) * refactor(onboarding): remove starter-content kit, unify install-runtime issue across mark-onboarded paths (MUL-2438) Drops the post-onboarding ImportStarterContent / DismissStarterContent flow (handler + routes + StarterContentPrompt + templates + locale strings + analytics event). The bug — web onboarding seeding 6+ starter issues without a runtime — only existed through that path; with it gone the source disappears. The "install a runtime" issue from BootstrapOnboardingNoRuntime is now the canonical no-runtime onboarding seed. The title/description and a LockAndFindActiveDuplicate-deduped seeder move to handler/no_runtime_issue.go, and CompleteOnboarding / CreateWorkspace / AcceptInvitation seed it whenever the workspace has no runtime yet, so every mark-onboarded entry point lands the user on a concrete next step. starter_content_state column is kept and continues to be claimed as 'imported' in all five entry points so older desktop builds (which still render the legacy dialog on NULL) don't surface it to accounts created after this change. Co-authored-by: multica-agent <github@multica.ai> * fix(onboarding): backfill starter_content_state for in-window NULL users (MUL-2438) 054 only covered pre-feature users. Anyone onboarded between then and the starter-content kit removal could still sit at NULL, and old desktop clients gate the legacy StarterContentPrompt on `starter_content_state IS NULL`. The import/dismiss routes are gone, so leaving these rows NULL would surface a dialog whose buttons 404. Mark them 'imported' to match the new helper's claim semantics. Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: Lambda <lambda@multica.ai> Co-authored-by: multica-agent <github@multica.ai>	2026-05-19 18:37:48 +02:00
Jiayuan Zhang	cd37b4e3d6	feat(settings): consolidate GitHub options under a dedicated Settings tab (MUL-2414)	2026-05-19 17:23:30 +02:00
Bohan Jiang	f120e0ef43	refactor(cli): tidy workspace subtree (MUL-2386) (#2866 ) - Drop `workspace current`; `workspace get` (no args) already prints the current default workspace, so the two were doing the same thing. - Rename `workspace members` to `workspace member list` to free up the `member` namespace for future `add` / `remove` subcommands and align with the rest of the CLI's `<resource> <verb>` shape. - Add `--full-id` to `workspace list`, matching `project list`, `autopilot list`, and friends. Docs and the daemon prompt are updated to match. Co-authored-by: multica-agent <github@multica.ai>	2026-05-19 17:54:21 +08:00
Joey Frasier (Boothe)	76cd8275ff	fix(openclaw): parse whole buffer instead of line-by-line scanner (MUL-1908) (#2292 ) * fix(openclaw): parse whole buffer instead of line-by-line scanner Follow-up to `c87d7676` (WOR-10). The stdout/stderr swap fixed the dominant case but `processOutput` still scanned line-by-line and only attempted a whole-buffer parse from a fragile fallback path. Pretty-printed JSON (openclaw 2026.5.x emits the result blob indented across many lines) made every individual line unparseable on its own — `{`, ` "payloads": [`, ` {`, etc. — so the success path hinged entirely on the fallback joining `rawLines` and re-trying. Under load (daemon restarts racing the close-on-cancel goroutine, partial chunked reads when stdout closes mid-flight) the line scanner could see truncated input that never reassembled into valid JSON, surfacing "openclaw returned no parseable output" against runs where the agent had in fact completed the work and posted comments. Roughly 30–40% of recent runs in v0.2.27 logs hit this path; multica still wrote a `task_failed` inbox row for each one even though the underlying issue had moved to `in_review` or `done`. The fix: - processOutput now reads the full stdout buffer with `io.ReadAll` first. - A new `parseWholeBufferOpenclawResult` helper attempts a single `json.Unmarshal` against the entire buffer (after trimming, and after optionally stripping leading non-JSON log lines). When it matches, we build the result and return — the line scanner never runs. - If the whole-buffer parse fails, we fall through to the existing NDJSON line-by-line scanner. This preserves streaming-event support (kept for forward compatibility and other backends) without leaving openclaw's dominant pretty-printed shape at the mercy of timing. - The failure path now emits a `(got N bytes; preview: ...)` suffix on the canonical "no parseable output" error so future debugging isn't blind. The exact canonical phrase is preserved for empty buffers so existing dashboards / log-grep tooling keep matching. Tests: - TestOpenclawProcessOutputWholeBufferPrettyJSON: feeds a hand-crafted multi-line indented blob (multiple payloads, nested agentMeta, usage map) and asserts every field round-trips through the whole-buffer fast path. - TestOpenclawProcessOutputDeeplyIndentedFixture: re-runs the recorded openclaw 2026.5.5 stdout fixture (1070 lines) directly through parseWholeBufferOpenclawResult, asserting the bug-shape parses cleanly on the first attempt without falling through to NDJSON scanning. - TestOpenclawProcessOutputEmptyBufferErrorIncludesByteCount: tightens the empty-buffer failure path, asserts the canonical phrase survives so observability tooling keeps working. All existing tests in the openclaw + buildOpenclawArgs suites stay green (streaming NDJSON event tests, lifecycle tests, structured-error tests, usage-field-variant tests). The two pre-existing flaky timeout-tight codex tests (TestCodexExecuteSemanticInactivityAllowsContinuous) fail on both this branch and on `c87d7676` baseline; they are unrelated and out of scope here. Co-authored-by: multica-agent <github@multica.ai> fix(openclaw): drop dead preview branch, document streaming regression Rebase + review-fix follow-up on top of f27df2d9b. processOutput's preview branch was unreachable: openclawNoParseableOutputError was only called from the `!gotEvents && trimmed == ""` path, which by construction means the entire scanned buffer collapsed to whitespace, so the `(got N bytes; preview: ...)` formatter could never fire on a non-empty buffer. Replace the helper with a single canonical-string constant (callsite is now inline) and update the test name to match what it actually asserts (the canonical empty-buffer error string is preserved for external log-grep / dashboard consumers). Also document on processOutput that the line-scanner path is no longer truly streaming after the io.ReadAll switch: events accumulate until stdout closes. OpenClaw 2026.5.x does not emit streaming events so this regression is invisible today, but flag it for the next backend that might. Misc: switch the scanner's input source from `strings.NewReader(string(buf))` to `bytes.NewReader(buf)` to drop one unnecessary byte/string round-trip. MUL-1908 Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai> Co-authored-by: J (Multica agent) <j@multica.local>	2026-05-19 17:42:41 +08:00
Bohan Jiang	54368fd826	feat(projects): scheduled-only Gantt data source + WS reactivity (MUL-1881) (#2856 ) * feat(projects): scheduled-only Gantt data source + WS reactivity (MUL-1881) Project Gantt now fetches its own scheduled-only data instead of riding the Board/List pagination cache. The Unscheduled drawer and pagination warning banner are gone, and any WS-driven issue change (create / update / delete) invalidates the new cache so the timeline stays live. - Backend: `GET /api/issues?scheduled=true` adds an `(i.start_date IS NOT NULL OR i.due_date IS NOT NULL)` predicate on both ListIssues and CountIssues. New SQL filter is plumbed through sqlc + handler. - Frontend: new `projectGanttIssuesOptions(wsId, projectId)` issues a single fetch and lives under its own cache key. WS handlers and mutations invalidate the prefix on create/update/delete so the bar reacts to start_date / due_date changes from other tabs and from this tab without waiting on the WS round-trip. - GanttView: drops the Unscheduled section, the pagination warning banner, and the load-all button; renders only scheduled rows. - Removes now-dead `useLoadAllRemaining`, `myIssueListPaginationOptions`, `summarizeIssueListPagination`, and the gantt locale strings that supported the old plumbing. Co-authored-by: multica-agent <github@multica.ai> * fix(projects): page through Gantt fetch and isolate per-view data sources - Walk paginated `scheduled=true` issues until total is reached so projects with more than 500 scheduled bars no longer silently truncate. - Gantt mode disables the bucketed Board/List query and reads its own scheduled cache for the project empty-state check, so the page never short-circuits Gantt with a Board-derived "no issues" CTA. - `onIssueLabelsChanged` patches matching rows in the Project Gantt cache in-place, keeping label filters consistent after attach/detach from other tabs or agents. MUL-1881 Co-authored-by: multica-agent <github@multica.ai> --------- Co-authored-by: multica-agent <github@multica.ai>	2026-05-19 17:04:16 +08:00
Kagura	59617f376e	feat(auth): make auth token TTL configurable via AUTH_TOKEN_TTL env var (MUL-2371) (#2713 ) * feat(auth): make auth token TTL configurable via AUTH_TOKEN_TTL env var Add AUTH_TOKEN_TTL environment variable (in seconds) to override the hardcoded 30-day auth token lifetime. Self-hosted deployments on trusted networks can set a longer value to avoid frequent magic-link re-authentication. The value is read once at startup and cached. Invalid or missing values fall back to the 30-day default with a warning log. Closes #2685 * refactor(auth): extract parseAuthTokenTTL for testability Address review feedback: extract pure parse function from sync.Once wrapper so the parsing logic can be unit-tested independently. Add TestParseAuthTokenTTL with table-driven cases. Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com> * refactor(auth): accept Go duration strings + hoist shared TTL in SetAuthCookies Address nice-to-have review feedback from Bohan-J: - parseAuthTokenTTL now tries time.ParseDuration first (e.g. '8760h'), falling back to ParseInt for integer seconds - Warn on unreasonable values (>10 years) but still accept them - Hoist AuthTokenTTL() and time.Now() in SetAuthCookies so both cookies share the exact same expiry - Add security trade-off note in .env.example - Add 5 new test cases for duration strings Co-Authored-By: Claude Opus 4 (1M context) <noreply@anthropic.com> Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> * fix: use AuthTokenTTL() in CloudFront middleware, guard ParseInt overflow Address review feedback from Bohan-J (round 2): 1. CloudFront refresh middleware (cloudfront.go:21) was hardcoding 3024time.Hour instead of using auth.AuthTokenTTL(). Now calls AuthTokenTTL() so the middleware respects AUTH_TOKEN_TTL env var. 2. parseAuthTokenTTL integer-seconds branch: very large values like 9999999999 would silently overflow int64 when multiplied by time.Second. Added overflow guard comparing against math.MaxInt64/int64(time.Second) before the multiplication. 3. Updated AuthTokenTTL() doc comment to reflect that it accepts Go duration strings or integer seconds (not just seconds). 4. Added middleware test (cloudfront_test.go) verifying short AUTH_TOKEN_TTL produces short cookie expiry, not 30-day hardcode. Also covers nil signer and existing-cookie-skip cases. 5. Added integer overflow test case to cookie_test.go. * style: run gofmt on cookie.go and cookie_test.go --------- Signed-off-by: kagura-agent <kagura.agent.ai@gmail.com> Co-authored-by: Claude Opus 4 (1M context) <noreply@anthropic.com>	2026-05-19 16:22:07 +08:00

1 2 3 4 5 ...

870 Commits