multica/server/internal/handler at 3466bbc1961749d2d04a4bdbbf73e4ea8aab2a93

highperfocused/multica

mirror of https://github.com/multica-ai/multica.git synced 2026-07-05 13:29:44 +02:00

Jiang Bohan 3466bbc196 fix(skills): atomic Redis claim + surface store write failures (PR #1557 review)

Two real gaps GPT-Boy flagged:

1. RedisLocalSkill{List,Import}Store.PopPending was doing ZREM then SET as
   two separate round-trips. If the SET failed for any reason — transient
   Redis error, context cancellation, pod getting SIGKILL'd mid-call — the
   request was already gone from the pending zset but the stored record
   still said "pending", and no subsequent PopPending would re-dispatch
   it. Exactly the "request disappears" class of bug this PR is supposed
   to kill.

   Fix: push the claim into a Lua script so Redis runs ZREM + SET as one
   atomic unit. If ZREM returns 0 (another node won the race), SET is
   skipped and the caller retries.

2. ReportLocalSkill{List,Import}Result handlers were logging Complete/Fail
   store failures at Warn and still returning 200 OK. That made the
   daemon think the report landed when it hadn't, leaving the request
   stuck in "running" until the server-side timeout and — worse for the
   import flow — leaving the just-created Skill row orphaned in Postgres
   so every retry collided with the unique-name constraint.

   Fix: escalate to Error + return 500 so the daemon (and monitoring) can
   see the write failed. For the import flow, a Complete failure after the
   Skill row has been committed also triggers a best-effort DeleteSkill
   so a daemon retry lands on a clean slate instead of hitting
   "a skill with this name already exists" forever.

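The handler change in item 2 can be sketched roughly as below. This is a minimal illustration, not the repository's code: `importDeps`, `finishImport`, `reportImportResult`, `errMissingID`, and the query-parameter names are all hypothetical, and the real handlers may read their inputs and talk to the store differently.

```go
package main

import (
	"errors"
	"log"
	"net/http"
)

// importDeps bundles the two side effects the handler needs. Both fields are
// hypothetical stand-ins for the real store and database calls.
type importDeps struct {
	Complete    func(requestID string) error // marks the request completed in the store
	DeleteSkill func(skillID string) error   // removes the already-committed Skill row
}

// errMissingID is returned when the report carries no request ID.
var errMissingID = errors.New("missing request id")

// finishImport records the result and returns the HTTP status to send.
// On a store write failure it logs at error level, attempts a best-effort
// delete of the Skill row, and reports 500 instead of 200.
func finishImport(deps importDeps, requestID, skillID string) int {
	if requestID == "" {
		log.Printf("ERROR report import result: %v", errMissingID)
		return http.StatusBadRequest
	}
	if err := deps.Complete(requestID); err != nil {
		log.Printf("ERROR complete import request %s: %v", requestID, err)
		// Best-effort cleanup: a failure here is logged but does not change
		// the response, which is a 500 either way.
		if delErr := deps.DeleteSkill(skillID); delErr != nil {
			log.Printf("ERROR cleanup skill %s: %v", skillID, delErr)
		}
		return http.StatusInternalServerError
	}
	return http.StatusOK
}

// reportImportResult adapts finishImport to net/http. Reading the IDs from
// query parameters is an assumption made to keep the sketch short.
func reportImportResult(deps importDeps) http.HandlerFunc {
	return func(w http.ResponseWriter, r *http.Request) {
		q := r.URL.Query()
		w.WriteHeader(finishImport(deps, q.Get("request_id"), q.Get("skill_id")))
	}
}
```

Keeping the status decision in a plain function makes the 500 path easy to exercise with fake `Complete` and `DeleteSkill` closures, without standing up an HTTP server.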
Tests
- New TestRedisLocalSkillListStore_PopPendingAtomicClaim asserts the
  happy-path invariant: after one PopPending the record is "running"
  AND a second PopPending returns nothing. Deliberately does NOT poke
  Redis internals directly so the test survives any future key-layout
  refactor.
- Existing cross-instance / concurrent / timeout / per-runtime tests
  continue to pass against the Lua-based claim path (verified locally
  against a scratch redis-server; 8/8 Redis tests green).
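
The claim path and the invariant the new test asserts can be sketched as follows. This is an illustration under stated assumptions, not the repository's implementation: the Lua text in `claimScript` guesses at a key layout (`KEYS[1]` as the pending zset, `KEYS[2]` as the record key, `ARGV[2]` as the updated record), and `memStore` is a hypothetical in-memory stand-in whose mutex plays the role of Redis running the script as one atomic unit.

```go
package main

import "sync"

// claimScript sketches the atomic claim: SET only runs if ZREM actually
// removed the member. Key layout and argument order are assumptions.
const claimScript = `
if redis.call("ZREM", KEYS[1], ARGV[1]) == 0 then
  return 0
end
redis.call("SET", KEYS[2], ARGV[2])
return 1
`

// memStore is a hypothetical in-memory model of the store. Holding mu for
// the whole of PopPending mirrors the script's all-or-nothing behaviour.
type memStore struct {
	mu      sync.Mutex
	pending []string          // request IDs awaiting dispatch, oldest first
	status  map[string]string // request ID -> "pending" or "running"
}

func newMemStore() *memStore {
	return &memStore{status: map[string]string{}}
}

// Enqueue registers a request as pending.
func (s *memStore) Enqueue(id string) {
	s.mu.Lock()
	defer s.mu.Unlock()
	s.pending = append(s.pending, id)
	s.status[id] = "pending"
}

// PopPending removes the oldest pending request and marks it running in a
// single critical section, so a request can never leave the queue while its
// record still says "pending".
func (s *memStore) PopPending() (string, bool) {
	s.mu.Lock()
	defer s.mu.Unlock()
	if len(s.pending) == 0 {
		return "", false
	}
	id := s.pending[0]
	s.pending = s.pending[1:]
	s.status[id] = "running"
	return id, true
}

// Status reports the recorded state of a request.
func (s *memStore) Status(id string) string {
	s.mu.Lock()
	defer s.mu.Unlock()
	return s.status[id]
}
```

Against this model the happy-path invariant reads: after `Enqueue("req-1")`, the first `PopPending` returns `"req-1"` with `Status` equal to `"running"`, and a second `PopPending` returns nothing.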

2026-04-23 16:57:55 +08:00

activity_test.go

…

activity.go

…

agent_test.go

…

agent.go

feat(agents): surface task source on AgentTaskResponse + use it in Tasks tab (#1455)

2026-04-22 19:26:57 +08:00

auth_signup_test.go

feat(analytics): full PostHog pipeline + 6 funnel events (MUL-1122) (#1367)

2026-04-21 14:42:52 +08:00

auth.go

feat(onboarding): redesigned flow + post-landing starter content opt-in (#1411)

2026-04-21 20:32:33 +08:00

autopilot.go

…

chat.go

feat(realtime): phase 0: extract Broadcaster interface + add metrics (MUL-1138) (#1429)

2026-04-23 13:36:55 +08:00

comment.go

fix(server/comment): remove HTML sanitizer that was corrupting Markdown (#1387) (#1436)

2026-04-21 15:40:30 +08:00

config_test.go

feat(selfhost): ship public GHCR deployment flow

2026-04-22 16:58:42 +08:00

config.go

docs(handler): note that GetConfig is public-only and what may be returned (#1538)

2026-04-23 01:51:59 +08:00

daemon_test.go

fix(server/task): synthesize result comment for comment-triggered tasks too (#1440)

2026-04-21 16:09:59 +08:00

daemon.go

fix(skills): shared-state runtime local-skill stores (MUL-1288)

2026-04-23 16:06:24 +08:00

feedback_test.go

feat(feedback): in-app feedback flow + Help launcher (#1546)

2026-04-23 10:35:55 +08:00

feedback.go

feat(feedback): in-app feedback flow + Help launcher (#1546)

2026-04-23 10:35:55 +08:00

file_test.go

…

file.go

…

handler_test.go

feat(analytics): full PostHog pipeline + 6 funnel events (MUL-1122) (#1367)

2026-04-21 14:42:52 +08:00

handler.go

fix(skills): shared-state runtime local-skill stores (MUL-1288)

2026-04-23 16:06:24 +08:00

inbox.go

…

invitation.go

feat(analytics): full PostHog pipeline + 6 funnel events (MUL-1122) (#1367)

2026-04-21 14:42:52 +08:00

issue_reaction.go

…

issue.go

…

onboarding_test.go

feat(onboarding): redesigned flow + post-landing starter content opt-in (#1411)

2026-04-21 20:32:33 +08:00

onboarding.go

feat(analytics): instrument onboarding funnel (MUL-1250) (#1489)

2026-04-22 16:28:08 +08:00

personal_access_token.go

…

pin.go

refactor(pin): drop server-side enrichment, derive sidebar fields client-side (#1484)

2026-04-22 15:08:16 +08:00

project.go

…

reaction.go

…

runtime_local_skills_redis_store_test.go

fix(skills): atomic Redis claim + surface store write failures (PR #1557 review)

2026-04-23 16:57:55 +08:00

runtime_local_skills_redis_store.go

fix(skills): atomic Redis claim + surface store write failures (PR #1557 review)

2026-04-23 16:57:55 +08:00

runtime_local_skills_test.go

fix(skills): shared-state runtime local-skill stores (MUL-1288)

2026-04-23 16:06:24 +08:00

runtime_local_skills.go

fix(skills): atomic Redis claim + surface store write failures (PR #1557 review)

2026-04-23 16:57:55 +08:00

runtime_models_test.go

…

runtime_models.go

…

runtime_ping.go

…

runtime_test.go

…

runtime_update.go

…

runtime.go

…

search_test.go

…

skill_create.go

feat(skills): import runtime local skills into workspace (#1431)

2026-04-22 13:16:51 +08:00

skill_test.go

…

skill.go

feat(skills): import runtime local skills into workspace (#1431)

2026-04-22 13:16:51 +08:00

subscriber_test.go

…

subscriber.go

…

task_lifecycle.go

feat(server): orphan-task recovery + auto-retry + manual rerun (MUL-1128) (#1476)

2026-04-22 13:08:37 +08:00

trigger_test.go

…

usage_test.go

…

workspace_reserved_slugs.go

feat(slugs): reserve homepage + expand reserved slug list (MUL-961) (#1483)

2026-04-22 15:08:06 +08:00

workspace_test.go

…

workspace.go

feat(analytics): full PostHog pipeline + 6 funnel events (MUL-1122) (#1367)

2026-04-21 14:42:52 +08:00