From 02d754ba675f15e1a56d76b62c97d820e615fecc Mon Sep 17 00:00:00 2001 From: Tatsuya Shimomoto Date: Sun, 8 Mar 2026 21:02:14 +0900 Subject: [PATCH] fix: use general-purpose agent instead of Explore for skill-stocktake evaluation MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit The Explore agent is a "Fast agent" optimized for codebase exploration, not deep reasoning. The skill-stocktake V4 design requires holistic AI judgment (actionability, scope fit, uniqueness, currency) which needs the full reasoning capability of the conversation's main model. Additionally, the Agent tool has no `model` parameter — specifying `model: opus` was silently ignored, causing the evaluation to run on the lightweight Explore model. This resulted in all skills receiving "Keep" verdicts without genuine critical analysis. Changing to `general-purpose` agent ensures evaluation runs on the conversation's main model (e.g., Opus 4.6), enabling the holistic judgment that V4 was designed for. --- skills/skill-stocktake/SKILL.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/skills/skill-stocktake/SKILL.md b/skills/skill-stocktake/SKILL.md index 03bc497b..9d86b4db 100644 --- a/skills/skill-stocktake/SKILL.md +++ b/skills/skill-stocktake/SKILL.md @@ -74,7 +74,7 @@ Scanning: ### Phase 2 — Quality Evaluation -Launch a Task tool subagent (**Explore agent, model: opus**) with the full inventory and checklist. +Launch an Agent tool subagent (**general-purpose agent**) with the full inventory and checklist. The subagent reads each skill, applies the checklist, and returns per-skill JSON: `{ "verdict": "Keep"|"Improve"|"Update"|"Retire"|"Merge into [X]", "reason": "..." }`