Shimo
|
a2e465c74d
|
feat(skills): add skill-comply — automated behavioral compliance measurement (#724)
* feat(skills): add skill-comply — automated behavioral compliance measurement
Automated compliance measurement for skills, rules, and agent definitions.
Generates behavioral specs, runs scenarios at 3 strictness levels,
classifies tool calls via LLM, and produces self-contained reports.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix(skill-comply): address bot review feedback
- AGENTS.md: fix stale skill count (115 → 117) in project structure
- run.py: replace remaining print() with logger, add zero-division guard,
create parent dirs for --output path
- runner.py: add returncode check for claude subprocess, clarify
relative_to path traversal validation
- parser.py: use is_file() instead of exists(), catch KeyError for
missing trace fields, add file check in parse_spec
- classifier.py: log warnings on malformed classification output,
guard against non-dict JSON responses
- grader.py: filter negative indices from LLM classification
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
|
2026-03-22 21:51:49 -07:00 |
|