Judges agree, so the score is reliable. The "spread" is the standard deviation of the three judges' average scores: a small spread means they agree, a large spread means take the score with a grain of salt.
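As a rough illustration of how such a spread could be computed, here is a minimal Python sketch. The function name `judge_spread` and the sample scores are hypothetical, and the use of the population standard deviation (rather than the sample one) is an assumption, since the page does not say which variant it uses.

```python
from statistics import pstdev


def judge_spread(judge_averages):
    """Return the standard deviation of the per-judge average scores.

    Assumption: population standard deviation (pstdev). The page does not
    state whether it uses the population or the sample formula.
    """
    if len(judge_averages) < 2:
        raise ValueError("need at least two judges to measure spread")
    return pstdev(judge_averages)


# Hypothetical per-judge averages, not taken from the page.
close = judge_spread([7.0, 7.5, 7.2])  # judges roughly agree
far = judge_spread([3.0, 8.0, 6.0])    # judges disagree

print(f"close agreement spread: {close:.2f}")
print(f"wide disagreement spread: {far:.2f}")
```

With only three judges the two standard-deviation variants differ noticeably, so a threshold tuned for one would not carry over to the other.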
Free SKILL.md scraped from GitHub. Clone the repo or copy the file directly into your Claude Code skills directory.
npx versuz@latest install electron-chromium-upgrade
git clone https://github.com/electron/electron.git
cp electron/.claude/skills/electron-chromium-upgrade/SKILL.md ~/.claude/skills/electron-chromium-upgrade/SKILL.md

“Output is well-formed JSON matching the requested structure (headers as keys, rows as objects), but provides only a minimal single-row example with no implementation details, edge-case handling, or evidence of robust parsing. RULE E applies: well-formed but lacks specifics needed for real-world use. A developer would need to write the actual parser themselves.”
“Output follows the structure but lacks specific evidence (HTTP status codes or redirects) as requested, reducing correctness and usefulness.”
“The output returns all required fields with an absolute image URL and fallbacks, but lacks provenance/confidence and explicit fallback precedence; minor naming/validation gaps reduce completeness and usefulness.”