ShelldarylmcdFree

surface-audit

One-pass audit of the Roslyn MCP server's live surface (tools / resources / prompts / shipped skills) against documentation count claims. Use when preparing a release, chasing doc drift, or answering "how many tools does this server have?" without burning a dozen greps. Calls server_info, globs skills/*/SKILL.md, greps README / CHANGELOG / ai_docs / docs for numeric surface claims (e.g. "X tools", "Y prompts", "Z skills"), and reports drift as a compact table. Read-only; never edits docs.

Repo bundle on Versuzdarylmcd/Roslyn-Backed-MCP47 indexed entries (SKILL.md and CLAUDE.md) from this repository — open the full bundle view.

Open bundle →

View on GitHub ↗</>github.com/darylmcd/Roslyn-Backed-MCP Yours? Claim it ↗

§ 01 — Stats

Prior1090

Quality—

Score—

Tasks—

§ 02 — Install

Get surface-audit.

Free SKILL.md scraped from GitHub. Clone the repo or copy the file directly into your Claude Code skills directory.

One-line install · Claude Code

$npx versuz@latest install darylmcd-roslyn-backed-mcp-claude-skills-surface-audit

Or clone the repo

$git clone https://github.com/darylmcd/Roslyn-Backed-MCP.git

Or copy the SKILL.md manually

$cp Roslyn-Backed-MCP/SKILL.MD ~/.claude/skills/darylmcd-roslyn-backed-mcp-claude-skills-surface-audit/SKILL.md

More Versuz picks

★ Featured$1.99

vz-bench-debug

Document

★ Featured$0.99

vz-scrape-runner

Web

Got something better ?Submit your skill — it enters tomorrow's cycle. No fee.

Submit yours →

§ 05 — Challenge

Think you can beat it?

$npx versuz challenge darylmcd-roslyn-backed-mcp-claude-skills-surface-audit↵

Show SKILL.md content (~1.8k tokens)

---
name: surface-audit
description: One-pass audit of the Roslyn MCP server's live surface (tools / resources / prompts / shipped skills) against documentation count claims. Use when preparing a release, chasing doc drift, or answering "how many tools does this server have?" without burning a dozen greps. Calls server_info, globs skills/*/SKILL.md, greps README / CHANGELOG / ai_docs / docs for numeric surface claims (e.g. "X tools", "Y prompts", "Z skills"), and reports drift as a compact table. Read-only; never edits docs.
---

# surface-audit

Cross-check the live server surface against every hard-coded count in the repo's documentation.

## Motivation

The 2026-04-10→04-24 retro showed multiple self-audit sessions spending significant grep budget verifying doc claims like "10 skills", "66 stable + 60 experimental = 126 tools", "123 tools". This skill does that walk in one pass so agents get an authoritative drift report without polluting the transcript.

## Steps

### 1. Collect live surface

Call `mcp__roslyn__server_info`. Record from the response:
- `surface.tools.stable`, `surface.tools.experimental`, `surface.registered.tools`
- `surface.resources.stable`, `surface.resources.experimental`, `surface.registered.resources`
- `surface.prompts.stable`, `surface.prompts.experimental`, `surface.registered.prompts`
- `version`, `catalogVersion`, `surface.registered.parityOk`

### 2. Count shipped skills on disk

`Glob skills/*/SKILL.md` — the count is the live skill count. Record as `shippedSkills`.

### 3. Count maintainer-only skills

`Glob .claude/skills/*/SKILL.md` — these are the repo-local override skills (not shipped). Record as `maintainerSkills`. Report the totals but DO NOT compare against shipped-skill count claims (they're separate surfaces).

### 4. Sweep docs for numeric surface claims

Grep for patterns that match surface count claims. Regex list (case-insensitive):

```
[0-9]+\s*(stable|experimental)?\s*(tools?|resources?|prompts?|skills?)
```

Scope the grep to:
- `README.md`
- `docs/**/*.md`
- `ai_docs/**/*.md` (skip `ai_docs/archive/**` and `ai_docs/reports/**` — historical snapshots, drift expected)
- `CHANGELOG.md` — ONLY the `## [Unreleased]` section; shipped version sections are frozen history
- `.claude-plugin/plugin.json` + `.claude-plugin/marketplace.json` — description strings

For each hit, capture: file, line, the exact claim text, the claimed count.

### 5. Compute drift

Produce a table:

| Claim source | Claim text (excerpt) | Claimed | Live | Drift |
|---|---|---|---|---|
| `docs/setup.md:42` | "107 stable tools and 54 experimental" | 107 / 54 | 107 / 54 | ok |
| `README.md:18` | "over 150 Roslyn-powered tools" | 150 | 161 | ok (+11 live, claim is conservative) |
| `ai_docs/references/tool-usage.md:9` | "...10 shipped skills..." | 10 | 12 | **drift +2** |

Drift categories:
- `ok` — claim exactly matches or is a conservative lower bound (e.g. "over N" where live > N)
- `drift +N` / `drift -N` — claim is stale; report the delta
- `unverifiable` — claim is about a category we can't measure from `server_info` or globs (e.g. "dozens of diagnostics"). Flag but don't count as drift.

### 6. Report

Output order:

1. **Live surface summary** — 4-line block: version, tools, resources, prompts, skills (shipped / maintainer).
2. **Drift table** — only rows where drift ≠ ok. If clean, say "no drift found across N scanned files."
3. **Unverifiable claims** — separate table, for human review.
4. **Skills audit table** — see "Skills audit" section below; one row per discovered SKILL.md.
5. **Suggested next steps** — a 1-3 line call-out naming the files to edit (but DO NOT edit them; this skill is read-only).

## Skills audit

Skills are shipped (and maintainer-local) product surface that compose MCP tools into guided workflows. A broken tool reference in any SKILL.md breaks every plugin user. This static audit lane verifies each discovered skill against the live catalog. (Formerly Phase 16b of the `/mcp-server-stress` audit prompt; relocated here because it is a static-catalog check, not a server-execution check.)

### Discover live skills

The discovery surface is the union of two filesystem trees — both must be walked, and the live directories are the source of truth:

- `Glob skills/*/SKILL.md` — shipped skills (plugin consumers see these).
- `Glob .claude/skills/*/SKILL.md` — maintainer-local skills (this repo only; not shipped).

Do NOT rely on a hand-maintained skill list — it drifts.

### Per-skill verification (for each glob result)

1. **Frontmatter parity.** `name` matches the directory name; `description` is non-empty and accurate.
2. **Tool-reference resolution.** Extract every `mcp__roslyn__<tool>` reference from the SKILL.md body and any `prompts/*.md` siblings. Cross-check each against the live catalog from Step 1's `server_info` / `roslyn://server/catalog` capture. Any reference to a renamed/removed tool is a **P2 FAIL** — record it in the audit table and the suggested-next-steps section.
3. **Doc consistency.** Output formats and field references in the SKILL.md body should match what the referenced tools actually return. Drift here is a FLAG, not a FAIL.

Skills are **not** rows in `roslyn://server/catalog`; this audit is a quality/contract check, not a tier-promotion pipeline.

### Tagging

Each skill row gets one of: `pass`, `flag`, `fail`. A `fail` requires at least one tool reference that does not resolve to the live catalog or a missing/empty frontmatter field.

### Skills audit table

Append one row per discovered SKILL.md:

| Skill | Tree | frontmatter_ok | tool_refs_valid (invalid_count) | tag | Notes |
|-------|------|----------------|----------------------------------|-----|-------|
| `surface-audit` | `.claude/skills` | yes | yes (0) | pass | |
| `example-shipped` | `skills` | yes | no (2) | fail | references removed `old_tool_name`, `another_old` |

### Skills audit checkpoint

Before closing the report, confirm:

- Do all live skills pass frontmatter parity?
- Does any skill reference a removed/renamed tool (the P2 FAIL classification)?
- If a skills tree was inaccessible (e.g. the shipped `skills/` tree is absent in a non-Roslyn-Backed-MCP repo checkout), is that recorded as `blocked — <tree> not accessible` rather than silently dropped?

## Non-goals

- Does NOT edit documentation. The user decides which claims to update and in what PR.
- Does NOT re-audit the old `## [1.x.y]` CHANGELOG sections. Frozen history is expected to drift from live.
- Does NOT count backlog rows, plan initiatives, or CI steps — those are tracked elsewhere.
- Does NOT execute skill workflows end-to-end against a live workspace — that is `/mcp-server-stress`'s lane (server-execution audit). This skill is the static-surface lane.