Dataseb155Free

test-orchestrator

Full-ecosystem test pyramid orchestrator. This skill should be used when the user asks to 'run all tests', 'test orchestrator', 'check coverage', 'verify test health', or needs the full L0-L4 pyramid executed with reporting.

Repo bundle on Versuzseb155/atlas-plugin336 indexed entries (SKILL.md and CLAUDE.md) from this repository — open the full bundle view.

Open bundle →

View on GitHub ↗</>github.com/seb155/atlas-plugin Yours? Claim it ↗

§ 01 — Stats

Prior1090

Quality—

Score—

Tasks—

§ 02 — Install

Get test-orchestrator.

Free SKILL.md scraped from GitHub. Clone the repo or copy the file directly into your Claude Code skills directory.

One-line install · Claude Code

$npx versuz@latest install seb155-atlas-plugin-dist-atlas-dev-addon-skills-test-orchestrator

Or clone the repo

$git clone https://github.com/seb155/atlas-plugin.git

Or copy the SKILL.md manually

$cp atlas-plugin/SKILL.MD ~/.claude/skills/seb155-atlas-plugin-dist-atlas-dev-addon-skills-test-orchestrator/SKILL.md

More Versuz picks

★ Featured$1.99

vz-bench-debug

Document

★ Featured$0.99

vz-scrape-runner

Web

Got something better ?Submit your skill — it enters tomorrow's cycle. No fee.

Submit yours →

§ 05 — Challenge

Think you can beat it?

$npx versuz challenge seb155-atlas-plugin-dist-atlas-dev-addon-skills-test-orchestrator↵

Show SKILL.md content (~2.1k tokens)

---
name: test-orchestrator
description: "Full-ecosystem test pyramid orchestrator. This skill should be used when the user asks to 'run all tests', 'test orchestrator', 'check coverage', 'verify test health', or needs the full L0-L4 pyramid executed with reporting."
mode: [coding, engineering]
model: sonnet
---

# Test Orchestrator

**Principle**: Never claim tests pass without running them. Always show actual output.

## Subcommands

| Command | Suite | Approx. time | When to use |
|---------|-------|--------------|-------------|
| `/atlas test smoke` | Backend smoke + Frontend vitest | ~30s | After every change |
| `/atlas test unit` | All backend + frontend unit tests | ~2min | Before PR |
| `/atlas test integration` | DB integration tests | ~3min | Local only (needs DB) |
| `/atlas test e2e` | Playwright E2E suite | ~5min | UI feature changes |
| `/atlas test security` | Security test suite | ~1min | Before merge to main |
| `/atlas test plugin` | Atlas Plugin structural + build tests | ~20s | After editing skills/hooks |
| `/atlas test infra` | Infrastructure health checks | ~30s | After Docker/infra changes |
| `/atlas test full` | Complete pyramid (all above) | ~10min | Release, major refactor |
| `/atlas test coverage` | Coverage report + threshold check | ~3min | Sprint review |

## Exact Commands

### `/atlas test smoke`
```bash
# Backend smoke — fastest health check
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/ -x -q --tb=short -m smoke 2>/dev/null || python -m pytest tests/ -x -q --tb=short --co -q 2>&1 | head -5 && python -m pytest tests/test_health.py tests/test_api_instruments.py -x -q --tb=short 2>/dev/null || python -m pytest tests/ -x -q --tb=short -k 'health or smoke'"

# Frontend vitest — unit + type check
# From the project's frontend directory:
bunx vitest --run --reporter=verbose 2>&1 | tail -20
bun run type-check 2>&1 | tail -10
```

### `/atlas test unit`
```bash
# Backend — full unit suite (not integration)
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/ -x -q --tb=short --ignore=tests/integration"

# Frontend — vitest + type-check
# From the project's frontend directory: `bunx vitest --run && bun run type-check`
```

### `/atlas test integration`
```bash
# Requires: Docker stack running (db:5433, backend:8001)
# Local only — NEVER run on ATL-dev (no Docker)
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/integration/ -x -q --tb=short"
```

### `/atlas test e2e`
```bash
# Full Playwright QA suite
# From the project's frontend directory: `bunx playwright test e2e/qa-*.spec.ts`

# Single spec (faster iteration)
# From the project's frontend directory: `bunx playwright test e2e/qa-instruments.spec.ts`
```

### `/atlas test security`
```bash
# Backend security tests
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/ -x -q --tb=short -k 'security or rbac or auth'"

# Frontend — grep for risky patterns (run from the project root)
grep -r "localStorage.*[Tt]oken" frontend/src/ && echo "WARNING: token in localStorage" || echo "OK: no token in localStorage"
grep -r "allow_origins.*\*" backend/ && echo "WARNING: CORS wildcard" || echo "OK: no CORS wildcard"
```

### `/atlas test plugin`
```bash
# Atlas Plugin structural tests (no Docker needed)
# From the project's atlas-plugin directory: `python -m pytest tests/ -x -q --tb=short`
```

### `/atlas test infra`
```bash
# Docker stack health (run from the project root)
docker compose -f compose.yml ps --format "{{.Name}} {{.Status}}"

# API health
curl -s http://localhost:8001/health | python3 -m json.tool

# Frontend responding
curl -s -o /dev/null -w "Frontend HTTP %{http_code}\n" http://localhost:4000

# DB connectivity
docker exec synapse-backend bash -c "cd /app && python -c 'from app.db import get_db; print(\"DB OK\")'"
```

### `/atlas test full`
```bash
# Run all suites sequentially (fail-fast at each level)
# Step 1: Environment health (run from the project root)
docker compose -f compose.yml ps --format "{{.Name}} {{.Status}}"
curl -sf http://localhost:8001/health > /dev/null && echo "Backend OK" || echo "Backend DOWN"

# Step 2: Plugin (fastest, no Docker)
# From the project's atlas-plugin directory: `python -m pytest tests/ -x -q --tb=short`

# Step 3: Backend unit
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/ -x -q --tb=short --ignore=tests/integration"

# Step 4: Frontend unit + types
# From the project's frontend directory: `bunx vitest --run && bun run type-check`

# Step 5: Integration
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/integration/ -x -q --tb=short"

# Step 6: E2E
# From the project's frontend directory: `bunx playwright test e2e/qa-*.spec.ts`
```

### `/atlas test coverage`
```bash
# Backend coverage (check thresholds in .atlas/test-config.yaml)
docker exec synapse-backend bash -c "cd /app && python -m pytest tests/ -q --tb=short --ignore=tests/integration --cov=app --cov-report=term-missing --cov-fail-under=15"

# Frontend coverage
# From the project's frontend directory: `bunx vitest --run --coverage`

# Plugin coverage
# From the project's atlas-plugin directory: `python -m pytest tests/ -q --tb=short --cov=. --cov-report=term-missing --cov-fail-under=100 -k 'frontmatter or schema or structure'`
```

## Coverage Thresholds

See `.atlas/test-config.yaml` for authoritative thresholds.

| Suite | Current floor | Target |
|-------|--------------|--------|
| Backend | 15% | 25% |
| Frontend | 8% | 15% |
| Plugin structural | 100% | 100% |

## Output Format

Always report results in this format:

```
TEST PYRAMID REPORT
Plugin:      PASS/FAIL  {n} passed ({n} failed) — {duration}
Backend:     PASS/FAIL  {n} passed ({n} failed) — {duration}
Frontend:    PASS/FAIL  {n} passed, type-check PASS/FAIL — {duration}
Integration: PASS/FAIL  {n} passed ({n} failed) — {duration}
E2E:         PASS/FAIL  {n} scenarios — {duration}
Coverage:    BE {n}% / FE {n}% / Plugin {n}%
OVERALL: PASS/FAIL
```

## Safe Flags

| Flag | Behavior |
|------|----------|
| `-x` | Stop at first failure (ALWAYS use with pytest) |
| `-q` | Quiet output |
| `--tb=short` | Short traceback (NEVER `--tb=long`) |
| `--run` | Single vitest run (NEVER `--watch`) |

## Never

- NEVER use `--pdb`, `-s`, `--watch`, `nodemon` (interactive = hang)
- NEVER run `tests/` without `-x` on a large suite — test one file first
- NEVER skip type-check (`bun run type-check`) when frontend tests pass
- NEVER run integration tests on `ATL-dev` (no Docker)

---

## SOTA Test Architecture — Read When Auditing or Designing CI Gates

Before:
- writing or reviewing any `.woodpecker/*.yml` / `.github/workflows/*.yml` test step,
- planning a test coverage sprint,
- responding to "we have X tests but prod still breaks",
- auditing a project's test maturity for handoff/review,

**read `references/sota-testing-patterns.md`**.

It documents the **5 defects of "test theatre"** (skeleton tests, smoke-only CI, `failure: ignore`, mocked-DB integration, unenforced coverage) and the **5-level test maturity model** (L0 theatre → L5 quality gates) with concrete templates and rollout playbooks.

**TL;DR rules** (enforced via code review of CI configs):

- Test step in CI MUST run more than `-m smoke` (broader filter, e.g. `not external and not slow`).
- `failure: ignore` / `continue-on-error: true` BANNED on test steps. Red = blocks merge.
- Integration tests touch a real DB (postgres-in-CI service) — not mocks.
- Coverage enforced in CI (`--cov-fail-under=N`), not just local.
- No skeleton tests in tree (`grep -r "auto-generated skeleton"` returns 0).
- Templates centralized so adding a new page/hook/route auto-creates a test stub.

When user asks "is our test setup good?" → run the 5-question audit from the reference and propose a L0→L1→L2 rollout sized for the team.