Test Suite Auditor (L2 Coordinator)
Coordinates comprehensive test suite audit across 6 quality categories using 5 specialized workers.
Purpose & Scope
- •L2 Coordinator that delegates to L3 specialized audit workers
- •Audits all tests against 6 quality categories (via 5 workers)
- •Calculates Usefulness Score for each test (Keep/Remove/Refactor)
- •Identifies missing tests for critical business logic
- •Detects anti-patterns and isolation issues
- •Aggregates results into unified report
- •Creates single Linear task in Epic 0
- •Manual invocation by user; not part of Story pipeline
Core Philosophy
"Write tests. Not too many. Mostly integration." — Kent Beck "Test based on risk, not coverage." — ISO 29119
Key Principles:
- •Test business logic, not frameworks — bcrypt/Prisma/Express already tested
- •No performance/load/stress tests — Tests infrastructure, not code correctness (use k6/JMeter separately)
- •Risk-based prioritization — Priority ≥15 or remove
- •E2E for critical paths only — Money/Security/Data (Priority ≥20)
- •Usefulness over quantity — One useful test > 10 useless tests
- •Every test must justify existence — Impact × Probability ≥15
Workflow
Phase 1: Discovery (Automated)
Inputs: Codebase root directory
Actions:
- •Find all test files using Glob:
- •
**/*.test.*(Jest, Vitest) - •
**/*.spec.*(Mocha, Jasmine) - •
**/__tests__/**/*(Jest convention)
- •
- •Parse test file structure (test names, assertions count)
- •Auto-discover Team ID from docs/tasks/kanban_board.md
Output: testFilesMetadata — list of test files with basic stats
Phase 2: Research Best Practices (ONCE)
Goal: Gather testing best practices context ONCE, share with all workers
Actions:
- •Use MCP Ref/Context7 to research testing best practices for detected tech stack
- •Load ../ln-350-story-test-planner/references/risk_based_testing_guide.md
- •Build
contextStorewith:- •Testing philosophy (E2E primary, Unit supplementary)
- •Usefulness Score formulas (Impact × Probability)
- •Anti-patterns catalog
- •Framework detection patterns
Output: contextStore — shared context for all workers
Key Benefit: Context gathered ONCE → passed to all workers → token-efficient
Phase 3: Domain Discovery (NEW)
Purpose: Detect project domains from production code folder structure for domain-aware coverage analysis.
Algorithm: (same as ln-360-codebase-auditor)
- •
Priority 1: Explicit domain folders
- •Check for:
src/domains/*/,src/features/*/,src/modules/*/ - •Monorepo patterns:
packages/*/,libs/*/,apps/*/ - •If found (>1 match) → use as domains
- •Check for:
- •
Priority 2: Top-level src/ folders*
- •List folders:
src/users/,src/orders/,src/payments/ - •Exclude infrastructure:
utils,shared,common,lib,helpers,config,types,interfaces,constants,middleware,infrastructure,core - •If remaining >1 → use as domains
- •List folders:
- •
Priority 3: Fallback to global mode
- •If <2 domains detected →
domain_mode = "global" - •All workers scan entire codebase (backward-compatible behavior)
- •If <2 domains detected →
Heuristics for domain detection:
| Heuristic | Indicator | Example |
|---|---|---|
| File count | >5 files in folder | src/users/ with 12 files |
| Structure | controllers/, services/, models/ present | MVC/Clean Architecture |
| Barrel export | index.ts/index.js exists | Module pattern |
| README | README.md describes domain | Domain documentation |
Output:
{
"domain_mode": "domain-aware",
"all_domains": [
{"name": "users", "path": "src/users", "file_count": 45},
{"name": "orders", "path": "src/orders", "file_count": 32},
{"name": "shared", "path": "src/shared", "file_count": 15, "is_shared": true}
]
}
Shared folder handling:
- •Folders named
shared,common,utils,lib,core→ markis_shared: true - •Shared code audited but grouped separately in report
Phase 4: Delegate to Workers
CRITICAL: All delegations use Task tool with
subagent_type: "general-purpose"for context isolation.
Prompt template:
Task(description: "Test audit via ln-63X",
prompt: "Execute ln-63X-{worker}. Read skill from ln-63X-{worker}/SKILL.md. Context: {contextStore}",
subagent_type: "general-purpose")
Anti-Patterns:
- •❌ Direct Skill tool invocation without Task wrapper
- •❌ Any execution bypassing subagent context isolation
Phase 4a: Global Workers (PARALLEL)
Global workers scan entire test suite (not domain-aware):
| # | Worker | Category | What It Audits |
|---|---|---|---|
| 1 | ln-631-test-business-logic-auditor | Business Logic Focus | Framework/Library tests (Prisma, Express, bcrypt, JWT, axios, React hooks) → REMOVE |
| 2 | ln-632-test-e2e-priority-auditor | E2E Priority | E2E baseline (2/endpoint), Pyramid validation, Missing E2E tests |
| 3 | ln-633-test-value-auditor | Risk-Based Value | Usefulness Score = Impact × Probability<br>Decisions: ≥15 KEEP, 10-14 REVIEW, <10 REMOVE |
| 5 | ln-635-test-isolation-auditor | Isolation + Anti-Patterns | Isolation (6 categories), Determinism, Anti-Patterns (6 types) |
Invocation (4 workers in PARALLEL):
FOR EACH worker IN [ln-631, ln-632, ln-633, ln-635]:
Task(description: "Test audit via " + worker,
prompt: "Execute " + worker + ". Read skill. Context: " + JSON.stringify(contextStore),
subagent_type: "general-purpose")
Phase 4b: Domain-Aware Worker (PARALLEL per domain)
Domain-aware worker runs once per domain:
| # | Worker | Category | What It Audits |
|---|---|---|---|
| 4 | ln-634-test-coverage-auditor | Coverage Gaps | Missing tests for critical paths per domain (Money 20+, Security 20+, Data 15+, Core Flows 15+) |
Invocation:
IF domain_mode == "domain-aware":
FOR EACH domain IN all_domains:
domain_context = {
...contextStore,
domain_mode: "domain-aware",
current_domain: { name: domain.name, path: domain.path }
}
Skill(skill="ln-634-test-coverage-auditor", args=JSON.stringify(domain_context))
ELSE:
// Fallback: invoke once for entire codebase (global mode)
Skill(skill="ln-634-test-coverage-auditor", args=JSON.stringify(contextStore))
Parallelism strategy:
- •Phase 4a: All 4 global workers run in PARALLEL
- •Phase 4b: All N domain-aware invocations run in PARALLEL
- •Example: 3 domains → 3 ln-374 invocations in single message
Each worker returns structured JSON:
{
"category": "Business Logic Focus",
"score": 7,
"total_issues": 12,
"findings": [
{
"severity": "MEDIUM",
"test_file": "auth.test.ts",
"test_name": "bcrypt hashes password",
"decision": "REMOVE",
"usefulness_score": 3,
"reason": "Tests library behavior, not OUR code",
"effort": "S"
}
]
}
Phase 5: Aggregate Results
Goal: Merge all worker results into unified Test Suite Audit Report with domain grouping
Actions:
- •Collect results from all workers (global + domain-aware)
- •Global workers (ln-371, ln-372, ln-373, ln-375) → merge findings (as before)
- •Domain-aware worker (ln-374) → group by domain.name:
- •Aggregate coverage gaps per domain
- •Build Domain Coverage Summary table
- •Merge findings into decision categories:
- •Tests to REMOVE (Usefulness Score <10)
- •Tests to REVIEW (Usefulness Score 10-14)
- •Tests to KEEP (Usefulness Score ≥15)
- •Missing Tests (Priority/Justification) — grouped by domain if domain_mode="domain-aware"
- •Anti-Patterns Found (counts + examples)
- •Calculate compliance scores (6 categories, each /10)
- •Build decision summary (KEEP/REVIEW/REMOVE counts)
- •Generate Executive Summary (2-3 sentences)
- •Create Linear task in Epic 0 with full report (see Output Format below)
- •Return summary to user
Findings grouping:
- •Categories 1-3, 5-6 (Business Logic, E2E, Value, Isolation, Anti-Patterns) → single tables (global)
- •Category 4 (Coverage Gaps) → subtables per domain (if domain_mode="domain-aware")
Output Format
## Test Suite Audit Report - [DATE] ### Executive Summary [2-3 sentences: test suite health, major issues, key recommendations] ### Test Count by Decision | Decision | Count | % | |----------|-------|---| | KEEP | X | X% | | REVIEW | X | X% | | REMOVE | X | X% | | **Total** | **X** | | ### Compliance Score | Category | Score | Notes | |----------|-------|-------| | Business Logic Focus | X/10 | X framework tests found | | E2E Priority | X/10 | X E2E, X Integration, X Unit | | Risk-Based Value | X/10 | X tests with Priority <10 | | Coverage Gaps | X/10 | X critical paths untested | | Test Isolation | X/10 | X isolation issues | | Anti-Patterns | X/10 | X anti-patterns found | | **Overall** | **X/10** | | ### Domain Coverage Summary (NEW - if domain_mode="domain-aware") | Domain | Critical Paths | Tested | Coverage % | Gaps | |--------|---------------|--------|------------|------| | users | 8 | 6 | 75% | 2 | | orders | 12 | 8 | 67% | 4 | | payments | 6 | 5 | 83% | 1 | | **Total** | **26** | **19** | **73%** | **7** | ### Audit Findings | Severity | Location | Issue | Violated Principle | Recommendation | Effort | |----------|----------|-------|-------------------|----------------|--------| | **CRITICAL** | - | Missing E2E for payment with discount | #3: E2E for critical paths only (Money Priority 25) | Add E2E test: successful payment + discount edge cases | M | | **HIGH** | auth.test.ts:45 | Test "bcrypt hashes password" (Score 3) | #1: Test business logic, not frameworks | Delete — bcrypt already tested by maintainers | S | | **HIGH** | db.test.ts:78 | Test "Prisma findMany returns array" (Score 4) | #1: Test business logic, not frameworks | Delete — Prisma ORM already tested | S | | **HIGH** | - | Missing Unit test for tax calculation edge cases | #5: Every test must justify existence (Impact 5 × Probability 4 = 20) | Add Unit tests for country-specific tax rules | M | | **MEDIUM** | utils.test.ts:23 | Test "validateEmail returns true" (Score 12) | #4: Usefulness over quantity | Review: if E2E login covers → DELETE; else KEEP | S | | **MEDIUM** | order.test.ts:200-350 | Giant test (>100 lines) | Anti-pattern: The Giant | Split into focused tests (one scenario per test) | M | | **MEDIUM** | test.ts:45 | No assertions in test | Anti-pattern: The Liar | Add specific assertions or delete test | S | | **LOW** | auth.test.ts | Only positive login scenarios | Anti-pattern: Happy Path Only | Add negative tests (invalid credentials, expired tokens) | M | ### Coverage Gaps by Domain (if domain_mode="domain-aware") #### Domain: users (src/users/) | Severity | Category | Missing Test | Location | Priority | Effort | |----------|----------|--------------|----------|----------|--------| | CRITICAL | Money | E2E: processRefund() | services/user.ts:120 | 20 | M | | HIGH | Security | Unit: validatePermissions() | middleware/auth.ts:45 | 18 | S | #### Domain: orders (src/orders/) | Severity | Category | Missing Test | Location | Priority | Effort | |----------|----------|--------------|----------|----------|--------| | CRITICAL | Money | E2E: applyDiscount() | services/order.ts:45 | 25 | M | | HIGH | Data | Integration: orderTransaction() | repositories/order.ts:78 | 16 | M |
Worker Architecture
Each worker:
- •Receives
contextStorewith testing best practices - •Receives
testFilesMetadatawith test file list - •Loads full test file contents when analyzing
- •Returns structured JSON with category findings
- •Operates independently (failure in one doesn't block others)
Token Efficiency:
- •Coordinator: metadata only (~1000 tokens)
- •Workers: full test file contents when needed (~5000-10000 tokens each)
- •Context gathered ONCE, shared with all workers
Critical Rules
- •Two-stage delegation: Global workers (4) + Domain-aware worker (ln-374 × N domains)
- •Domain discovery: Auto-detect domains from folder structure; fallback to global mode if <2 domains
- •Parallel execution: All workers (global + domain-aware) run in PARALLEL
- •Domain-grouped output: Coverage Gaps findings grouped by domain (if domain_mode="domain-aware")
- •Delete > Archive: Remove useless tests, don't comment out
- •E2E baseline: Every endpoint needs 2 E2E (positive + negative)
- •Justify each test: If can't explain Priority ≥15, remove it
- •Trust frameworks: Don't test Express/Prisma/bcrypt behavior
- •No performance/load tests: Flag and REMOVE tests measuring throughput/latency/memory (DevOps Epic territory)
- •Code is truth: If test contradicts code behavior, update test
- •Language preservation: Report in project's language (EN/RU)
Definition of Done
- •All test files discovered via Glob
- •Context gathered from testing best practices (MCP Ref/Context7)
- •Domain discovery completed (domain_mode determined)
- •contextStore built with test metadata + domain info
- •Global workers (4) invoked in PARALLEL
- •Domain-aware worker (ln-374) invoked per domain in PARALLEL
- •All workers completed successfully (or reported errors)
- •Results aggregated with domain grouping (if domain_mode="domain-aware")
- •Domain Coverage Summary built (if domain_mode="domain-aware")
- •Compliance scores calculated (6 categories)
- •Keep/Remove/Refactor decisions for each test
- •Missing tests identified with Priority (grouped by domain if applicable)
- •Anti-patterns catalogued
- •Linear task created in Epic 0 with full report
- •Summary returned to user
Related Skills
- •
Workers:
- •ln-631-test-business-logic-auditor — Framework tests detection
- •ln-632-test-e2e-priority-auditor — E2E baseline validation
- •ln-633-test-value-auditor — Usefulness Score calculation
- •ln-634-test-coverage-auditor — Coverage gaps identification
- •ln-635-test-isolation-auditor — Isolation + Anti-Patterns
- •
Reference:
- •../ln-350-story-test-planner — Risk-Based Testing Guide
- •../ln-620-codebase-auditor — Codebase audit coordinator (similar pattern)
Version: 4.0.0 Last Updated: 2025-12-23