Skip to main content

audit

Review past Claude Code sessions for PBR workflow compliance and UX quality.

Stars
17
Source
SienkLogic/plan-build-run
Updated
2026-04-03
Slug
SienkLogic--plan-build-run--audit
View on GitHubRaw SKILL.md

// install — copy + paste into any project

mkdir -p .claude/skills && curl -fsSL https://raw.githubusercontent.com/SienkLogic/plan-build-run/HEAD/plugins/pbr/skills/audit/SKILL.md -o .claude/skills/audit.md

Drops the SKILL.md into .claude/skills/audit.md. Works with Claude Code, Cursor, and any agent that loads SKILL.md files from .claude/skills/.

STOP — DO NOT READ THIS FILE. You are already reading it. This prompt was injected into your context by Claude Code's plugin system. Using the Read tool on this SKILL.md file wastes tokens. Begin executing Step 0 immediately.

Step 0 — Immediate Output

Before ANY tool calls, display this banner:

╔══════════════════════════════════════════════════════════════╗
║  PLAN-BUILD-RUN ► SESSION AUDIT                              ║
╚══════════════════════════════════════════════════════════════╝

Then proceed to Step 1.

/pbr:audit — Session Compliance & UX Review

You are running the audit skill. Your job is to analyze past Claude Code session logs for this project, checking PBR workflow compliance (STATE.md updates, hook firing, commit format, skill usage) and user experience quality (flow choice, friction, unmet expectations). You produce a comprehensive report document.

This skill uses parallel Task() delegation to analyze multiple sessions simultaneously, keeping main context lean.


Context Budget

Reference: skills/shared/context-budget.md for the universal orchestrator rules. Reference: skills/shared/agent-type-resolution.md for agent type fallback when spawning Task() subagents.

Additionally for this skill:

  • Delegate ALL session analysis to audit subagents — do NOT read JSONL files in main context
  • Main context handles: argument parsing, session discovery, agent orchestration, report synthesis
  • Target: main context stays under 20% utilization

Step 1 — Parse Arguments

Parse $ARGUMENTS for:

Argument Default Description
--from DATE Start of today Start of audit window (ISO date or natural language)
--to DATE Now End of audit window
--today false Shorthand for --from start of today --to now
--mode MODE full compliance = workflow only, ux = user experience only, full = both
--preset PRESET config.json audit.preset Override audit preset (minimal/standard/comprehensive)
--dimension ID (none) Add specific dimension(s) to the active set. Accepts code (SI-01) or slug (skill-template-refs). Repeatable.
--skip ID (none) Remove specific dimension(s) from the active set. Accepts code or slug. Repeatable.
--only ID (none) Run ONLY these dimensions, ignoring preset/categories. Accepts code or slug. Repeatable.

Natural language parsing: Accept formats like:

  • --today or just today
  • --from 2026-02-21 or --from "yesterday"
  • --from "3 days ago" or --from "last monday"
  • A bare date like 02/21 implies --from 02/21 --to 02/21 (full day)
  • A bare 3 implies last 3 days

If no arguments provided, default to --today --mode full.

Display the parsed time range to the user:

Audit window: {from} → {to}
Mode: {mode}

Step 1b — Resolve Dimensions

Load the project's config.json audit section. Build cliFlags from parsed arguments:

const cliFlags = {
  preset: parsedArgs.preset,       // string or undefined
  dimension: parsedArgs.dimension, // array of ID/slug strings or undefined
  skip: parsedArgs.skip,           // array of ID/slug strings or undefined
  only: parsedArgs.only,           // array of ID/slug strings or undefined
};

Call audit-dimensions.js resolveDimensions(config, cliFlags) to compute the active dimension set.

Display the resolution summary:

Preset: {preset} ({N} base dimensions)
Active dimensions: {N} ({breakdown by category, e.g., "SI: 15, IH: 10, EF: 7, WC: 12"})

If --dimension or --skip was used, also display:

Adjustments: +{added} -{removed}

Store the resolved dimension list for use in Step 4.


Step 2 — Discover Session Logs

Session JSONL files live at:

~/.claude/projects/{encoded-project-path}/*.jsonl

Where {encoded-project-path} encodes the project directory path (e.g., D:\Repos\plan-build-runD--Repos-plan-build-run).

CRITICAL: Determine the correct encoded path for the current project by listing ~/.claude/projects/ and finding the directory that matches.

Use Bash to find sessions in the audit window:

find ~/.claude/projects/{encoded-path}/ -name "*.jsonl" -maxdepth 1 \
  -newermt "{from_datetime}" ! -newermt "{to_datetime}" | sort

CRITICAL — Exclude Current Session: Before proceeding, determine the current session ID:

  1. The current session's JSONL file is the most recently modified .jsonl in the project directory that is actively being written to (i.e., THIS session)
  2. Identify it by checking which session file has been modified within the last 60 seconds: find ~/.claude/projects/{encoded-path}/ -name "*.jsonl" -maxdepth 1 -mmin -1 2>/dev/null
  3. Remove the current session from the discovered sessions list
  4. Display: Excluding current session {id} from analysis (self-referential)
  5. If no sessions remain after exclusion, show the "no sessions found" error

For each session file found, also check for subagent logs:

ls ~/.claude/projects/{encoded-path}/{session-id}/subagents/*.jsonl 2>/dev/null

Display discovery results:

Found {N} sessions in audit window:
  {session-id-1} ({size}, {date})
  {session-id-2} ({size}, {date})
  ...

If no sessions found, display an error and exit:

╔══════════════════════════════════════════════════════════════╗
║  ERROR                                                       ║
╚══════════════════════════════════════════════════════════════╝

No session logs found between {from} and {to}.
Check: ~/.claude/projects/{encoded-path}/

Step 3 — Discover Git Activity

In parallel with session analysis (Step 4), gather git commit data for the audit window:

git log --since="{from_iso}" --until="{to_iso}" --format="%h %s %an %ai" --all

Check for:

  • Conventional commit format violations
  • Forbidden Co-Authored-By lines
  • Release-please automated commits

This data feeds into the final report synthesis.


Step 4 — Spawn Audit Agents

CRITICAL: Spawn one pbr:audit agent per session, ALL in parallel. Do NOT analyze sessions sequentially.

4a. Discover Insights Report

Before spawning agents, check for a recent insights report:

ls -t ~/.claude/insights/*.html 2>/dev/null | head -1

If a file exists and was modified within the last 30 days, store its absolute path as insights_report_path. Otherwise set to 'none'.

4b. Compute Spawn Parameters

Compute paths for the spawn prompt:

  • Plugin root: absolute path to plugins/pbr (e.g., D:/Repos/plan-build-run/plugins/pbr)
  • Planning dir: absolute path to .planning (e.g., D:/Repos/plan-build-run/.planning)
  • Config JSON: JSON.stringify(config) from the loaded config.json — escape backslashes and quotes for template embedding

For each session:

Task({
  subagent_type: "pbr:audit",
  prompt: "<files_to_read>
    CRITICAL: Read these files BEFORE any other action:
    1. {absolute_path_to_session.jsonl} — session log to analyze
    2. {subagent log paths, if any} — subagent session logs
  </files_to_read>
  <audit_assignment>
    Session JSONL: {absolute_path_to_session.jsonl}
    Subagent logs: {list of subagent jsonl paths, or 'none'}
    Audit mode: {mode}
    Active dimensions: {comma-separated list of dimension codes from Step 1b}
    Preset: {preset}
    Plugin root: {absolute path to plugins/pbr}
    Planning dir: {absolute path to .planning}
    Config JSON: {JSON.stringify(config) — escaped for template}

    Run programmatic checks first via audit-checks/index.js runAllChecks(),
    then analyze JSONL for session-dependent dimensions.
    Return per-dimension results table.

    Only check dimensions in the active set. Skip all others.
    Output path: DO NOT write to disk — return findings inline.

    Analyze this session for PBR workflow compliance and/or UX quality
    per your audit checklists. Return your full findings as structured
    markdown in your response.

    Insights report: {insights_report_path}
  </audit_assignment>"
})

Also spawn a git analysis agent:

Task({
  subagent_type: "pbr:general",
  model: "haiku",
  prompt: "Run these git commands in {project_dir}:
    1. git log --since='{from}' --until='{to}' --format='%h|%s|%an|%ai' --all
    2. git log --since='{from}' --until='{to}' --all --format='%B' | grep -i 'co-authored-by' || echo 'None found'
    Report: all commits, any format violations against pattern {type}({scope}): {desc}, any co-author lines."
})

Display progress:

◐ Analyzing {N} sessions in parallel...

Step 5 — Collect and Synthesize

As agents complete, check each audit agent's Task() output for ## AUDIT COMPLETE. If the marker is absent, mark that session as "analysis failed" in the synthesis and skip its findings — do not treat incomplete output as valid analysis. Log: ⚠ Session {id}: audit agent did not return completion marker — skipping.

Wait for all agents before proceeding.

5a. Parse Per-Dimension Results

From each agent's output, extract the per-dimension results tables. Each row has: Code, Dimension, Status, Evidence Summary.

Merge results across sessions:

  • For static dimensions (SI, IH, FV): results are the same across sessions — use the single result
  • For session-dependent dimensions (EF, WC, BC, SQ, QM): use the worst status across sessions (fail > warn > pass)
  • Record which sessions contributed to each dimension's evidence

5b. Build Category Summaries

For each of the 9 categories (AC, SI, IH, EF, WC, BC, SQ, FV, QM):

  • Count pass/warn/fail per category
  • Format: {Category Name} ({Code}): {pass}/{total} pass, {warn} warn, {fail} fail

Compute overall dimension score: {pass}/{total} dimensions passed

5c. Executive Summary

  • Total sessions, total commits, releases
  • Overall dimension score: {pass}/{total} dimensions passed
  • Overall compliance: how many sessions passed/failed
  • Headline finding (the most important issue)

5d. Per-Session Summary Table

Session Duration Commands Compliance UX Rating Dimensions Checked

5e. Cross-Session Patterns

  • Recurring issues (e.g., STATE.md never read across multiple sessions)
  • Hook coverage gaps
  • Common flow mistakes
  • Dimensions that failed across ALL sessions (systemic issues)

5f. Consolidated Findings

Merge and deduplicate findings across sessions. Categorize by severity:

  • CRITICAL: Workflow bypassed despite user requests, hooks not firing
  • HIGH: State files not consulted, missing artifacts
  • MEDIUM: Suboptimal flow choice, missing feedback
  • LOW: Minor ceremony issues, informational

5g. Recommendations

Prioritize as:

  • Immediate: Fix in next session
  • Short-term: Fix in next sprint/milestone
  • Medium-term: Architectural improvements

Step 6 — Write Report

CRITICAL: Write the full report to disk. Do NOT just display it inline.

Write to: .planning/audits/{YYYY-MM-DD}-session-audit.md

CRITICAL: Create the audits directory NOW. Do not skip this step.

Create .planning/audits/ directory if it doesn't exist.

The report should follow this v2 structure:

# PBR Session Audit Report — {date range}

**Audit Period:** {from} – {to}
**Sessions Analyzed:** {N}
**Commits:** {N}
**Mode:** {mode}

---

## Executive Summary
{2-3 sentence overview}
**Overall dimension score:** {pass}/{total} dimensions passed

## Dimension Coverage

**Preset:** {preset}
**Dimensions checked:** {N}/{total enabled}
**Overall:** {pass} pass, {warn} warn, {fail} fail

## Per-Category Results

### Self-Integrity (SI): {pass}/{total} pass, {warn} warn

| Code | Dimension | Status | Evidence Summary |
|------|-----------|--------|------------------|

### Infrastructure Health (IH): {pass}/{total} pass, {warn} warn

(repeat for each active category: AC, SI, IH, EF, WC, BC, SQ, FV, QM)

## Config Cross-Reference

For each dimension that references an `audit.thresholds` key, show:

| Dimension | Threshold Key | Configured Value | Check Result |
|-----------|--------------|-----------------|--------------|
| IH-03 | hook_performance_ms | 500 | pass |

(Only include dimensions with non-null thresholdKey from audit-dimensions.js)

## Session Summary
{per-session table}

## Detailed Session Analysis
{per-session findings}

## Git Activity
{commit summary, format compliance}

## Cross-Session Patterns
{recurring issues}

## Trend Analysis

If QM-03 baseline comparison data is available:
- Dimensions that **regressed** (were pass, now warn/fail)
- Dimensions that **improved** (were warn/fail, now pass)
- Overall trend direction

If no prior audit data: "No prior audit data available for trend comparison."

## Consolidated Findings
### Critical
### High
### Medium
### Low

## Recommendations
### Immediate
### Short-Term
### Medium-Term

---
*Generated by /pbr:audit on {date}*

Step 6b — Spot-Check Artifacts

Before displaying results, verify the report landed on disk:

  1. Glob .planning/audits/{YYYY-MM-DD}-session-audit.md to confirm the file exists
  2. If missing: re-attempt the write (Step 6). If still missing, display an error and include findings inline instead.

Step 6c — Verbosity Control

Apply verbosity filtering to the Per-Category Results section based on the active preset:

Preset Per-Category Results Behavior
minimal Omit Per-Category Results detail entirely. Show only category-level summary lines (e.g., "SI: 14/15 pass, 1 warn").
standard Per-Category Results shows only warn/fail dimensions. Passing dimensions are omitted from the detail tables.
comprehensive Per-Category Results shows ALL dimensions including passing ones.

Implementation: After building the per-category dimension tables in Step 5, filter rows before writing to the report:

  • If preset is minimal: skip writing dimension tables, keep only the ### {Category} ({Code}): {summary} header lines
  • If preset is standard: filter each category table to only include rows where Status is warn or fail
  • If preset is comprehensive: include all rows

The Dimension Coverage section and Executive Summary are always included regardless of verbosity.


Step 7 — Display Summary

After writing the report, display inline (keep it concise — the full report is on disk):

╔══════════════════════════════════════════════════════════════╗
║  PLAN-BUILD-RUN ► AUDIT COMPLETE ✓                           ║
╚══════════════════════════════════════════════════════════════╝

{N} sessions analyzed, {N} commits reviewed

Compliance: {X}/{N} sessions passed
UX Rating:  {average or per-session ratings}

Top findings:
  1. {headline finding 1}
  2. {headline finding 2}
  3. {headline finding 3}

Full report: .planning/audits/{filename}


╔══════════════════════════════════════════════════════════════╗
║  ▶ NEXT UP                                                   ║
╚══════════════════════════════════════════════════════════════╝

{Smart routing based on findings:}
- If critical issues found: **Fix workflow** → `/pbr:quick`
- If todos identified: **Create todos** → `/pbr:add-todo "{description}"`
- Default: **See project status** → `/pbr:progress`

<sub>`/clear` first → fresh context window</sub>

Error Handling

Reference: skills/shared/error-reporting.md for branded error output patterns.

Agent fails to analyze a session

If an audit agent fails:

⚠ Failed to analyze session {id}: {error}
Continuing with remaining {N-1} sessions.

Include a note in the final report that session was skipped.

No sessions found

Display error (Step 2) and exit gracefully.

Very large session files (>5MB)

Warn the agent to sample rather than read the full log:

Note: Session {id} is {size}MB. Sampling key sections (first 200 lines, last 200 lines, user messages, hook events).

Reference: skills/shared/commit-planning-docs.md -- if planning.commit_docs is true, commit the audit report file.


Anti-Patterns

Additionally for this skill:

  1. DO NOT read JSONL files in main context — always delegate to audit agents
  2. DO NOT display the full report inline — write to disk, show summary
  3. DO NOT analyze sessions sequentially — spawn all agents in parallel
  4. DO NOT describe multiple agent spawns in prose (e.g., "launched 5 audit agents") — each session audit agent and the git analysis agent MUST be a separate Task() tool call in a single response message. Each separate call gets its own colored badge and independent ctrl+o expansion in the Claude Code UI. Multiple Task() calls in one message still run concurrently.
  5. DO NOT report findings without evidence (line numbers, timestamps, quotes)
  6. DO NOT judge explore sessions for missing STATE.md updates (explore is read-only)
  7. DO NOT flag release-please/merge commits as format violations
  8. DO NOT fabricate UX ratings — base them on concrete evidence (user repetitions, escalations, course-corrections)
  9. DO NOT exceed 5 headline findings in the inline summary — full details go in the report file