Skip to main content
AI/MLjmagly

corpus-health

Report on research corpus health, completeness, and integrity

Stars
141
Source
jmagly/aiwg
Updated
2026-05-31
Slug
jmagly--aiwg--corpus-health
View on GitHubRaw SKILL.md

// install — copy + paste into any project

mkdir -p .claude/skills && curl -fsSL https://raw.githubusercontent.com/jmagly/aiwg/HEAD/agentic/code/frameworks/sdlc-complete/skills/corpus-health/SKILL.md -o .claude/skills/corpus-health.md

Drops the SKILL.md into .claude/skills/corpus-health.md. Works with Claude Code, Cursor, and any agent that loads SKILL.md files from .claude/skills/.

Corpus Health Command

Assess the health and completeness of the research corpus at .aiwg/research/.

Kernel Delegation

As of ADR-021, corpus-health delegates structural lint to the semantic memory kernel.

Delegation pattern:

  1. corpus-health retains its research-specific health-check UX
  2. Delegates to memory-lint --consumer research-complete --severity warning
  3. Research-specific layers remain in this wrapper:
    • GRADE coverage check
    • Citation completeness validation
    • Research corpus-specific metrics

Backward compatibility: No UX changes.

@agentic/code/addons/semantic-memory/skills/memory-lint/SKILL.md

Instructions

When invoked, analyze the research corpus and report on its health:

  1. Scan Corpus Structure

    • Count sources in .aiwg/research/sources/
    • Count findings in .aiwg/research/findings/
    • Count quality assessments in .aiwg/research/quality-assessments/
    • Count provenance records in .aiwg/research/provenance/records/
  2. Frontmatter Completeness

    • Check each source for required frontmatter per @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/frontmatter-schema.yaml
    • Report sources missing: ref_id, title, authors, year, DOI, key_findings
    • Calculate completeness percentage
  3. PDF Integrity

    • Check .aiwg/research/pdfs/ for downloaded papers
    • Verify SHA-256 checksums against frontmatter pdf_hash values
    • Report missing PDFs and checksum mismatches
  4. Cross-Reference Integrity

    • Verify all REF-XXX in findings reference valid sources
    • Check for orphaned findings (no source reference)
    • Check for orphaned sources (never cited in any artifact)
  5. Staleness Check

    • Flag sources with last_verified > 90 days old
    • Flag DOIs that haven't been re-verified
    • Report sources needing refresh
  6. Evidence Gaps

    • Load .aiwg/research/TODO.md for planned research
    • Count unresolved evidence gaps
    • Report gap severity distribution
  7. Generate Health Report

    • Summary dashboard with pass/warn/fail indicators
    • Detailed findings per category
    • Action items prioritized by severity

Arguments

  • --brief - Show summary only
  • --fix - Attempt to fix frontmatter gaps and regenerate checksums
  • --report - Save report to .aiwg/reports/corpus-health.md

References

  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/frontmatter-schema.yaml - Frontmatter requirements
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/schemas/research/fixity-manifest.yaml - Fixity verification
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/rules/research-metadata.md - Research metadata rules
  • @$AIWG_ROOT/agentic/code/frameworks/sdlc-complete/agents/citation-verifier.md - Citation Verifier agent