Metrics Framework: Leading vs Lagging Indicators

When to use: Setting up metrics, designing dashboards, or when you need early signal before outcomes materialize

Framework source: Aakash Gupta's "Become an A/B Testing Expert" and metrics guidance

Quick Start

Tell me: "Help me build a metrics framework for [product/feature]"
I will check thoughts/shared/pm/frameworks/ for your North Star and thoughts/shared/pm/metrics/ for baseline data
I will ask about your North Star (lagging metric), product stage, and decision speed needs
We build a metric hierarchy: Lagging (quarterly) -> Leading (weekly) -> Input (daily)
For each key metric, we define alert thresholds (green/yellow/red) and a dashboard layout
Output goes to thoughts/shared/pm/metrics/metrics-framework-[date].md

Key principle: Leading metrics should PREDICT lagging metrics. If they do not, you are tracking the wrong thing. We always validate the correlation before committing to a leading indicator.

Context Routing Logic (Internal - for Claude)

Automatic Context Checks: When this skill is invoked, immediately check:

Source	Files/Folders	Search Terms	What to Extract
Strategy Context	`thoughts/shared/pm/frameworks/*.md`	North Star, objective, business goal	Company-wide lagging metric target
Business Model	`thoughts/shared/pm/context/business-info-template.md`	metrics, KPIs, growth model	Current revenue/retention/growth focus
Metrics History	`thoughts/shared/pm/metrics/*.md`	baseline, trends, benchmarks	Historical leading/lagging metric data
PRDs	`thoughts/shared/pm/prds/*.md`	success metrics, measurement	Feature-level metric definitions
Meetings	`thoughts/shared/product/meeting-notes/*.md`	"metrics", "dashboard", "KPI"	Stakeholder metric priorities

Context Priority:

Business strategy and North Star FIRST
Current product metrics and baselines SECOND
Historical trend data THIRD
Feature-specific metrics FOURTH

Cross-Skill Links:

If defining North Star → Link to /define-north-star
If setting up feature success → Link to /feature-metrics for STEDII framework
If tracking product health → Link to /retention-analysis and /activation-analysis
If analyzing test results → Link to /feature-results

Step 0: Understanding Your Metric System

Before designing your metrics framework, let me understand what you're optimizing for...

Checking:

thoughts/shared/pm/context/business-info-template.md for business model and goals
thoughts/shared/pm/frameworks/*.md for strategic priorities
thoughts/shared/pm/metrics/ for existing metric baselines
thoughts/shared/pm/prds/ for current feature success metrics

Based on what I find, I'll show you:

Your Current Metric Health

North Star / Lagging Metric:

[Current North Star metric from strategy]
[Baseline value: X]
[Target: Y by when]

Existing Leading Indicators:

[What you're already tracking]
[Historical effectiveness: does this predict the lagging metric?]

Metric Gaps:

[Metrics you're missing that would help you move faster]
[Decision-making gaps: where you're flying blind]

PM-Specific Diagnosis Questions

Decision Speed: How long do you need to make go/no-go decisions? (Days vs weeks vs months)
Product Stage: Are you focused on growth, retention, monetization, or operational efficiency?
Team Alignment: Do different teams use different success metrics? (Problem!)
Data Quality: Do you have reliable tracking for the metrics you care about?
Stakeholder Expectations: Does your executive team care about your current metrics?

Leading vs Lagging Metrics

Lagging Metrics (Outcome Indicators)

Definition: Metrics that measure final outcomes - what already happened

Characteristics:

Slow to respond
High certainty (accurate)
Ultimate measure of success
Hard to influence directly

Examples:

Revenue
Customer lifetime value (LTV)
Annual retention rate
Net Promoter Score (NPS)
Market share
Profit margins

When to use:

Board reporting
Annual/quarterly goals
Validating long-term bets
Measuring ultimate success

Leading Metrics (Predictive Indicators)

Definition: Metrics that predict future outcomes - early signals

Characteristics:

Fast to respond
Lower certainty (probabilistic)
Actionable in real-time
Directly influenceable

Examples:

Day 7 activation rate (predicts retention)
Weekly active users (predicts revenue)
Feature adoption rate (predicts stickiness)
Time to first value (predicts churn)
Support ticket volume (predicts satisfaction)

When to use:

Day-to-day decisions
Experiment evaluation
Early problem detection
Sprint planning

The Connection

Leading metrics should PREDICT lagging metrics.

If they don't, you're tracking the wrong leading metrics.

Example 1: E-commerce

Lagging: Monthly revenue Leading:

Add-to-cart rate (faster signal)
Checkout start rate
Payment success rate

Logic: If add-to-cart drops today, revenue drops next week.

Example 2: SaaS Product

Lagging: 90-day retention Leading:

Day 7 activation rate
Weekly active usage
Feature adoption in first 14 days

Logic: Users who activate in 7 days have 10x better 90-day retention.

Example 3: Marketplace

Lagging: GMV (Gross Merchandise Value) Leading:

Search-to-view rate
View-to-cart rate
Seller response time
Inventory depth

Logic: More searches + better inventory = more transactions = higher GMV

How to Identify Your Leading Metrics

Step 1: Start with Your North Star (Lagging Metric)

What's the ultimate outcome you care about?

Examples:

Revenue
User retention
Engagement
Customer satisfaction

Step 2: Work Backwards to Find Predictors

Use this prompt pattern:

Use /metrics-framework and reference thoughts/shared/pm/context/business-info-template.md

Our North Star / main outcome metric is: [your lagging metric]

Help me identify 3-5 leading indicators that:
1. Happen BEFORE this outcome
2. Predict this outcome with data
3. Are actionable (we can influence them)
4. Give us signal within 1-2 weeks

Our product: [describe product]
User journey: [describe key actions]
Current metrics: [what you track today]

Step 3: Validate the Correlation

Run analysis to prove the connection:

Cohort analysis:
- Users with high leading metric → retention/revenue outcome
- Users with low leading metric → churn outcome
Time-series correlation:
- Leading metric moves up Week 1
- Lagging metric moves up Week 4-6
- Correlation coefficient > 0.7

If no correlation exists, it's not a leading indicator.

Minimum Data Quality Bar

Before trusting any correlation between leading and lagging metrics:

Minimum thresholds:

Sample size: At least 1,000 users (or 100 accounts for B2B) in each comparison group
Time period: At least 6 weeks of data (2 full product cycles minimum)
Event frequency: Each user should have 5+ events of the measured behavior (avoid drawing conclusions from single actions)
Segment stability: The user segments being compared should be stable (not changing composition) during the measurement period

Data quality red flags:

Correlation based on fewer than 500 users -- mark as "Directional only, not statistically reliable"
Correlation from a single week of data -- mark as "Preliminary, needs confirmation over 4+ weeks"
Correlation that only holds for one segment but not others -- note the segment-specific caveat
Correlation that reverses direction when a confounding variable is controlled for -- do not report as a finding

How to communicate data quality:

High confidence: "Based on 3,200 users over 8 weeks, X predicts Y with r=0.72"
Medium confidence: "Directional signal from 800 users over 4 weeks. Needs more data to confirm."
Low confidence: "Preliminary observation from 200 users. Not yet actionable."

Metric Hierarchy

Build a metric tree from lagging → leading:

LAGGING (Outcome)
    ↑
LEADING (Predictive)
    ↑
INPUT METRICS (Drivers)

Example: Slack

Tier 1 - Lagging (Annual):

Revenue
Logo retention

Tier 2 - Leading (Monthly):

Monthly Active Teams (MAT)
Teams sending 2,000+ messages

Tier 3 - Input Metrics (Weekly):

Weekly Active Users
Messages sent per user
Channels created
Integrations added

Tier 4 - Activation Metrics (Daily):

Day 1 setup completion
First message sent
Team invites sent

How it works:

Day 1 activation → predicts Week 1 usage
Week 1 usage → predicts Month 1 engagement
Month 1 engagement → predicts annual retention
Annual retention → drives revenue

Dashboard Design

Problem: Most dashboards show only lagging metrics

Solution: Split your dashboard into sections:

1. North Star (Top of Dashboard)

The ONE lagging metric that matters most
Monthly/Quarterly trend

2. Leading Indicators (Middle)

3-5 metrics that predict North Star
Weekly trends
Alert thresholds

3. Input Metrics (Bottom)

5-10 levers you can pull
Daily/Weekly monitoring
Team-specific metrics

Example Dashboard: SaaS Product

North Star:

📊 MRR (Monthly Recurring Revenue): $250K (+12% MoM)

Leading Indicators:

⚡ Weekly Active Accounts: 2,500 (+8% WoW)
⚡ Day 7 Activation Rate: 45% (+5pp vs last week)
⚡ Feature Adoption (core feature): 72% (+3pp)

Input Metrics:

Signups this week: 320
Setup completion rate: 75%
Time to first value: 8 minutes (median)
Support tickets: 45 (-10 vs last week)
Invited users per account: 3.2 avg

Real-World Examples

Netflix

Lagging: Subscriber growth, Revenue Leading:

New user activation (% watching 2+ titles in first month)
Weekly streaming hours per member
Content completion rates

Why it works: High engagement predicts low churn predicts revenue growth

Airbnb

Lagging: Nights booked (GMV) Leading:

Search-to-booking conversion rate
Average search results quality
Host response time
New listings added

Why it works: More supply + faster responses = more bookings

Spotify

Lagging: Premium subscriber growth Leading:

Free user engagement (hours listened)
Playlist saves per user
Weekly active users

Why it works: Highly engaged free users convert to paid

Common Patterns

Pattern 1: Activation → Retention → Revenue

Chain:

Input: Signup rate
Leading: Day 7 activation
Leading: Day 30 retention
Lagging: LTV

Application: Focus on activation to drive long-term revenue

Pattern 2: Engagement → Monetization

Chain:

Input: Feature usage
Leading: Weekly active usage
Leading: Feature depth (# features used)
Lagging: Conversion to paid

Application: Drive engagement before asking for money

Pattern 3: Quality → Virality → Growth

Chain:

Input: User satisfaction score
Leading: Invite/referral rate
Leading: Viral coefficient (invites per user)
Lagging: Organic user growth

Application: Build quality product → users refer friends

When Leading Metrics Fail

Red Flag #1: No Correlation with Outcomes

Problem: Your "leading" metric moves, but lagging metric doesn't

Example:

Leading (claimed): Page views
Lagging: Revenue
Reality: Page views don't predict revenue

Fix: Find a different leading metric with proven correlation

Red Flag #2: Too Slow to Be Leading

Problem: Your leading metric takes 6 months to show signal

Example:

Claiming "Q1 signups" is a leading indicator for "Q3 revenue"
That's still lagging (too slow)

Fix: Find faster signals (weekly, not quarterly)

Red Flag #3: Vanity Metrics Disguised as Leading

Problem: Metric looks good but doesn't predict anything

Examples:

Social media followers (doesn't predict revenue)
App downloads (doesn't predict active users)
Website traffic (doesn't predict conversions)

Fix: Validate with cohort analysis

Practical Application

For Product Decisions:

Question: Should we build Feature X?

Bad answer: "It might increase revenue" Good answer: "It will increase [leading metric] by X%, which historically drives [lagging metric] by Y%"

For Experiments:

Question: Should we run an A/B test for 4 weeks?

Bad answer: "We need to measure revenue impact" Good answer: "We can measure activation rate in Week 1, which predicts retention in Month 1"

For Team Goals:

Bad goal: "Increase revenue 20%" Good goal: "Increase activation rate from 45% → 60%, which drives revenue +20%"

Why better: Team can influence activation daily, can't directly influence revenue

Metric Selection Checklist

Use this for any metric you're considering:

Leading Metric Criteria:

Responds within 1-2 weeks (not months)
Predicts lagging outcome (proven with data)
Actionable (team can influence it)
Measurable (reliable tracking)
Understandable (team knows what it means)

Lagging Metric Criteria:

Ultimate outcome you care about
Aligns with business goals
Can't be gamed
Measures real value delivery

Example Workflow: Building Your Metric System

Step 1: Define North Star (Lagging)

"What's the ultimate outcome?" → Answer: Customer lifetime value (LTV)

Step 2: Identify Leading Indicators

"What predicts LTV?" → Analysis shows: 90-day retention predicts LTV → Leading metric: 90-day retention rate

Step 3: Find Faster Signals

"What predicts 90-day retention?" → Analysis shows: Day 7 activation predicts 90-day retention → Leading metric: Day 7 activation rate

Step 4: Identify Input Drivers

"What drives Day 7 activation?" → Analysis shows:

Time to first value
Feature adoption in first session
Invite completion → Input metrics: TTV, first-session features used, invites sent

Step 5: Build Dashboard

North Star: LTV
Leading (slow): 90-day retention
Leading (fast): Day 7 activation
Inputs: TTV, feature adoption, invites

Common Mistakes

❌ Only tracking lagging metrics

Problem: No early warning system
Fix: Add 3-5 leading indicators

❌ Claiming correlation without data

Problem: "Trust me, this predicts that"
Fix: Run cohort analysis to prove it

❌ Too many leading metrics

Problem: Analysis paralysis
Fix: Pick 3-5 that matter most

❌ Optimizing leading metrics that don't matter

Problem: Increase page views, revenue doesn't move
Fix: Validate correlation first

Output Template

Metrics Framework: [Feature/Product Name]

North Star: [Metric] Framework Type: [Leading/Lagging Hierarchy or Input/Output Model]

Metric Hierarchy

Tier	Metric	Current	Target	Cadence	Owner
Lagging (Quarterly)	[e.g., Revenue, LTV, 90-day retention]	[Baseline]	[Goal]	Quarterly review	[Executive/PM]
Leading (Weekly)	[e.g., Day 7 activation, WAU, feature adoption]	[Baseline]	[Goal]	Weekly review	[PM/Growth]
Leading (Weekly)	[e.g., Time to first value, invite rate]	[Baseline]	[Goal]	Weekly review	[PM/Growth]
Input (Daily)	[e.g., Signups, setup completion, first action]	[Baseline]	[Goal]	Daily monitoring	[Team lead]
Input (Daily)	[e.g., Support tickets, error rate]	[Baseline]	[Goal]	Daily monitoring	[Engineering]

Correlation Validation Plan

Hypothesis	Test Method	Data Needed	Timeline
[Leading metric X] predicts [Lagging metric Y]	Cohort analysis: compare high-X vs low-X users on Y outcome	3 months of user-level data	2 weeks to validate
[Input metric A] drives [Leading metric B]	Time-series correlation: does A movement precede B movement?	6 weeks of daily data	1 week to validate

Alert Thresholds

Metric	Green (On Track)	Yellow (Attention)	Red (Intervention)	Action on Red
[Lagging metric]	[Above target]	[Within 1 SD or 10% below target]	[2 SD or 20%+ below target]	[Who gets alerted; what investigation to run; what corrective action]
[Leading metric]	[Above target]	[Within 1 SD or 10% below target]	[2 SD or 20%+ below target]	[Who gets alerted; what investigation to run; what corrective action]
[Input metric]	[Above target]	[Within 1 SD or 10% below target]	[2 SD or 20%+ below target]	[Who gets alerted; what investigation to run; what corrective action]

Dashboard Design

Section 1 - North Star (top of dashboard): [The ONE lagging metric, shown as a large number with monthly/quarterly trend line and target overlay]

Section 2 - Leading Indicators (middle): [3-5 weekly metrics shown as sparkline charts with alert thresholds marked. Each metric includes a comparison to prior week and target.]

Section 3 - Input Metrics (bottom): [5-10 daily levers shown as a table with current value, WoW change, and status indicator (green/yellow/red). Grouped by team ownership.]

Alert Threshold Guidance

For each key metric, define three thresholds:

Green (On Track): Metric is at or above target. No action needed. Keep monitoring at standard cadence.
Yellow (Needs Attention): Metric is trending down or below target by a small margin (typically 1 standard deviation below target, or 10% miss). Action: Increase monitoring cadence (e.g., daily instead of weekly), investigate root cause, prepare a response plan.
Red (Intervention Needed): Metric has dropped significantly (typically 2 standard deviations below target, or missing by more than 20%). Action: Immediate investigation.

For each Red threshold, specify three things:

Who gets alerted: Name the person or team (e.g., "PM + Engineering Lead + VP Product")
What investigation to run: Specific analysis (e.g., "Run cohort breakdown by acquisition channel and device type to isolate the drop")
What action to take: Concrete response (e.g., "If caused by a recent release, initiate rollback decision within 24 hours. If caused by external factor, schedule war room within 48 hours.")

Calibration tip: Start with thresholds based on historical variance. After 4-6 weeks, adjust based on actual alert volume. If you get Yellow alerts every week, your threshold is too sensitive. If you never get Yellow alerts, it is too loose.

Metric Retirement

Leading indicators don't last forever. Retire or replace them when:

Signals it's time to retire a metric:

The correlation to the lagging outcome has weakened for 2+ consecutive quarters
The metric has flatlined (no movement despite product changes) -- it may have reached a natural ceiling
The metric is being gamed (teams optimize for the number rather than the underlying behavior)
The product has evolved and the metric no longer captures the core user behavior

Retirement process:

Flag it: Add a "Under Review" tag to the metric in your dashboard and alert stakeholders
Run a replacement analysis: What metric better predicts the lagging outcome now? Use the correlation validation plan to test candidates.
Overlap period: Run old and new metrics in parallel for 4-6 weeks to validate the replacement
Sunset: Remove old metric from primary dashboard. Archive in a "Retired Metrics" section with retirement rationale and replacement metric.
Document: Add to the learning log -- "Metric X retired because [reason]. Replaced by Metric Y because [evidence]."

Common retirement patterns:

Activation metric retires when onboarding flow changes fundamentally
Feature adoption metric retires when feature reaches saturation (>80% adoption)
NPS retires when replaced by a more predictive satisfaction measure (e.g., task completion rate)

Multi-Product Guidance

For products with multiple value streams (e.g., separate features for creation, analytics, and collaboration), do not try to force everything into one flat metric hierarchy. Instead:

Define a shared North Star that captures overall product value (e.g., "Weekly Active Organizations").
Build a metric hierarchy for each value stream:
- Stream A (Creation): Leading metrics specific to creation (e.g., documents created, time to first publish)
- Stream B (Analytics): Leading metrics specific to analytics (e.g., dashboards viewed, insights exported)
- Stream C (Collaboration): Leading metrics specific to collaboration (e.g., comments posted, shared workspaces)
Show how each stream rolls up to the North Star: Each stream contributes to the shared North Star through a specific mechanism. Make this explicit (e.g., "Users who use both Creation and Collaboration have 2x higher weekly activity than single-stream users").
Each stream needs its own leading indicators, but guardrail metrics can be shared across streams (e.g., NPS, support ticket volume).
Avoid averaging across streams. If one stream is thriving and another is declining, an average masks the problem. Report each stream independently and then show the aggregate.

Output Integration

Where Files Go

Metric framework documentation:

Active work: thoughts/shared/pm/metrics/metrics-framework-[date].md (living document for your team)
Dashboard spec: Share with analytics team for instrumentation

Link to Other Work

After building your framework:

Reference in strategy - "Our metric hierarchy connects [lagging] to [leading] to [input]"
Use in feature PRDs - "This feature's success metric is [X], a leading indicator for [lagging]"
Guide experiments - Teams use leading metrics to evaluate experiments faster
Inform roadmap - Identify which product work drives which leading metrics

Cross-Skill Integration

Feeds into:

/define-north-star - Your lagging metric becomes (or informs) the North Star
/feature-metrics - Feature success metrics are leading indicators in this framework
/retention-analysis - Leading metrics for retention are identified here
/activation-analysis - Activation metrics are leading indicators for engagement

Pulls from:

/write-prod-strategy - Strategy determines what you're optimizing for
thoughts/shared/pm/metrics/ - Historical data validates correlations
/define-north-star - What is your ultimate success metric?

Related Skills

/define-north-star - Choose your North Star (lagging metric)
/activation-analysis - Find leading activation indicators
/feature-metrics - Choose experiment metrics with STEDII
/retention-analysis - Analyze retention predictors

Output Quality Self-Check

Before delivering the metrics framework, verify:

North Star is anchored: The framework starts from a clearly defined North Star or lagging metric. If none exists, recommend running /define-north-star first.
Three tiers defined: The hierarchy includes at least one metric at each tier: Lagging (quarterly), Leading (weekly), and Input (daily). A framework with only lagging metrics is just a scorecard, not a decision tool.
Correlation hypotheses stated: For each leading-to-lagging connection, the hypothesis is explicit and a validation method is described. Do not claim "X predicts Y" without explaining how to prove it.
Alert thresholds set: At least the top 3-5 metrics have green/yellow/red thresholds defined with specific actions for Red. A metric without an alert threshold is decoration, not instrumentation.
Owners assigned: Every metric has a team or person responsible. Unowned metrics do not get watched.
Dashboard design described: The output includes a dashboard layout description showing what goes where and at what cadence. The PM should be able to hand this to an analyst and get a working dashboard.
No vanity metrics: Every metric in the framework has a clear connection to either a lagging outcome or an actionable input. If a metric "looks good but does not drive decisions," remove it.
Multi-stream handled: If the product has multiple value streams, each stream has its own leading indicators with explicit roll-up to the shared North Star. If the product is single-stream, this is not needed.
Metric retirement guidance included: For any existing metrics being reviewed, include signals for when to retire them and a replacement process. New frameworks should note that metrics have a shelf life.
Data quality bar stated: For any correlation claims between leading and lagging metrics, state the sample size, time period, and confidence level. Do not present low-confidence correlations as validated findings.

Framework credit: Adapted from Aakash Gupta's metrics frameworks. Read: https://www.news.aakashg.com/p/become-an-ab-testing-expert-advanced