What makes a key result strong?

Four characteristics: measurable (specific number or testable threshold), outcome-aligned (achieving the KR moves the objective forward), within team influence (the team can affect it through their work), time-bounded (measurement window is the quarter). 3-5 KRs per objective is the sweet spot. Single KR is fragile; 6+ dilute focus. Each KR has a baseline (current value) and target (where the team commits to reach). Use the historical-trajectory test: targets should be meaningfully above where the metric would land without focused effort.

How is okr-design different from roadmap-planning?

OKRs are outcomes (results to be achieved). Roadmap items are outputs (features and initiatives sequenced). Same quarter often has both; they answer different questions. The OKR is 'improve activation for new sign-ups so more reach value-realization in week one.' The roadmap items contributing are 'onboarding redesign, welcome email sequence revision, activation triage automation.' Common conflation: treating OKRs as roadmap commitments. Mid-quarter tactical changes feel like broken promises if stakeholders interpret OKRs as output commitments.

When should OKRs be recalibrated mid-quarter?

The default is OKRs hold for the quarter. Tactics adapt; OKRs do not. Recalibration is the exception, not the rule. Three conditions warrant it: strategic shift (company strategy changed materially), major external disruption (market change, regulatory event, significant outage), or invalidating information (the metric does not actually represent the outcome). NOT triggers for recalibration: the team is not on track (the miss is the signal), the OKR is uncomfortable (stretch is uncomfortable by design), easier OKRs would feel achievable (sandbagging in slow motion).

Should OKRs be tied to compensation?

OKRs work best when not directly tied to compensation. When OKRs determine bonuses, sandbagging incentives become severe. Teams set OKRs they know they can hit. The 60-70% target collapses to 95% target because nobody wants to risk their bonus on stretch. Most healthy OKR cultures separate goal-setting from compensation. Compensation flows through performance reviews informed by many inputs (manager assessment, peer feedback, role expectations); OKR scores may be one input but should not be dominant.

What review cadence works for OKRs?

Four cadences compose. Weekly check-ins (15-30 min): tactical adjustment, surface blockers, not for adjusting OKRs. Mid-quarter review (60-90 min, week 6): honest trajectory assessment, decision point for any rare recalibration. End-of-quarter scoring (60-90 min): score honestly using the 0.0-1.0 scale; document context. Quarterly retrospective: review the practice itself; what worked, what did not, what should change. Skipping cadences is the failure mode; teams that skip weekly check-ins often discover at end of quarter that they could have course-corrected.

Flagship Skill · OKR design

The OKR design skill.

OKRs are accountability infrastructure.

A senior product leader's playbook for OKR design as actually shipped, not as conference-talk theory. Outcome statements that drive decisions, key results that measure the right thing, scoring discipline, mid-quarter recalibration, and the difference between sandbagged OKRs (always 100%), aspirational fantasy (always 30%), and stretch OKRs (genuine ambition with quarterly accountability).

Audience: senior PMs, product directors, engineering leaders, executives setting org-wide OKRs, in-house teams operating in OKR-driven cultures.

View the skill on GitHub Browse the full catalog

What this skill is for

The PM suite, grouped by where work happens.

OKR design sits in strategy and planning: the outcome-target discipline that ladders down from company strategy into team commitments. Distinct from roadmap-planning (which sequences outputs).

Upstream: Discovery & Strategy

discovery-research-synthesis
One-off research synthesis.
jtbd-framing
Jobs-to-be-Done framing technique.
user-feedback-aggregation
Continuous feedback streams.
ux-research
Structured research projects.

Strategy & Planning

okr-design (this skill)
Outcome targets for the quarter.
roadmap-planning
Initiatives sequenced by priority.
pm-spec-writing
Per-piece spec discipline.

Execution

experiment-design
Rigorous A/B testing.
feature-flagging
Rollout mechanics.
beta-program-management
Beta cohorts that produce signal.
feature-launch-playbook
Launch as discipline.

Measurement

product-analytics-setup
Instrumentation discipline.
experimentation-analytics
Reading experiment results.
data-warehouse-experimentation
Warehouse-native experimentation.
experimentation-platform-orchestrator
Platform decision.

The keystone distinction

Three positions. Both extremes are failure modes.

Failure mode

Sandbagged

OKRs designed to hit 100%. Targets at or below current trajectory. No ambition, no learning. Most "OKRs we hit every quarter" sets fall here.

Failure mode

Aspirational fantasy

OKRs nobody can hit. 1000% growth in 90 days. Demoralizing, performative, ignored after week 2.

The discipline

Stretch

Genuine ambition with quarterly accountability. Designed to hit 60-70% on average across KRs. Hits and misses both teach something.

The OKR scorecard

End-of-quarter scoring on the 0.0-1.0 scale.

End of quarter scorecard

Illustrative example. Average score 0.77 (in the stretch zone).

Improve activation for new sign-ups

Obj: 0.72

KR1First-week activation rate from 32% to 45% (41%)0.69
KR2Median time-to-first-value from 4.2 days to under 2 days (2.6 days)0.73
KR3Onboarding flow completion rate to 80% (76%)0.75

Make the support experience deflect predictable issues

Obj: 0.82

KR1Reduce ticket volume on top 3 deflection categories by 40% (31%)0.78
KR2Self-service resolution rate from 22% to 38% (33%)0.69
KR3Maintain CSAT at or above 4.4/5 (4.5/5)1.00

Metrics shown are illustrative.

The framework

Twelve considerations for OKR design.

01Stretch, not sandbag or fantasy
02Objectives are outcomes, not outputs
03Few objectives, well-focused (2-4)
04Key results are measurable
05KRs outcome-aligned
06KRs within team influence
073-5 KRs per objective
08Cascading explicit but not over-constrained
09Scoring honest, not rounded
10Recalibration rare
11Weekly + mid-quarter + end-of-quarter cadence
12OKRs separate from roadmap and metrics

What is in the skill

Twelve sections covered in the body.

01
What this skill is for
Outcomes (results to be achieved). Distinct from roadmap-planning (outputs). Pairs with feature-launch-playbook, product-analytics-setup.
02
Sandbagged vs aspirational-fantasy vs stretch
The keystone framing. 60-70% average score target.
03
Objective design
Outcome-focused, specific to quarter, inspiring without fantasy. Few in number (2-4 per team).
04
Key result design
Measurable, outcome-aligned, within team influence, time-bounded. 3-5 KRs per objective.
05
Cascading OKRs across the org
When to cascade strictly vs loosely. The middle path with explicit ladders.
06
Scoring discipline
0.0-1.0 scale. The 60-70% target. Scoring honesty. The compensation question.
07
Mid-quarter recalibration
When to recalibrate (rarely). When NOT to (uncomfortable, off-track, easier feels achievable).
08
The OKR review cadence
Weekly + mid-quarter + end-of-quarter + quarterly retrospective.
09
OKRs vs roadmap items vs metrics
Three different concepts. OKRs are outcomes; roadmap is outputs; metrics are continuous measurements.
10
Common failure modes
11+ patterns: always 100%, always 30%, output KRs, OKR proliferation, generous scoring, compensation coupling.
11
The framework: 12 considerations
Stretch not sandbag or fantasy, outcome objectives, measurable KRs, cascading explicit, scoring honest, recalibration rare, cadence kept.
12
Closing: OKRs are accountability infrastructure
Quarterly accountability when designed well; ceremony when designed badly.

Reference files

Nine references that go alongside the SKILL.md.

references/objective-design-patterns.md
Outcome-vs-output distinction. Strong vs weak objective characteristics. Worked examples across domains. The few-objectives discipline.
references/key-result-design-patterns.md
Four characteristics. Strong vs weak KRs. The 3-5 KR rule. Baseline and target setting. Leading vs lagging KRs.
references/cascading-okrs-decisions.md
When to cascade strictly vs loosely. The middle path. Cascading anti-patterns. Honest disclosure about cascading difficulty.
references/scoring-discipline.md
0.0-1.0 scale. The 60-70% target. Scoring honesty. What 100% and 30% mean. The compensation question.
references/mid-quarter-recalibration.md
When to recalibrate vs adapt tactics. Strategic shift vs uncomfortable OKRs. The recalibration discipline.
references/review-cadence-templates.md
Weekly check-ins, mid-quarter review, end-of-quarter scoring, quarterly retrospective. Format and time investment per cadence.
references/okrs-vs-roadmap-vs-metrics.md
Three concepts and their relationships. Common conflations. The complete picture across all three.
references/okr-anti-patterns.md
12+ anti-patterns including OKR-as-roadmap, sandbagging, fantasy, vanity metrics, OKR theater, compensation coupling.
references/common-okr-failures.md
15+ failure patterns with diagnoses and cures. The cross-cutting ceremony-vs-accountability pattern.

Browse all reference files on GitHub

Pairs with these platforms

Three platforms with OKR-relevant infrastructure.

The skill is platform-agnostic. These platforms ship workflows that fit OKR programs: Notion (OKR tracking docs and review templates), Mixpanel (KR measurement and tracking), BigQuery (KR data pipelines for harder-to-measure outcomes).

Bridges to other PM-suite skills

Five sister skills that compose with OKR design.

Outputs scope
roadmap-planning
Roadmap is outputs (features and initiatives sequenced); OKRs are outcomes. Same quarter has both; they answer different questions.
Post-ship execution
feature-launch-playbook
Launch discipline; OKRs may include KRs about post-launch outcomes. The two compose for major launches.
Measurement infrastructure
product-analytics-setup
Analytics provide the metrics KRs depend on. OKRs cannot be measured if metrics are not instrumented.
Quantitative validation
experiment-design
Experiments validate that specific initiatives produce OKR outcomes. The two compose for outcome-driven testing.
Cross-team orchestration
integration-orchestrator
Cross-team OKRs require coordination. Integration orchestration handles the dependencies and handoffs.

Direction 7 closes

The third of five PM skills closing Direction 7.

OKR design is the third of five PM skills shipped together in Direction 7 Dispatch B. Together with discovery-research-synthesis, jtbd-framing, beta-program-management, and user-feedback-aggregation, plus the Tier 2 content suite (Dispatch A), Direction 7 closes with 9 new skills total.

The catalog now carries 86 flagships across creative direction, content, design, SEO, project management, marketing, and operations.

Open source under MIT

Read the SKILL.md on GitHub.

The skill source lives in the rampstackco/claude-skills repository. MIT licensed.

View SKILL.md Browse the full catalog

Frequently asked questions.

What does 'sandbagged vs aspirational fantasy vs stretch' mean?: Sandbagged: OKRs designed to hit 100%. Targets at or below current trajectory. No ambition, no learning, no signal. Aspirational fantasy: OKRs nobody can hit. 1000% growth in 90 days. Demoralizing, ignored after week 2. Stretch: genuine ambition with quarterly accountability. Designed to hit 60-70% on average across KRs. Hits and misses both teach something. The litmus test: track average score over multiple quarters. 95%+ signals sandbagging; below 30% signals fantasy; 50-75% range is stretch territory.
What makes a key result strong?: Four characteristics: measurable (specific number or testable threshold), outcome-aligned (achieving the KR moves the objective forward), within team influence (the team can affect it through their work), time-bounded (measurement window is the quarter). 3-5 KRs per objective is the sweet spot. Single KR is fragile; 6+ dilute focus. Each KR has a baseline (current value) and target (where the team commits to reach). Use the historical-trajectory test: targets should be meaningfully above where the metric would land without focused effort.
How is okr-design different from roadmap-planning?: OKRs are outcomes (results to be achieved). Roadmap items are outputs (features and initiatives sequenced). Same quarter often has both; they answer different questions. The OKR is 'improve activation for new sign-ups so more reach value-realization in week one.' The roadmap items contributing are 'onboarding redesign, welcome email sequence revision, activation triage automation.' Common conflation: treating OKRs as roadmap commitments. Mid-quarter tactical changes feel like broken promises if stakeholders interpret OKRs as output commitments.
When should OKRs be recalibrated mid-quarter?: The default is OKRs hold for the quarter. Tactics adapt; OKRs do not. Recalibration is the exception, not the rule. Three conditions warrant it: strategic shift (company strategy changed materially), major external disruption (market change, regulatory event, significant outage), or invalidating information (the metric does not actually represent the outcome). NOT triggers for recalibration: the team is not on track (the miss is the signal), the OKR is uncomfortable (stretch is uncomfortable by design), easier OKRs would feel achievable (sandbagging in slow motion).
Should OKRs be tied to compensation?: OKRs work best when not directly tied to compensation. When OKRs determine bonuses, sandbagging incentives become severe. Teams set OKRs they know they can hit. The 60-70% target collapses to 95% target because nobody wants to risk their bonus on stretch. Most healthy OKR cultures separate goal-setting from compensation. Compensation flows through performance reviews informed by many inputs (manager assessment, peer feedback, role expectations); OKR scores may be one input but should not be dominant.
What review cadence works for OKRs?: Four cadences compose. Weekly check-ins (15-30 min): tactical adjustment, surface blockers, not for adjusting OKRs. Mid-quarter review (60-90 min, week 6): honest trajectory assessment, decision point for any rare recalibration. End-of-quarter scoring (60-90 min): score honestly using the 0.0-1.0 scale; document context. Quarterly retrospective: review the practice itself; what worked, what did not, what should change. Skipping cadences is the failure mode; teams that skip weekly check-ins often discover at end of quarter that they could have course-corrected.

The OKR design skill.

The PM suite, grouped by where work happens.

Three positions. Both extremes are failure modes.

Sandbagged

Aspirational fantasy

Stretch

End-of-quarter scoring on the 0.0-1.0 scale.

Improve activation for new sign-ups

Make the support experience deflect predictable issues

Twelve considerations for OKR design.

Twelve sections covered in the body.

What this skill is for

Sandbagged vs aspirational-fantasy vs stretch

Objective design

Key result design

Cascading OKRs across the org

Scoring discipline

Mid-quarter recalibration

The OKR review cadence

OKRs vs roadmap items vs metrics

Common failure modes

The framework: 12 considerations

Closing: OKRs are accountability infrastructure

Nine references that go alongside the SKILL.md.

Three platforms with OKR-relevant infrastructure.

Notion

Mixpanel

BigQuery

Five sister skills that compose with OKR design.

The third of five PM skills closing Direction 7.

Read the SKILL.md on GitHub.

Frequently asked questions.