Flagship Skill · OKR design
The OKR design skill.
OKRs are accountability infrastructure.
A senior product leader's playbook for OKR design as actually shipped, not as conference-talk theory. Outcome statements that drive decisions, key results that measure the right thing, scoring discipline, mid-quarter recalibration, and the difference between sandbagged OKRs (always 100%), aspirational fantasy (always 30%), and stretch OKRs (genuine ambition with quarterly accountability).
Audience: senior PMs, product directors, engineering leaders, executives setting org-wide OKRs, in-house teams operating in OKR-driven cultures.
What this skill is for
The PM suite, grouped by where work happens.
OKR design sits in strategy and planning: the outcome-target discipline that ladders down from company strategy into team commitments. Distinct from roadmap-planning (which sequences outputs).
Upstream: Discovery & Strategy
- discovery-research-synthesis
One-off research synthesis.
- jtbd-framing
Jobs-to-be-Done framing technique.
- user-feedback-aggregation
Continuous feedback streams.
- ux-research
Structured research projects.
Strategy & Planning
- okr-design (this skill)
Outcome targets for the quarter.
- roadmap-planning
Initiatives sequenced by priority.
- pm-spec-writing
Per-piece spec discipline.
Execution
- experiment-design
Rigorous A/B testing.
- feature-flagging
Rollout mechanics.
- beta-program-management
Beta cohorts that produce signal.
- feature-launch-playbook
Launch as discipline.
Measurement
- product-analytics-setup
Instrumentation discipline.
- experimentation-analytics
Reading experiment results.
- data-warehouse-experimentation
Warehouse-native experimentation.
- experimentation-platform-orchestrator
Platform decision.
The keystone distinction
Three positions. Both extremes are failure modes.
Failure mode
Sandbagged
OKRs designed to hit 100%. Targets at or below current trajectory. No ambition, no learning. Most "OKRs we hit every quarter" sets fall here.
Failure mode
Aspirational fantasy
OKRs nobody can hit. 1000% growth in 90 days. Demoralizing, performative, ignored after week 2.
The discipline
Stretch
Genuine ambition with quarterly accountability. Designed to hit 60-70% on average across KRs. Hits and misses both teach something.
The OKR scorecard
End-of-quarter scoring on the 0.0-1.0 scale.
End of quarter scorecard
Illustrative example. Average score 0.77 (in the stretch zone).
Improve activation for new sign-ups
Obj: 0.72- KR1First-week activation rate from 32% to 45% (41%)0.69
- KR2Median time-to-first-value from 4.2 days to under 2 days (2.6 days)0.73
- KR3Onboarding flow completion rate to 80% (76%)0.75
Make the support experience deflect predictable issues
Obj: 0.82- KR1Reduce ticket volume on top 3 deflection categories by 40% (31%)0.78
- KR2Self-service resolution rate from 22% to 38% (33%)0.69
- KR3Maintain CSAT at or above 4.4/5 (4.5/5)1.00
Metrics shown are illustrative.
The framework
Twelve considerations for OKR design.
- 01Stretch, not sandbag or fantasy
- 02Objectives are outcomes, not outputs
- 03Few objectives, well-focused (2-4)
- 04Key results are measurable
- 05KRs outcome-aligned
- 06KRs within team influence
- 073-5 KRs per objective
- 08Cascading explicit but not over-constrained
- 09Scoring honest, not rounded
- 10Recalibration rare
- 11Weekly + mid-quarter + end-of-quarter cadence
- 12OKRs separate from roadmap and metrics
What is in the skill
Twelve sections covered in the body.
01
What this skill is for
Outcomes (results to be achieved). Distinct from roadmap-planning (outputs). Pairs with feature-launch-playbook, product-analytics-setup.
02
Sandbagged vs aspirational-fantasy vs stretch
The keystone framing. 60-70% average score target.
03
Objective design
Outcome-focused, specific to quarter, inspiring without fantasy. Few in number (2-4 per team).
04
Key result design
Measurable, outcome-aligned, within team influence, time-bounded. 3-5 KRs per objective.
05
Cascading OKRs across the org
When to cascade strictly vs loosely. The middle path with explicit ladders.
06
Scoring discipline
0.0-1.0 scale. The 60-70% target. Scoring honesty. The compensation question.
07
Mid-quarter recalibration
When to recalibrate (rarely). When NOT to (uncomfortable, off-track, easier feels achievable).
08
The OKR review cadence
Weekly + mid-quarter + end-of-quarter + quarterly retrospective.
09
OKRs vs roadmap items vs metrics
Three different concepts. OKRs are outcomes; roadmap is outputs; metrics are continuous measurements.
10
Common failure modes
11+ patterns: always 100%, always 30%, output KRs, OKR proliferation, generous scoring, compensation coupling.
11
The framework: 12 considerations
Stretch not sandbag or fantasy, outcome objectives, measurable KRs, cascading explicit, scoring honest, recalibration rare, cadence kept.
12
Closing: OKRs are accountability infrastructure
Quarterly accountability when designed well; ceremony when designed badly.
Reference files
Nine references that go alongside the SKILL.md.
references/objective-design-patterns.md
Outcome-vs-output distinction. Strong vs weak objective characteristics. Worked examples across domains. The few-objectives discipline.
references/key-result-design-patterns.md
Four characteristics. Strong vs weak KRs. The 3-5 KR rule. Baseline and target setting. Leading vs lagging KRs.
references/cascading-okrs-decisions.md
When to cascade strictly vs loosely. The middle path. Cascading anti-patterns. Honest disclosure about cascading difficulty.
references/scoring-discipline.md
0.0-1.0 scale. The 60-70% target. Scoring honesty. What 100% and 30% mean. The compensation question.
references/mid-quarter-recalibration.md
When to recalibrate vs adapt tactics. Strategic shift vs uncomfortable OKRs. The recalibration discipline.
references/review-cadence-templates.md
Weekly check-ins, mid-quarter review, end-of-quarter scoring, quarterly retrospective. Format and time investment per cadence.
references/okrs-vs-roadmap-vs-metrics.md
Three concepts and their relationships. Common conflations. The complete picture across all three.
references/okr-anti-patterns.md
12+ anti-patterns including OKR-as-roadmap, sandbagging, fantasy, vanity metrics, OKR theater, compensation coupling.
references/common-okr-failures.md
15+ failure patterns with diagnoses and cures. The cross-cutting ceremony-vs-accountability pattern.
Pairs with these platforms
Three platforms with OKR-relevant infrastructure.
The skill is platform-agnostic. These platforms ship workflows that fit OKR programs: Notion (OKR tracking docs and review templates), Mixpanel (KR measurement and tracking), BigQuery (KR data pipelines for harder-to-measure outcomes).
Notion-centric teams
Notion
Briefs as a queryable database
Open the pageProduct teams and analysts asking questions of product event data
Mixpanel
Mixpanel's official hosted MCP for product analytics
Open the pageData teams, PMs, and analysts running warehouse-native experimentation and analytics
BigQuery
Google's official managed MCP for BigQuery
Open the page
Bridges to other PM-suite skills
Five sister skills that compose with OKR design.
Outputs scope
roadmap-planningRoadmap is outputs (features and initiatives sequenced); OKRs are outcomes. Same quarter has both; they answer different questions.
Post-ship execution
feature-launch-playbookLaunch discipline; OKRs may include KRs about post-launch outcomes. The two compose for major launches.
Measurement infrastructure
product-analytics-setupAnalytics provide the metrics KRs depend on. OKRs cannot be measured if metrics are not instrumented.
Quantitative validation
experiment-designExperiments validate that specific initiatives produce OKR outcomes. The two compose for outcome-driven testing.
Cross-team orchestration
integration-orchestratorCross-team OKRs require coordination. Integration orchestration handles the dependencies and handoffs.
Direction 7 closes
The third of five PM skills closing Direction 7.
OKR design is the third of five PM skills shipped together in Direction 7 Dispatch B. Together with discovery-research-synthesis, jtbd-framing, beta-program-management, and user-feedback-aggregation, plus the Tier 2 content suite (Dispatch A), Direction 7 closes with 9 new skills total.
The catalog now carries 86 flagships across creative direction, content, design, SEO, project management, marketing, and operations.
Open source under MIT
Read the SKILL.md on GitHub.
The skill source lives in the rampstackco/claude-skills repository. MIT licensed.
Frequently asked questions.
- What does 'sandbagged vs aspirational fantasy vs stretch' mean?
- Sandbagged: OKRs designed to hit 100%. Targets at or below current trajectory. No ambition, no learning, no signal. Aspirational fantasy: OKRs nobody can hit. 1000% growth in 90 days. Demoralizing, ignored after week 2. Stretch: genuine ambition with quarterly accountability. Designed to hit 60-70% on average across KRs. Hits and misses both teach something. The litmus test: track average score over multiple quarters. 95%+ signals sandbagging; below 30% signals fantasy; 50-75% range is stretch territory.
- What makes a key result strong?
- Four characteristics: measurable (specific number or testable threshold), outcome-aligned (achieving the KR moves the objective forward), within team influence (the team can affect it through their work), time-bounded (measurement window is the quarter). 3-5 KRs per objective is the sweet spot. Single KR is fragile; 6+ dilute focus. Each KR has a baseline (current value) and target (where the team commits to reach). Use the historical-trajectory test: targets should be meaningfully above where the metric would land without focused effort.
- How is okr-design different from roadmap-planning?
- OKRs are outcomes (results to be achieved). Roadmap items are outputs (features and initiatives sequenced). Same quarter often has both; they answer different questions. The OKR is 'improve activation for new sign-ups so more reach value-realization in week one.' The roadmap items contributing are 'onboarding redesign, welcome email sequence revision, activation triage automation.' Common conflation: treating OKRs as roadmap commitments. Mid-quarter tactical changes feel like broken promises if stakeholders interpret OKRs as output commitments.
- When should OKRs be recalibrated mid-quarter?
- The default is OKRs hold for the quarter. Tactics adapt; OKRs do not. Recalibration is the exception, not the rule. Three conditions warrant it: strategic shift (company strategy changed materially), major external disruption (market change, regulatory event, significant outage), or invalidating information (the metric does not actually represent the outcome). NOT triggers for recalibration: the team is not on track (the miss is the signal), the OKR is uncomfortable (stretch is uncomfortable by design), easier OKRs would feel achievable (sandbagging in slow motion).
- Should OKRs be tied to compensation?
- OKRs work best when not directly tied to compensation. When OKRs determine bonuses, sandbagging incentives become severe. Teams set OKRs they know they can hit. The 60-70% target collapses to 95% target because nobody wants to risk their bonus on stretch. Most healthy OKR cultures separate goal-setting from compensation. Compensation flows through performance reviews informed by many inputs (manager assessment, peer feedback, role expectations); OKR scores may be one input but should not be dominant.
- What review cadence works for OKRs?
- Four cadences compose. Weekly check-ins (15-30 min): tactical adjustment, surface blockers, not for adjusting OKRs. Mid-quarter review (60-90 min, week 6): honest trajectory assessment, decision point for any rare recalibration. End-of-quarter scoring (60-90 min): score honestly using the 0.0-1.0 scale; document context. Quarterly retrospective: review the practice itself; what worked, what did not, what should change. Skipping cadences is the failure mode; teams that skip weekly check-ins often discover at end of quarter that they could have course-corrected.