Platform Engineering

Book description

Until recently, infrastructure was the backbone of organizations operating software they developed in-house. But now that cloud vendors run the computers, companies can finally bring the benefits of agile custom-centricity to their own developers. Adding product management to infrastructure organizations is now all the rage.

But how's that possible when infrastructure is still the operational layer of the company?

This practical book guides engineers, managers, product managers, and leaders through the shifts that modern platform-led organizations require. You'll learn what platform engineering is—and isn't—and what benefits and value it brings to developers and teams. You'll understand what it means to approach a platform as a product and learn some of the most common technical and managerial barriers to success.

With this book, you'll:

  • Cultivate a platform-as-product, developer-centric mindset
  • Learn what platform engineering teams are and are not
  • Start the process of adopting platform engineering within your organization
  • Discover what it takes to become a product manager for a platform team
  • Understand the challenges that emerge when you scale platforms
  • Automate processes and self-service infrastructure to speed development and improve developer experience
  • Build out, hire, manage, and advocate for a platform team

Publisher resources

View/Submit Errata

Table of contents

  1. Foreword
  2. Preface
    1. A Note from Camille
    2. Who This Book Is For
    3. How to Read This Book
    4. O’Reilly Online Learning
    5. How to Contact Us
    6. Acknowledgments
      1. From Camille
      2. From Ian
      3. From Both of Us
  3. I. The What and Why of Platform Engineering
  4. 1. Why Platform Engineering Is Becoming Essential
    1. Defining “Platform” and Other Important Terms
    2. The Over-General Swamp
    3. How We Got Stuck in the Over-General Swamp
      1. Change #1: Explosion of Choice
      2. Change #2: Higher Operational Needs
      3. Result: Drowning in the Swamp
    4. How Platform Engineering Clears the Swamp
      1. Limiting Primitives While Minimizing Overhead
      2. Reducing Per-Application Glue
      3. Centralizing the Cost of Migrations
      4. Allowing Application Developers to Operate What They Develop
    5. Empowering Teams to Focus on Building Platforms
    6. Wrapping Up
  5. 2. The Pillars of Platform Engineering
    1. Taking a Curated Product Approach
    2. Developing Software-Based Abstractions
      1. The Major Abstractions: Platform Service and Its APIs
      2. Thick Clients
      3. OSS Customizations
      4. Integrating Metadata Registries
    3. Serving a Broad Base of Application Developers
    4. Operating as Foundations
      1. Responsibility for the Full Platform
      2. Supporting the Platform
      3. Operational Discipline
    5. Wrapping Up
  6. II. Platform Engineering Practices
  7. 3. How and When to Get Started
    1. Fostering Platform Cooperation at Small Scale
    2. Creating the Platform Teams That Replace Cooperation
      1. Are the Benefits of Centralizing Ownership Worth the Costs?
      2. Realize the Collective Dynamic Is Gone
      3. Focus on Solving Problems, Not New Technology or Architecture
      4. Beware of New Engineers Coming from Much Bigger Companies
      5. Be Slow to Hire Product Managers (and Avoid Project Managers)
      6. Bonus Problems for Integration/Shared Services Platforms
    3. Transforming a Traditional Infrastructure Organization
      1. Your Whole Engineering Culture Has to Change
      2. Identify the Most Promising Areas to Start
      3. Recognize That You Can’t Just Rub Product Managers on It and Call It a Day
      4. Change the Way You Support Your Products
      5. Update Your Interview Process
      6. Update Your Systems of Recognition and Reward
      7. Don’t Have Too Many Project Managers
      8. Accept That Your Team Will Spend More Time Talking to Customers and Less Time Writing Code
      9. Do the Necessary Restructuring
      10. Keep It Fun!
    4. Wrapping Up
  8. 4. Building Great Platform Teams
    1. The Risks of Single-Focus Platform Teams
      1. Too Much Systems Focus
      2. Too Much Development Focus
    2. The Different Roles of Platform Engineers
      1. Software Engineers
      2. Systems Engineers
      3. Reliability Engineers
      4. Systems Specialists
    3. Hiring and Recognizing Engineers in All Roles
      1. Allow Role-Specific Titles
      2. Avoid Creating a New Software Engineer Level Matrix
      3. Have, at Most, One Level Matrix for the Systems Roles
      4. If Needed, Create a New Software Engineer Interview Process
      5. Vary the Interview Only Slightly for Systems Roles
      6. Interview for Customer Empathy
    4. What Makes a Great Platform Engineering Manager?
      1. Experience Operating Platforms
      2. Experience on Big, Long-Running Projects
      3. Attention to Detail
    5. Other Roles on a Platform Team
      1. Product Managers
      2. Product Owners
      3. Project Managers/Technical Program Managers
      4. Developer Advocates, Technical Writers, and Support Engineers
    6. Creating a Platform Engineering Team Culture
      1. A Platform Split Between a Development and an SRE Team
      2. Strengths and Weaknesses of the Development Team
      3. Merging the Teams and Adding Product Management
      4. Instilling a Platform Engineering Culture
    7. Wrapping Up
  9. 5. Platform as a Product
    1. Product Culture Focuses on the Customer
      1. Characteristics of Internal Customers
      2. Collaborating with Internal Customers
      3. Empathizing with Customers
      4. Escaping the Feature Shop Trap to Serve Customers More Broadly
    2. Product Discovery and Market Analysis
      1. Identifying Potential Platform Products
      2. Evolving Existing Offerings: Smoothing the Edges or Rethinking the Problem
      3. Market Research: Validating New Investments
      4. Product Metrics
    3. Successful Product Execution: Creating a Product Roadmap
      1. Vision: Long Term
      2. Strategy: Middle Term
      3. Goals and Metrics: This Year
      4. Milestones: Quarterly
      5. The Customer-Facing Roadmap
      6. Specification of Features
      7. Practice Makes Perfect
    4. Product Failure Modes
      1. Underestimating the Migration Cost
      2. Overestimating the Change Budget for Users
      3. Overestimating the Value of New Features When Stability Is Poor
      4. Having Too Many Product Managers for the Size of the Engineering Team
      5. Having Product Managers Doing the Work That Engineering Managers Should Be Doing
    5. Wrapping Up
  10. 6. Operating Platforms
    1. On-Call Practices
      1. Why 24x7 On-Call Coverage Matters
      2. Why Merged DevOps?
      3. Getting to a Sustainable On-Call Load
    2. Support Practices
      1. Why Platform Engineers Should Do Support Work
      2. Stage 1: Formalize Support Levels
      3. Stage 2: Separate Noncritical Support from On-Call
      4. Stage 3: Hire a Support Specialist
      5. Stage 4: At Scale with an Engineering Support Organization
    3. Operational Feedback Practices
      1. SLOs and SLAs Are Necessary; Error Budgets Are Optional
      2. Change Management
      3. Synthetic Monitoring
      4. Operational Reviews
    4. Wrapping Up
  11. 7. Planning and Delivery
    1. Planning Long-Running Projects
      1. Clarifying Goals and Requirements in a Proposal Document
      2. Going from Proposal to Action Plan
      3. Avoiding the Long Slog
    2. Bottom-Up Roadmap Planning
      1. “Keep the Lights On” Work
      2. Mandates
      3. System Improvements
      4. Bringing It All Together
    3. Communicating Status with Biweekly Wins and Challenges
      1. The Basics
      2. Why: What’s the Value?
      3. What: Structuring Wins and Challenges Updates
      4. Don’t Forget the Challenges!
      5. Getting Your Team to Write Wins and Challenges
    4. Wrapping Up
  12. 8. Rearchitecting Platforms
    1. Why Rearchitecting Is Preferred to Building a v2
      1. Different Engineering Mindsets
      2. Architectural Needs Drive Mindset Demands
      3. Why It Is Hard to Build v2 Platforms, but Possible to Rearchitect
    2. Addressing Security with Architecture
    3. Guardrails for Rearchitectures
      1. Compatibility
      2. Testing
      3. Lower Environments
      4. Tranches, Slow Rollouts, and Staying a Version Behind
    4. Planning for Rearchitectures
      1. Step 1: Think Big on Final Rearchitecture Goals
      2. Step 2: Factor in Migration Costs
      3. Step 3: Determine Major 12-Month Wins
      4. Step 4: Get Leadership Buy-in, and Be Prepared to Wait
    5. Wrapping Up
  13. 9. Migrations and Sunsetting of Platforms
    1. Migration Antipatterns
    2. Engineering Easier Migrations
      1. Use Product Abstractions That Minimize Glue and Limit Variation
      2. Architect for Transparent Migrations
      3. Track Usage Metadata
      4. Develop Automation to Avoid Clipboards
      5. Document On-Ramps and Off-Ramps
    3. Coordinating Smoother Migrations
      1. Scope, Limit, and Prioritize Planned Changes
      2. Communicate Early and Publicly
      3. Push Through the Final 20%
      4. Use Mandates Sparingly
    4. Sunsetting Platforms
      1. Deciding When to Sunset
      2. Coordinating the Sunsetting
      3. Don’t Be Afraid to Sunset When It Makes Sense
    5. Wrapping Up
  14. 10. Managing Stakeholder Relationships
    1. Stakeholder Mapping: The Power-Interest Grid
    2. Communicating with the Right Transparency
      1. Beware of Oversharing Detail
      2. Use Regular 1:1s Judiciously
      3. Track Expectations and Commitments
      4. Scale Up with Interlock Meetings and Customer Advisory Boards
      5. Increase Communication During Rough Patches
    3. Finding Acceptable Compromises
      1. Be Clear About the Business Impact
      2. Sometimes Say “Yes, with Compromises”
      3. Saying “No” Without Ruining the Relationship
      4. Compromising on Shadow Platforms
    4. Money Troubles: Cost and Budget Management
      1. Step 1: Figure Out Who Will Benefit Tomorrow
      2. Step 2: Group the Work into Teams (Don’t Go Person-by-Person)
      3. Step 3: Come with Suggestions of What to Cut and Strong Opinions About What to Keep
    5. Wrapping Up
  15. III. What Does Success Look Like?
  16. 11. Your Platforms Are Aligned
    1. Alignment to Purpose
      1. Align Teams to Purpose with the Right Mix of People
      2. Align Culture to Purpose with Common Practices
      3. Align Culture to Purpose by Having Teams Collaborate
    2. Alignment of Product Strategy
      1. Foster Cross-Platform Thinking with Independent Product Management
      2. Foster Cross-Platform Architecture with Independent Lead ICs
      3. Seek Feedback from Comments in Platform-wide Customer Surveys
      4. Judiciously Resolve Misalignment with Restructuring
    3. Alignment of Plans
      1. Align Only on Larger Projects, Not on Every Detail
      2. Be Forthright in Confronting Misalignment
      3. Final Alignment Comes from Principled Leadership
    4. Tying It Together: Getting an Organization to Alignment
    5. Wrapping Up
  17. 12. Your Platforms Are Trusted
    1. Trust in How You Operate
      1. Accelerate Trust by Empowering Experienced Leaders
      2. Optimize Growth in Trust by Ordering Use Cases
    2. Trust in Your Big Investments
      1. Seek Technical Stakeholder Buy-in for Trust of Rearchitectures
      2. Seek Executive Sponsorship for Trust of New Products
      3. Maintain Old Systems to Retain Trust
      4. Gaining Trust Requires Flexibility on What Is “Right”
    3. Trust to Prioritize Delivery
      1. Create a Culture of Velocity
      2. Prioritize Projects to Free Up Team Capacity
      3. Challenge Assumptions About Product Scope
    4. Tying It Together: The Case of the Overcoupled Platform
    5. Wrapping Up
  18. 13. Your Platforms Manage Complexity
    1. Managing the Accidental Complexity of Human Coordination
    2. Managing the Complexity of Shadow Platforms
    3. Managing Complexity by Controlling Growth
    4. Managing Complexity Through Product Discovery
    5. Tying It Together: Balancing Internal and External Complexity
      1. Burning Out on OSS Operations
      2. Trying (and Failing) to Change the Game
      3. Shadow Platforms Force a Reset
      4. Executing on the Reset
    6. Wrapping Up
  19. 14. Your Platforms Are Loved
    1. Love Just Works
    2. Love Can Look Like a Hack
    3. Love Can Be Obvious
    4. Tying It Together: Love Makes Your Users Awesome
    5. Wrapping Up: What Is Love? Baby Don’t Hurt Me
  20. Concluding Remarks
  21. Index
  22. About the Authors

Product information

  • Title: Platform Engineering
  • Author(s): Camille Fournier, Ian Nowland
  • Release date: October 2024
  • Publisher(s): O'Reilly Media, Inc.
  • ISBN: 9781098153649