All thoughts and musings
Engineering LeadershipMar 17, 2026 · 6 min read

Owning the Whole Stack: Dev, Platform Ops, Support, and Data

Splitting dev, platform ops, support, and data into separate fiefdoms feels organized. It manufactures the worst failures. Single ownership fixes that.

OwnershipWhole stack

The cleanest org charts produce the ugliest outages. Development reports here, platform operations there, support sits under a different VP, and data is its own kingdom with its own roadmap. On paper it looks tidy. In practice you've drawn four borders straight through the path a single request travels, and every border is a place where accountability quietly evaporates.

I've spent 20 years in leadership watching this play out, and in the work I do as an interim and fractional CTO, the first thing I usually find isn't a technology problem. It's four teams who each believe their part is fine and the failure must belong to someone else.

The seams are where systems break

Real failures almost never live inside one box. They live in the seams between boxes. A deploy ships clean, the platform autoscaler reacts to a traffic pattern nobody flagged, support starts drowning in tickets that look like a product bug, and the data pipeline silently drops a column three hops downstream. Every individual team is technically correct, and the customer is still down.

When those four functions answer to four different leaders, the failure has no owner. It has a meeting. The incident becomes a negotiation about whose budget pays for the fix, and the root cause, the seam itself, never gets touched because no one is accountable for the seam.

  • Development optimizes for shipping features, not for what those features do to the platform at 3am.
  • Platform ops optimizes for stability, which quietly means saying no to the people shipping features.
  • Support absorbs the gap between the two and is measured on closing tickets, not on killing the cause.
  • Data inherits everyone's schema decisions last and has the least power to change them.
When four functions answer to four leaders, a failure in the seams between them has no owner. It has a meeting.

What single ownership buys you

Put development, platform operations, technical support, and data systems under one accountable owner and the physics change. The trade-offs that used to be cross-departmental wars become one person's call. Should we slow a release to harden the pipeline? Is this surge of tickets a symptom of an architecture choice, not a training gap? Those questions stop bouncing between orgs and get answered.

I learned this running technology in complex organizations, as a healthcare EMR CTO where a dropped record is a patient-safety event, and later as a CPTO in online learning where support volume and platform behavior were the same story told twice. The migrations I'm proudest of were sub-one-hour and zero-downtime, and that's not a heroics story. It's what happens when the people writing the code, running the platform, fielding the tickets, and owning the data are pointed at one goal by one person instead of defending four scorecards.

<1hr
Migration windows when ownership is unified, not negotiated
0
Downtime, because dev and ops plan the cutover as one team
4
Functions, one accountable owner across the request path

Coordination is not ownership

The usual objection is that you can fix this with better process, a shared dashboard, a steering committee, a post-incident review template. You can't. Coordination layers describe the seam; they don't own it. A committee can document that the autoscaler and the deploy and the schema change collided, then politely watch it collide again next quarter. Ownership means one person whose name is on all four outcomes and who can move resources across them without asking permission.

Where this is heading

The pressure is only going up. Agentic systems and AI-assisted pipelines blur the lines between code, infrastructure, support, and data even further, a model that misbehaves is simultaneously a dev bug, an ops incident, a support spike, and a data-quality failure, often all within the same minute. Org charts that pretend those are separate problems will keep manufacturing outages that have no owner, just a longer thread of people explaining why it wasn't theirs. The organizations that win the next decade will treat the whole technology surface as one accountable system rather than four adjacent ones. Stop optimizing the boxes. Start owning the seams between them.

Keep reading
Engineering Leadership · Jul 22, 2024

On-Prem to Cloud-Native With Under an Hour of Downtime

AI-Native · May 28, 2026

Automating Your Support Queue Is the Worst Use of AI Your Company Will Make This Year

Metrics & DORA · Nov 14, 2024

Measuring Engineer Productivity Without Breaking Trust