Dezen Technology
All articles
EngineeringApr 11, 20267 min read

SLOs that change how you ship

SLI · SLO · error budget — the three concepts in plain English, and the operational discipline that turns reliability from a values fight into a budget conversation.

SLOs that change how you ship

Service Level Objectives are one of those topics where the difference between textbook knowledge and operational knowledge is enormous. The textbook says “set a target, measure against it.” The reality is: SLOs are the single tool we’ve found that lets engineering and product teams have a non-emotional conversation about “ship faster” vs “stabilize.”

Done right, an SLO is a number you can defend with data, and an error budget you can spend. Done wrong, it’s a vanity dashboard that nobody acts on.

SLI / SLO / error budget — definitions and how budget consumption changes engineering behavior

The three concepts (in plain English)

SLI — what you measure

A Service Level Indicator is the raw signal. “p99 latency on the checkout endpoint.” “Percentage of API requests that return 5xx.” “Time from page request to interactive.” Concrete, observable, comparable over time.

SLO — the target for that signal

The objective is the line you draw on the SLI graph. “P99 latency < 300ms for 99.5% of the time.” The percentage is a service-quality commitment you’re making to your users.

Error budget — the inverse

If your SLO is 99.9% success, your error budget is 0.1%. In a 30-day window, that’s 43 minutes of downtime (or equivalent failed requests). The budget is what you’re ALLOWED to spend — and the entire point is that you spend it deliberately, not accidentally.

The conversation that SLOs unlock

Without an SLO, the “ship faster” vs “stabilize” conversation is a values argument. Product wants velocity; SRE wants reliability; nobody has a way to be objectively right.

With an SLO, the conversation is mechanical. Budget healthy? Ship faster — take more risks, roll out experiments aggressively. Budget depleted? Freeze risky work, pay down reliability debt. The data drives the decision; the team has cover for either path.

Setting your first SLO

  1. Pick the user journey, not the system.“Can the user complete checkout?” is a journey. “Is the database up?” is a system. SLOs are about user-facing outcomes.
  2. Pick TWO indicators for that journey. Usually a latency SLI and a success-rate SLI. One alone is gameable; two together describe quality properly.
  3. Set the SLO at the current performance level, minus a buffer. If your current p99 latency is 280ms, set 300ms. Set targets you’re currently meeting; tighten over time as you improve.
  4. Review monthly. Burn rate, missed minutes, what consumed the budget. Adjust if reality has changed.

Mistakes that turn SLOs into vanity metrics

  • Setting SLOs you can’t enforce.“99.99% uptime” looks great on a vendor pitch but means nothing if no one stops deploys when the budget is gone.
  • Measuring infrastructure, not user experience. Database CPU is not an SLI; checkout success is.
  • Setting one SLO for the whole product. Different surfaces have different reliability needs. Marketing pages can tolerate worse latency than the checkout endpoint.
  • Not reviewing budget burn weekly.The budget is most useful as a leading indicator. If you’re only looking at it once a quarter, you’re looking too late.

How we approach this

For products under our Ongoing Maintenance engagement, we publish SLO dashboards as part of the operating cadence. The monthly review is grounded in budget burn, not gut feel — which is what makes the product/engineering conversation tractable.

Takeaways

  • SLI is what you measure; SLO is your target; error budget is what you spend.
  • Pick user-journey SLIs, not infrastructure metrics.
  • Start at current performance + a buffer. Tighten over time.
  • Budget healthy = ship faster. Budget depleted = stabilize. The data picks.
Keep reading

More from the engine room

AI in QA: where it helps, where it doesn’t

May 27, 2026

AI in QA: where it helps, where it doesn’t

AI augments QA throughput — test generation, triage, visual regression. It doesn’t replace QA judgment: strategy, exploratory testing, and defining correctness stay human.

Read More
Controlling LLM costs in production

May 25, 2026

Controlling LLM costs in production

Four levers cut spend 10x without cutting quality: route by difficulty, cache, trim context, batch and stream. Measure cost-per-feature first; set budget guardrails always.

Read More
RAG vs fine-tuning: which do you actually need?

May 23, 2026

RAG vs fine-tuning: which do you actually need?

Facts → RAG. Behavior → maybe fine-tune. Most business AI features want RAG even when teams ask for fine-tuning. The decision rule and the order to try things in.

Read More
Agentic features in SaaS: the maturity ladder

May 21, 2026

Agentic features in SaaS: the maturity ladder

From manual to autonomous — four levels of autonomy and the guardrails each needs. Match autonomy to the cost of being wrong, not to how impressive it sounds.

Read More
Offline-first mobile: the app that works on the subway

May 19, 2026

Offline-first mobile: the app that works on the subway

The UI never waits on the network. Local DB, sync engine, server — with conflict resolution per data type. The architecture that makes mobile apps feel instant.

Read More
Lift-and-shift vs refactor: how to actually decide

May 17, 2026

Lift-and-shift vs refactor: how to actually decide

Lift-and-shift is fast, cheap to do, expensive to keep. Refactor is months of work with structural upside. The matrix — and why half-finished refactors are the worst path.

Read More
Monolith migration: the strangler-fig playbook

May 15, 2026

Monolith migration: the strangler-fig playbook

The big-bang rewrite is the most consistently bad idea in software. Proxy in front, extract one route at a time, shrink the monolith to nothing. No migration day.

Read More
SOC 2 readiness in plain English

May 13, 2026

SOC 2 readiness in plain English

Five Trust Service Criteria, Security mandatory and the rest optional. Type 1 vs Type 2. The pragmatic 6-month timeline — not the year-long ordeal it’s made out to be.

Read More

Let’s Build the Future Together!

Contact our team today and turn your ideas into reality.

Let’s Discuss
Contact Details : sales@dezentech.com Sy. No:40, Flat No:402, SIRISAMPADHA ARCADE I, Plot no:18-21, behind Union Bank of India, Khajaguda, Hyderabad, Telangana 500104