Kanban Service Level Expectations and how to use them in Scrum

(United States)

May 11, 2018

One of the new concepts we introduce in the Kanban Guide for Scrum Teams is the Service Level Expectation, defined as:

An SLE forecasts how long it should take a given item to flow from start to finish within your workflow. The SLE itself has two parts: a period of elapsed days and a probability associated with that period (e.g., "85% of work items will be finished in 8 days or less" which can also be stated as "8 days with 85% confidence/probability").

The term expectation is key here. We're not talking about commitments or promises. While it seems like an SLE is mainly serving the team's stakeholders its primary purpose is actually internally focused. The Development Team is using that expectation as a flow transparency, inspection, and adaptation mechanism. The team can start to actually compare active in-flight work to their SLE and look for items that are at risk of missing the SLE.

As work ages without completing, it becomes more and more likely this work item would not meet the team's SLE. For example, once a work item reaches a point where it's age is now at the point where half of the team's work items have already completed, it's a clear indication that there's more risk for this item than the typical item. We basically learn about the increased risk to specific items the more time passes without them completing. Common sense, no? The idea is that by visualizing these items and that growing risk we can focus the team's tactical inspection and adaptation during the Daily Scrum on tackling these risky items.

Let's look at an example. Let's assume that indeed we have a Development Team that learns from their cycle time scatterplot that 85% of their work items finish in 8 days or less and 50% of the items actually finish within 4 days. They have an item A that has been active for 5 days already and an item B that has been active for 3 days. Which of these items should the team feel more confident they can finish within their 8 days or less SLE? Without further information about where each of the items is in their workflow, they should feel more confident about item B that aged less. Item A has already been active more than it takes for 50% of the cards to finish, so it's quite a strong signal that there's a flow risk to it. Basically, with each day that passes in which a work item doesn't finish, the probability that a work item will not meet its SLE increases”.

Beyond its usefulness for in-Sprint flow focus, SLEs are also useful during Sprint Retrospectives. The "surprise" or "anomaly" of missing your SLE can drive an improvement discussion. I previously wrote about the Sprint Forecast as an expectation and the learning/improvement value of having an expectation that you miss vs having no expectations.

This "no expectations" problem is actually a common concern for Scrum practitioners when they look at Kanban. The Sprint Forecast/Commitment provides that expectation. Kanban without SLEs indeed is missing something. Having WIP limits as expectations improves flow but doesn't help with specific items that aren't flowing well.

How should a Scrum Team come up with their SLE? First of all - The SLE relates to the Development Team. They should figure out what their SLE should be. If possible based on historical cycle times. If there isn't enough data, make a best guess and replace it with empirical data-based information as soon as possible. If it isn't based on historical cycle times, you cannot really expect to make any educated confidence-level determinations (like the one above) based on your SLE.

Also, SLE's aren't SLA (Service Level Agreements). SLA is a loaded term that means different things in different contexts but generally is an external commitment about the service levels a team will provide. As mentioned an SLE is mainly internally focused.

Bottom line - SLEs are used by Scrum Teams to set flow expectations to themselves. SLEs are ideally created based on actual historical cycle time data. They are then used by teams to focus their flow inspection and adaptation effort - during the Sprint in the Daily Scrum and following the sprint in the Sprint Retrospective. They can also be used in the Sprint Review when the team is working with stakeholders that care about the team's cycle time.

Interested to learn more about SLEs and Kanban in a Scrum context? Join a public Professional Scrum with Kanban class or request a private training for your team.

This blog post was originally published on the AgileSparks Blog.

What did you think about this post?

Share with your network

Comments (8)

Erez Morabia

06:49 am May 13, 2018

"with each day that passes in which a work item doesn't finish, the probability that a work item will not meet its SLE increases" - this is based on the assumption that you remove the work item stage (where the work item is located in the workflow) from the equation.
You mentioned if we have work item that its age (4 days our ot SLE of 8 days) is at the point where 50% of the work items were already completed, we have a risk. Do we have a way to calculate this risk in percentage? (suppose the SLE is about 85%)

Alan Larimer

06:06 pm May 13, 2018

Rebranding "agreement" to "expectation" is the same as treating a forecast as a commitment or making a project manager the Product Owner.

yuvalyeret

09:31 pm May 13, 2018

Thanks for the comment Alan. I tried to define what "Expectation" means in our context. In order to change reality we need to both agree on a different definition as well as work to reinforce that different definition. Part of the role of a "Scrum Master" would be to enact the real "Product Owner" role, "Forecast" vs "Commitment" and "SLE" vs "SLA".

yuvalyeret

09:44 pm May 13, 2018

yes, you're right Erez, knowing where the item is in the workflow can help us see if there's real risk. especially if we track the typical cycle time to that point in the workflow and from that point to the finish line.

Regarding the calculation - upon starting the item, without any special knowledge about it, we have a risk of 15% it will miss our SLE since that's what we know about the historical cycle time dataset so far 4 days in, at the point where 50% of the items already finished, we know that this item is in the 50% of the items that wouldn't finish in less than 4 days. In other words its part of the riskiest half of the dataset. items that finish before 4 days have 0% chance to miss the SLE, don't they? so items that don't finish within 4 days have to take on all the risk. I will leave it as an exercise to the reader to figure out the last step of the calculation...

Hope this helps

wilmark johnatty

08:02 pm May 2, 2019

Sounds like SLE needs to be one of your "4 Key Flow Metrics"

Kristine Schindler

02:38 pm October 4, 2019

I'm having trouble understanding if the SLE is a metric for 1 work item or for a set of work items the team has committed to completing in an increment.

Balaji

09:11 am October 16, 2019

Nice Article about SLE, the ageing also can be substantiated with good reasons, in case if the team's learning is a part of work items in work in progress items. team to relook at the reasons and take actions and mitigate Risk

Daniela Gomes

04:18 pm October 19, 2021

Great article about SLE used by Scrum Teams!