Site Reliability Engineering – 2 Days

(c) CC0 Public Domain /

Course Description:

Monitoring a service you have deployed is a key aspect of providing quality products to your client, but deciding which parameters to monitor is not that easy. Come learn the basic principles of SRE and how you can monitor your products performance in order to meet your goals.

Note: Every student has assigned to him his own virtual lab environment setup.

What will you learn:

By the end of this course you will learn how what makes a good SLI and learn how to better use SLI, SLO and SLA that drives your business decision. You will go on a path of developing archivable SLO for greater system reliability.

Target audience:

  • Product Owners
  • DevOps Engineers
  • Operations Managers
  • Software Developers
  • Incident and Change Managers
  • System Administrators
  • IT Architects

There are no prerequisites for this course.

Nice to have:

  • Basic GIT knowledge
  • Basic Jenkins knowledge
  • Basic Docker/Kubernetes knowledge


Module 1: Introduction

  • The SRE principles
  • Defining SLOs and how they can help you
  • How you make SLOs work for your business

Module 2: Reliability

  • What reliability means and why we need it
  • SLOs vs SLAs
  • Testing and how to measure reliability
  • Setting you targets (100% is not possible :)
  • How to iterate over your SLOs and targets

Module 3: Trade-offs for reliability

  • How many errors can we afford?
  • Choosing the right trade-off
  • Axes of improvement
  • How to increase reliability

Module 4: Choosing a good SLI

  • Properties of good SLI metrics
  • How to measure SLIs
  • Request/Response/Data processing SLIs
  • Managing complex systems — data aggregation, bucketing
  • Achievable SLO
  • Aspirational SLO
  • Improve!

Module 5: Automation and tools

  • Why automation is good and how much time do we want to spend on it
  • Continuous Delivery concepts
  • Basics of container operation with Docker
  • Revision control with Git
  • Continuous Integration with Jenkins
  • Docker and how it can simplify Continuous Integration pipelines
  • Packaging applications with Docker
  • Continuous automated testing
  • Using Kubernetes to run applications at scale


Additional details:

To attend this course, you need to have:

  • PC/Laptop with internet access
  • Updated web browser


Kubernauts Worldwide Meetup

Join our online sessions at Kubernauts Worldwide Meetup and enjoy free trainings and great presentations from the kommunity!

meet us

Learn about Kubernautic

Kubrnautic is a Platform ready to build Cloud Agnostic Kubernetes Platforms!

Learn more +