Get in Touch

Course Outline

Introduction

  • How SRE bridges traditional IT and software development.
  • The need for automation and observability
  • The roles of software engineers versus system administrators.
  • Site Reliability Engineers versus DevOps engineers.

Overview of an IT System

  • System architecture, including on-premise and cloud environments.

Overview of SRE Principles and Practices

  • Infrastructure as Code.
  • The role of containerization and orchestration (e.g., Docker, Kubernetes).
  • Continuous Integration, Continuous Deployment, and Continuous Delivery.
  • Observability.

Evaluating an IT System

  • Assessing team and organizational resources.
  • Mapping out systems and processes.
  • Estimating the potential impact of SRE.
  • The role of the software engineering team.
  • The role of the operational team.
  • The role of management.

Maintaining System Reliability

  • Describing and measuring desired service reliability.
  • Understanding Service Level Objectives (SLOs).
  • Understanding Service Level Indicators (SLIs) and Service Level Agreements (SLAs).
  • Working with Error Budgets.
  • Developing SLOs.

Optimizing System Administration

  • Setting up a development environment.
  • Evaluating SRE tools.
  • Prioritizing tasks for automation.
  • Writing software.

Deploying "Infrastructure as Code"

  • Testing and iterating code.
  • Creating anti-fragile systems.
  • Learning from failure.

Monitoring a System

  • Observing system performance.
  • SRE tools and techniques.

The Future of SRE

Summary and Conclusion

Requirements

  • A foundational understanding of IT infrastructure.
  • A general awareness of the software development lifecycle.
  • Programming or scripting experience in any language.

Audience

  • Developers
  • System Administrators
  • Software Architects
  • DevOps Engineers
  • IT Managers
 21 Hours

Number of participants


Price per participant

Testimonials (7)

Upcoming Courses

Related Categories