Course Outline
SRE Anti-patterns
- Identifying counterproductive practices
- Recognizing the impact of anti-patterns on reliability
- Best practices and corrective alternatives
SLO as a Proxy for Customer Satisfaction
- Defining Service Level Indicators (SLIs) and Service Level Objectives (SLOs)
- Managing error budgets and balancing innovation with reliability
- Understanding limits of distributed systems
Building Secure and Reliable Systems
- Designing for fault tolerance and resilience
- Integrating security into reliability engineering
- Scalability and data protection strategies
Full-stack Observability
- Instrumentation and metrics collection
- Distributed tracing and synthetic monitoring
- Observability-driven development
Platform Engineering and AIOps
- Platform-centered engineering approaches
- Automation and orchestration in SRE
- Leveraging DataOps and operational intelligence
Incident Management in SRE
- Roles and responsibilities in incident response
- Applying frameworks such as OODA
- Automated remediation and AI/ML-assisted resolution
Chaos Engineering
- Principles and strategies for resilience testing
- Planning and executing “game day” exercises
- Learning from controlled failure experiments
SRE as a Pure Form of DevOps
- Integrating SRE into DevOps workflows
- Cultural alignment and collaboration practices
- Driving organizational transformation through SRE
Post-class Exercises
- Large-scale system design case studies
- Advanced instrumentation and monitoring scenarios
- Real-world reliability problem-solving
Review and Exam Preparation
- Final review of the DevOps Institute SRE Practitioner syllabus
- Sample questions and practice tests
- Exam-taking strategies and recommendations
Summary and Next Steps
Requirements
- Understanding of core Site Reliability Engineering principles
- Experience with DevOps practices and related tools
- Familiarity with system monitoring, incident management, and automation
Audience
- SRE professionals seeking DevOps Institute SRE Practitioner certification
- DevOps engineers aiming to expand into reliability-focused roles
- Operations leaders responsible for reliability strategy and execution
Testimonials (5)
En general todo, el ver y realizar la aplicación directa de los conocimientos trae más experiencia con un entorno real de funcionamiento, la aparición de problemas reales y la aplicación de soluciones con el instructor apoyando con su experiencia fue de gran ayuda para un aprovechamiento más completo de los conocimientos expuestos
Alberto Barragan Espinosa - MIRACLE BUSINESS NETWORK
Course - DevSecOps Practitioner (DSOP)®
High level of commitment and knowledge of the trainer
Jacek - Softsystem
Course - DevOps Engineering Foundation (DOEF)®
The break down of what DevOps can do. Possible Automation Integration.
Adeyinka Adekoya - NTPF
Course - Continuous Testing Foundation (CTF)®
working with DevOps Toolchain
Kesh - Vodacom
Course - DevOps Foundation®
new information