Table of Contents
- SRE Job Role – Activities and Responsibilities
- Fundamental Numbers – Reliability Statistics
- Imperfect Habits – Duct Tape Architecture and Spaghetti Code
- Essential Observability – Metrics, Events, Logs, and Traces (MELT)
- Resolution Path – Master Troubleshooting
- Operational Framework – Managing Infrastructure and Systems
- Data Consumed – Observability Data Science
- Reliable Architecture – Systems Strategy and Design
- Valued Automation – Toil Discovery and Elimination
- Exposing Pipelines – GitOps and Testing Essentials
- Worker Bees – Orchestrations of Serverless, Containers, and Kubernetes
- Final Exam – Tests and Capacity Planning
- First Thing – Runbooks and Low Noise Outage Notifications
- Rapid Response – Outage Management Techniques
- Postmortem Candor – Long-Term Resolution
- Chaos Injector – Advanced Systems Stability
- Interview Advice – Hiring and Being Hired
- Appendix A – The Site Reliability Engineer Manifesto
- Appendix B – The 12-Factor App Questionnaire

