No detailed description available for "Becoming a Rockstar SRE".
Sprache
Verlagsort
Basel/Berlin/Boston
Großbritannien
Zielgruppe
Editions-Typ
Produkt-Hinweis
Dateigröße
ISBN-13
978-1-80461-456-3 (9781804614563)
Schweitzer Klassifikation
Proffitt Jeremy:
Jeremy Proffitt is passionate about solving problems with an unmatched sense of urgency - the definition of a Site Reliability Engineer. A master of solutions and technology knowledge, Jeremy is a rockstar SRE with AWS Professional Certifications in Architecture and DevOps. He has routinely saved millions in potential lost revenue in his career. In his free time, Jeremy enjoys sending time in his rockstar-appropriate man cave and loves venturing into 3D printing, electronics, and Internet of Things (IoT) projects. Jeremy currently manages a team of top SRE and DevOps talent, driving constant improvement, and is often cited in the company as a visionary of observability and emergency response.Anami Rod:
Rod Anami is a seasoned engineer who works with cloud infrastructure and software engineering technologies. As one of the SREs at the Kyndryl CoE, he coaches other SREs on running IT modernization, transformation, and automation projects for clients worldwide. Rod leads the global SRE guild inside Kyndryl, where he helps plant and grow SRE chapters in many countries. Rod is certified as an SRE, technical specialist, and DevOps engineer professional at the ultimate level. He holds AWS, HashiCorp, Azure, and Kubernetes certifications, among many others. He is passionate about contributing to open source software at large with Node.js libraries.
Table of Contents - SRE Job Role - Activities and Responsibilities
- Fundamental Numbers - Reliability Statistics
- Imperfect Habits - Duct Tape Architecture and Spaghetti Code
- Essential Observability - Metrics, Events, Logs, and Traces (MELT)
- Resolution Path - Master Troubleshooting
- Operational Framework - Managing Infrastructure and Systems
- Data Consumed - Observability Data Science
- Reliable Architecture - Systems Strategy and Design
- Valued Automation - Toil Discovery and Elimination
- Exposing Pipelines - GitOps and Testing Essentials
- Worker Bees - Orchestrations of Serverless, Containers, and Kubernetes
- Final Exam - Tests and Capacity Planning
- First Thing - Runbooks and Low Noise Outage Notifications
- Rapid Response - Outage Management Techniques
- Postmortem Candor - Long-Term Resolution
- Chaos Injector - Advanced Systems Stability
- Interview Advice - Hiring and Being Hired
- Appendix A The Site Reliability Engineer Manifesto
- Appendix B The 12-Factor App Questionnaire