Schweitzer Fachinformationen
Wenn es um professionelles Wissen geht, ist Schweitzer Fachinformationen wegweisend. Kunden aus Recht und Beratung sowie Unternehmen, öffentliche Verwaltungen und Bibliotheken erhalten komplette Lösungen zum Beschaffen, Verwalten und Nutzen von digitalen und gedruckten Medien.
Transform enterprise IT by adopting site reliability engineering (SRE) practices that reduce downtime, build resilience, and drive business value. This book is a comprehensive guide designed to help site reliability engineers, DevOps teams, and platform engineers identify, address, and mitigate system weaknesses before they become significant critical failures.
Authors Francesco Sbaraglia and Florian Hoeppner highlight the paradigm shift from IT as a cost center to a core business function, emphasizing the central role of developers and the need for speed and reliability. They detail the challenges of transitioning to SRE, including overcoming cultural resistance and legacy infrastructure limitations, while bringing to the forefront the importance of building resilience in systems and processes. Specific SRE capabilities like chaos engineering, observability, and toil management are explored, along with strategies for successful implementation, including building a Center of Excellence, selecting the right tools, and fostering a culture of collaboration and continuous improvement.
Looking ahead, the book examines emerging trends like Agentic AI SRE Agents, the use of generative AI (GenAI) in SRE and the future evolution of chaos engineering. You'll learn how to embed SRE practices into your existing enterprise tech operating model and unlock tangible business outcomes: reduced downtime, increased resilience, and measurable gains in stability. Additionally, discover how GenAI can support SRE teams in planning, executing, and optimizing reliability experiments and automating toil reduction and continuous improvement efforts.
By the end of this book, you'll know how to apply core SRE practices to strengthen reliability: establishing a chaos engineering practice led by SREs, running reliability-focused "game days," improving observability, troubleshooting failure scenarios, and fortifying the digital resilience of your systems and teams.
What You Will Learn
Who This Book Is For
Professionals, architects, engineers, and practitioners eager to design, plan and implement enterprise system resilience with proven SRE practices.
Francesco Sbaraglia is a distinguished Site Reliability Engineer (SRE) and a recognised expert in the field of Chaos Engineering and DevOps. With an extensive career spanning over two decades, Francesco has garnered a wealth of hands-on experience as a practitioner and innovator, establishing a profound mastery of cutting-edge AIOPS technologies and methodologies.
In addition to his technical prowess, Francesco has distinguished himself as an accomplished author, contributing numerous insightful tech articles and authoritative books across a spectrum of subjects surrounding SRE, Chaos Engineering, operations, and DevOps. Francesco is also an author and public speaker, sharing his insights and best practices in SRE, observability, and chaos engineering at renowned industry conferences, such as SRECon21 and DevOpsCon. He is passionate about combining systems engineering principles with observability tools to ensure seamless operations and improve software engineering practices.
Florian Hoeppner is a seasoned professional technology strategist and advisor for tech operating models. He is an Enterprise Site Reliability Engineer subject-matter -expert and DevOps expert with a deep understanding of tech operating model transformations. Florian is passionate about tech strategy, combined build-run teams, and optimising tech operations, and he has spoken and published extensively on these topics. He created a professional global community in his organisation with more than 500 members, constantly sharing and evaluating the latest around these critical topics. He is also the creator of the EngineeringOps radar, a yearly publication showing tech engineering and operational capabilities. He holds a degree in Media Information Systems and a Master of Science in Digital Media. Florian currently lives in New York and has a blog that offers practical insights into SRE, Chaos Engineering, and DevOps practices and solutions on an enterprise level. He has published the book "Competition as Motivation" with AV Akademikerverlag.
Chapter 1: Introduction to Site Reliability Engineering.
Chapter 2: New Capabilities for the Tech Operating Model.
Chapter 3: SRE in a Legacy Enterprise: Navigating Deep-Seated Challenges and Driving Transformative Change.
Chapter 4: Culture Shift for SRE Adoption.
Chapter 5: Essential Skills for SRE Practitioners.
Chapter 6: SRE Transformation exemplified on two deep dives.
Chapter 7: Scaling SRE: Business Benefits and Value Drivers.
Chapter 8: Tools and Techniques for Scaling SRE.
Chapter 9: Influence of AI and Generative AI in SRE adoption.
Chapter 10: Future and Innovations of Site Reliability Engineerin
Dateiformat: PDFKopierschutz: Wasserzeichen-DRM (Digital Rights Management)
Systemvoraussetzungen:
Das Dateiformat PDF zeigt auf jeder Hardware eine Buchseite stets identisch an. Daher ist eine PDF auch für ein komplexes Layout geeignet, wie es bei Lehr- und Fachbüchern verwendet wird (Bilder, Tabellen, Spalten, Fußnoten). Bei kleinen Displays von E-Readern oder Smartphones sind PDF leider eher nervig, weil zu viel Scrollen notwendig ist. Mit Wasserzeichen-DRM wird hier ein „weicher” Kopierschutz verwendet. Daher ist technisch zwar alles möglich – sogar eine unzulässige Weitergabe. Aber an sichtbaren und unsichtbaren Stellen wird der Käufer des E-Books als Wasserzeichen hinterlegt, sodass im Falle eines Missbrauchs die Spur zurückverfolgt werden kann.
Weitere Informationen finden Sie in unserer E-Book Hilfe.