Members of the SRE team explain how their engagement with the entire software lifecycle has enabled Google to build, deploy, monitor, and maintain some of the largest software systems in the world.
Subjects
Systems engineering, Reliability (Engineering), Management, Internet industry, Google (Firm), SRE, reliability, engineering, Computer software, testing, Systems engineering--management, Internet industry--management, Internet industry--united states--management, Computer engineering, Hd9696.8.u64 g6666 2016
Places
United States
Edition | Availability |
---|---|
Site Reliability Engineering: How Google Runs Production Systems. 2016, O'Reilly Media, Inc. Paperback, in English. First edition. ISBN 149192912X, 9781491929124 | WorldCat |
Book Details
Table of Contents
Introduction. The production environment at Google, from the viewpoint of an SRE
Principles. Embracing risk
Service level objectives
Eliminating toil
Monitoring distributed systems
The evolution of automation at Google
Release engineering
Simplicity
Practices. Practical alerting from time-series data
Being on-call
Effective troubleshooting
Emergency response
Managing incidents
Postmortem culture: learning from failure
Tracking outages
Testing for reliability
Software engineering in SRE
Load balancing at the frontend
Load balancing in the datacenter
Handling overload
Addressing cascading failures
Managing critical state: distributed consensus for reliability
Distributed periodic scheduling with Cron
Data processing pipelines
Data integrity: what you read is what you wrote
Reliable product launches at scale
Management. Accelerating SREs to on-call and beyond
Dealing with interrupts
Embedding an SRE to recover from operational overload
Communication and collaboration in SRE
The evolving SRE engagement model
Conclusions. Lessons learned from other industries.
Edition Notes
Includes bibliographical references (pages 501-512) and index.
Source records
marc_openlibraries_sanfranciscopubliclibrary MARC record
Better World Books record
Internet Archive item record
Promise Item
ISBNdb
December 20, 2023 | Edited by ImportBot | import existing book |
August 24, 2020 | Edited by ImportBot | import existing book |
December 14, 2019 | Edited by l9i | Link to the online version |
December 14, 2019 | Edited by l9i | Added new cover |
July 19, 2019 | Created by MARC Bot | import new book |