Promise yourselves to increase Service Reliability in the new year!!!
A recent survey across 400 organizations that practice Site Reliability Engineering report that Application Availability and Uptime are the two most important measures for businesses. Nothing else matters if your application/site is down! They also report that the SRE teams need the following technical skills to effectively manage application availability and uptime.
We, at Appranix, wish that you promise yourselves to bring enhanced application reliability and reduce error rates for your business with better Service Level Indicators (SLI) and automation. Here is a list of some of the curated links on how various organizations use SRE principles to increase systems reliability.
- The Realities of the Job of Delivering Reliability
- Fail at Scale by Ben Maurer
- Embracing Failure: Fault-Injection and Service Reliability
- 10 Years of Crashing Google
- How we break things at Twitter: failure testing
- Reliable Cron across the Planet
- Push our limits — reliability testing at Twitter
- The Verification of a Distributed System by Caitie McCaffrey
- Weathering the Unexpected
- The Remediation Ballet
- SRE Hour: Tech Talks by Box & Yelp
- Simplicity: A Prerequisite for Reliability
Reference:
Based on the SRE market survey by Catchpoint — http://pages.catchpoint.com/2018-SRE-Report-mkty.html"
Find More Blogs at: https://www.appranix.com/resources/blogs/index.html
Contact Appranix
Email: sales@appranix.com
Website: www.appranix.com
Phone: +1 508–656–0756