Fault tolerance, anomaly detection, and anticipation patterns by Jon Allspaw at QConLondon 2012

Jon Allspaw (@allspaw) from Etsy talked about the role that Anomaly Detection, Fault Tolerance and Anticipation play in producing highly scalable software systems (Fault tolerance, anomaly detection, and anticipation patterns, slides [PDF, 5MB]).

As head of technical operations at Etsy, whose web traffic is pretty substantial, Jon focused on resilience in software systems: what it is, and how to achieve it.

QConLondon 2012 blog posts
See all QConLondon 2012 blog posts…

Continue reading Fault tolerance, anomaly detection, and anticipation patterns by Jon Allspaw at QConLondon 2012

Five Interview Questions for Hiring DevOps Staff

Over the past seven or eight years I have developed a list of five key interview questions for recruiting staff to software development teams. These five questions have come to stand out as being highly indicative of the candidate’s aptitude for approaching software in [what is now called] a “DevOps” manner, namely, seeing software as the running, evolving system in the Production environment.

Continue reading Five Interview Questions for Hiring DevOps Staff

Site Reliabililty at Scale – Discussion Roundup

There have been several useful discussion threads on the LinkedIn Site Reliability at Scale group (http://www.linkedin.com/groups?home=&gid=4200099) recently:

Continue reading Site Reliabililty at Scale – Discussion Roundup

UK Scale Camp 2010 – Braindump

I’ve just returned from UK Scale Camp 2010 (@scalecampuk), organised by The Guardian (and the indefatigable Michael Brunton-Spall, ). Here are some notes:

Overview

I liked the “unconference” format (no formal programme; attendees vote for their favourite sessions in advance), and ended up in four of the many sessions:

  • DevOps on Windows
  • Log Analysis for Search Results
  • DB Changes without Downtime
  • Handling Errors at Scale

You are invited to ScaleCamp 2010

Very pleased to receive this email today:

From: Michael Brunton-Spall
Sent: 19 November 2010 16:07
To: Matthew Skelton
Subject: You are invited to ScaleCamp 2010 – 10th December at the Guardian offices, London

Hey,

We are so pleased to be able to invite you to Scale Camp 2010 on the 10th December at the Guardian Offices here in London.

I’m looking forward to some great conversations and debate, particularly around DevOps and how that can contribute to scaling a software platform.