On today's episode we chat with Tom Limoncelli, a site reliability engineering manager at Stack Overflow. Tom talks about his time at places like Bell Labs and Google, how he creates runbooks, and the secret to building a healthy relationship between developers and operations.
Episode Notes
You can check out more of Tom's work and some of his books on his website, Everything SysAdmin.
Tom also wrote a great blog post for our site that explains his method for crafting a positive feedback loop between Dev and Ops using real-time documentation.
You can find Tom on Twitter and check out his books on Sys Admin and Cloud System Administration.