So, last Friday night, I decided to turn my infrastructure into code by learning Ansible, and capture the entire demo configuration.
Google Cloud Next '18: What datacentre operators can learn from how Google SRE teams operate
The noughties witnessed many experimental breakthroughs in technology, from the introduction of the iPod to the launch of YouTube. This era also saw a fresh-faced Google, embarking on a quest to expand its portfolio of services beyond search. Much like any highly ambitious, innovative technology initiative, the firm encountered a number of challenges along the way.
In response, Google began evolving a discipline called Site Reliability Engineering (SRE), about which they published a very useful and fascinating book in 2016. SRE and DevOps share a lot of conceptual and an increasing amount of practical DNA; particularly true since cloud software and tooling have now evolved to enable ambitious folks to begin emulating parts of Google’s infrastructure using open source software like Kubernetes.
More like this
Kubernetes’ extraordinary resilience tends to change the emphasis of monitoring from alerting to resource and performance management.
I was sitting on Ansible’s IP bridge this morning, waiting for a call to start, when I realized that wow, Ursula K. LeGuin died, yesterday.