An interview with one of our Technical Interns here at Opsview.
What data center operators can learn from Google SRE teams
The noughties witnessed many experimental breakthroughs in technology, from the introduction of the iPod to the launch of YouTube. This era also saw a fresh-faced Google, embarking on a quest to expand its portfolio of services beyond search. Much like any highly ambitious, innovative technology initiative, the firm encountered a number of challenges along the way.
In response, Google began evolving a discipline called Site Reliability Engineering (SRE), about which they published a very useful and fascinating book in 2016. SRE and DevOps share a lot of conceptual and an increasing amount of practical DNA; particularly true since cloud software and tooling have now evolved to enable ambitious folks to begin emulating parts of Google’s infrastructure using open source software like Kubernetes.
Does you company have too many tools and inconsistent monitoring data? Here are three keys to monitoring IT from a single pane of glass.
We want to learn and keep pushing forward our monitoring capabilities to solve monitoring challenges as they emerge.