Tag: observability
How eBPF and OpenTelemetry Have Simplified the Observability Function
Overview arguing that OpenTelemetry eBPF Instrumentation (OBI) — combined with OpenTelemetry Injector — removes barriers to full observability by enabling zero-code, kernel-level telemetry for Kubernetes and Linux environments, solving language, legacy, and ...
AI Is Forcing DevOps Teams to Rethink Observability Data Management
As AI coding tools accelerate software delivery, they are also intensifying a problem DevOps and SRE teams have been dealing with for years: the unchecked growth of observability data. In this conversation, ...
Zero Downtime Multicloud Migrations for Observability Control Planes
Most platform teams aren’t deciding whether they’ll run across multiple clouds. They already are, or they’ll be soon. The real question is how to migrate critical systems without turning on-call into a ...
The Observability Bill is Coming Due – and AI Wrote Most of It
Observability has always had a data quality problem. AI coding agents just made it catastrophically worse. Shimmy sits down with the co-founders of Sawmills to dig into why unmanaged telemetry is the ...
How We Got Here: Alert Fatigue to Decision Fatigue
AI and observability reduced alert fatigue, but decision fatigue remains. Decision architecture helps DevOps teams scale operational judgment ...
On-Call Rotation Best Practices: Reducing Burnout and Improving Response
Practical SRE on‑call guide covering rotation models, alert hygiene, runbooks, metrics, compensation, shadowing, and automation to cut pager load and prevent engineer burnout ...
Unlocking Observability by Design With Inferred Schemas
Observability systems generate massive telemetry, but schema drift creates friction. Learn how inferred schemas and OpenTelemetry Weaver restore structure ...
Context Engineering is the Key to Unlocking AI Agents in DevOps
Explore how context engineering is essential for transforming AI agents from experimental prototypes to reliable production tools in DevOps. Understand its impact on automation workflows, accuracy, and scalability ...
Google ADK Opens the Door to AI Agents That Work Inside Your DevOps Toolchain
Google ADK adds integrations for GitHub, GitLab, Jira, MongoDB, and seven observability tools. AI agents can now work inside your DevOps toolchain ...
OpenTelemetry Is Paving the Way for the Rise of the Observability Warehouse
Eric Tschetter, chief architect at Imply and creator of Apache Druid, explains how the rapid adoption of open source OpenTelemetry for instrumenting applications is reshaping modern observability architectures. As telemetry data volumes ...
When DevOps Meets the Cloud: A Real-World Transformation Story
A real-world DevOps and cloud transformation story showing how automation, observability, and cultural change improved reliability and delivery ...
What to do About AI’s Forced Rethink of Reliability in Modern DevOps
As systems become more distributed and AI-driven, traditional uptime metrics are no longer enough. The 2026 SRE Report shows how reliability is shifting toward user experience, speed, and business impact, and how ...

