Engineering posts about OpenTelemetry
Curated summaries and key learnings for engineers working with OpenTelemetry.
Governance-Aware Agent Telemetry for Closed-Loop Enforcement in Multi-Agent AI Systems
The article presents Governance-Aware Agent Telemetry (GAAT), a novel architecture designed to enhance the observability and enforcement capabilities of multi-agent AI systems. Traditional...
Building a high-volume metrics pipeline with OpenTelemetry and vmagent
This article outlines a comprehensive approach to migrating a high-volume metrics pipeline from StatsD to OpenTelemetry and Prometheus. It discusses the challenges faced during the migration, such as...
From Custom to Open: Scalable Network Probing and HTTP/3 Readiness with Prometheus
The article outlines Slack's transition to HTTP/3 and the challenges faced due to the lack of client-side observability with existing monitoring tools. It highlights the development of QUIC support...
Autonomous Observability at Pinterest (Part 1 of 2)
The article outlines Pinterest's journey towards enhancing its observability tools by integrating AI-driven solutions and the Model Context Protocol (MCP). It highlights the challenges posed by...