Staff+ Software Engineer, Observability
Anthropic
22 days ago
London, United Kingdom
Staff+
H1B Sponsor
Responsibilities
- Design and build scalable telemetry ingest and storage pipelines for metrics, logs, traces, and error data.
- Own and evolve core observability platforms, driving migrations and architectural improvements.
- Build instrumentation libraries, SDKs, and integrations for high-quality telemetry emission.
- Drive alerting and SLO infrastructure for monitoring reliability targets.
- Reduce mean time to detection and resolution through cross-signal correlation and AI-assisted tools.
- Partner with various teams to ensure observability solutions meet their needs.
Requirements
- 10+ years of relevant industry experience in large-scale observability or monitoring infrastructure.
- Deep experience with at least one observability signal area and familiarity with others.
- Understanding of high-throughput data pipelines and columnar storage engines.
- Experience with observability platforms like Prometheus, Grafana, or OpenTelemetry.
- Strong proficiency in at least one of Python, Rust, or Go.
- Excellent communication skills and a collaborative mindset.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- A collaborative office space.
Tech Stack
ClickHouseGoGrafanaKubernetesPrometheusPythonRust
Categories
AI & MLData EngineeringDevOps