Anthropic

Staff+ Software Engineer, Observability

Anthropic

Apply
22 days ago
London, United Kingdom
Staff+
H1B Sponsor

Responsibilities

  • Design and build scalable telemetry ingest and storage pipelines for metrics, logs, traces, and error data.
  • Own and evolve core observability platforms, driving migrations and architectural improvements.
  • Build instrumentation libraries, SDKs, and integrations for high-quality telemetry emission.
  • Drive alerting and SLO infrastructure for monitoring reliability targets.
  • Reduce mean time to detection and resolution through cross-signal correlation and AI-assisted tools.
  • Partner with various teams to ensure observability solutions meet their needs.

Requirements

  • 10+ years of relevant industry experience in large-scale observability or monitoring infrastructure.
  • Deep experience with at least one observability signal area and familiarity with others.
  • Understanding of high-throughput data pipelines and columnar storage engines.
  • Experience with observability platforms like Prometheus, Grafana, or OpenTelemetry.
  • Strong proficiency in at least one of Python, Rust, or Go.
  • Excellent communication skills and a collaborative mindset.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • A collaborative office space.

Tech Stack

ClickHouseGoGrafanaKubernetesPrometheusPythonRust

Categories

AI & MLData EngineeringDevOps