GrepJob
Anthropic

Staff Software Engineer, Kubernetes Platform

Anthropic
Apply
about 4 hours ago
London, United KingdomStaff+
H1B Sponsor

Responsibilities

  • Own, operate, and extend the Kubernetes scheduler for accelerator fleets.
  • Scale the Kubernetes control plane to support large clusters.
  • Design, build, and operate core cluster services like service discovery.
  • Build and maintain custom controllers, operators, and CRDs.
  • Partner with research and training teams to understand workload requirements.
  • Collaborate with cloud providers on feature requirements.
  • Participate in on-call duties and lead incident response efforts.

Requirements

  • Significant software engineering experience with production distributed systems.
  • Proficiency in systems-appropriate languages such as Go, Python, Rust, or C++.
  • Deep hands-on experience with Kubernetes, including scheduler and controllers.
  • Ability to debug complex issues across the technology stack.
  • Track record of designing reliable and correct systems.
  • Strong communication skills and ability to build consensus.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office environment.

Tech Stack

AWSC++ConsulGoGoogle Cloud PlatformKubernetesLinuxPythonRust

Categories

AI & MLData EngineeringDevOps