about 4 hours ago
Responsibilities
- Own the design, build, and operation of Kubernetes cluster management tooling.
- Build developer-facing tooling and workflows for improved Kubernetes interactions.
- Deliver new compute capabilities such as cron scheduling and automated right-sizing.
- Drive operational excellence by automating toil and improving incident response.
- Collaborate with Security, Reliability, and Observability teams to meet performance standards.
Requirements
- 5+ years of software engineering experience, including 3+ years with Kubernetes or similar systems.
- Hands-on experience with AWS and/or GCP infrastructure services in a production environment.
- Ability to design, implement, and operate distributed infrastructure systems.
- Experience with the CNCF ecosystem and applying these tools to infrastructure problems.
- Proven ability to apply AI tooling to improve automation and operational efficiency.