
Software Engineer, Infrastructure
Bretton AI3 months ago
San Francisco, CA, USASenior / Staff+
Base Salary
$168k - $213k/yr
Responsibilities
- Own and evolve Kubernetes infrastructure, including cluster management and container security policies.
- Design and implement progressive delivery pipelines with automated rollbacks and deployment health validation.
- Build and maintain observability infrastructure in Datadog, including dashboards and distributed tracing.
- Drive incident response for high-severity outages and model capacity needs for AI inference.
- Architect and automate secure infrastructure using Infrastructure-as-Code for cloud deployments.
- Maintain and improve infrastructure controls for SOC 2 compliance.
- Lead customer engagements for enterprise rollouts and mentor mid-level engineers.
Requirements
- 8+ years in infrastructure engineering or DevOps at high-growth companies.
- Experience with Docker and Kubernetes, including production cluster management.
- Proven track record of architecting and operating AWS, GCP, or Azure at an enterprise scale.
- Experience with observability platforms, preferably Datadog.
- Strong background in Infrastructure-as-Code and safe deployment practices.
- Strong programming skills in Python.
Benefits
- $168k - $213k + equity.
- Comprehensive healthcare and 401k matching.
- 15 days PTO + holidays and unlimited sick days.
- Flexible leave options.
- DoorDash and Uber home coverage for late work.