Responsibilities
- Design and scale Kubernetes- and Terraform-based infrastructure across customer environments.
- Define standards for networking, security, CI/CD, and multi-region deployments.
- Build and maintain metrics, logging, tracing, dashboards, and SLOs.
- Diagnose and improve performance across distributed systems and AI workloads.
- Support high-performance inference, data pipelines, and large-scale backend services.
- Ensure systems scale reliably under fast-growing and unpredictable workloads.
- Partner with technical leaders on architecture and mentor engineers.
Requirements
- Have 5+ years of experience building large-scale backend or infrastructure systems.
- Know Kubernetes, Terraform, cloud networking, orchestration, and observability tools.
- Bring strong experience in distributed systems, performance, and incident response.
- Work well with customers and teams to deliver reliable solutions.
- Thrive in high-agency environments and set engineering standards.
- Enjoy ambiguity, autonomy, and solving tough infrastructure problems.
Benefits
- Competitive salary plus equity.
- Daily lunches.
- Commuter benefits.
- 401(k).
- Medical, dental, and vision.
- Unlimited PTO.
