9 days ago
San Francisco, CA, USAMid Level / Senior
Base Salary
$130k - $160k/yr
Responsibilities
- Design, deploy, and maintain Kubernetes-based platforms for mission-critical applications.
- Harden infrastructure for resilience and build self-healing systems with minimal downtime.
- Develop observability pipelines to provide insights into cluster health and cost.
- Collaborate with engineering teams to ensure application requirements and SLAs are met.
- Champion best practices in software reliability, security, and scalability.
Requirements
- 3+ years of experience in Infrastructure and/or Site Reliability engineering.
- Bachelor of Science in a related discipline such as Computer Science or Information Technology.
- Experience deploying and maintaining production Kubernetes clusters.
- Familiarity with infrastructure-as-code tools like Terraform or Ansible.
- Proficiency in Python or another scripting language.
- Understanding of cloud spend optimization and cost management best practices.
Benefits
- Compensation package includes equity and robust benefits.
- High-quality company-subsidized healthcare, disability, and life insurance.
- 401(k) retirement planning.
- Flexible PTO.
- Free on-site catered meals.
