about 4 hours ago
Base Salary
$175k - $205k/yr
Responsibilities
- Design and build autoscaling systems for storage clusters.
- Own the infrastructure-as-code stack using Terraform for AWS deployment.
- Build self-healing automation for health checks and failover.
- Develop CI/CD pipelines and deployment tooling for storage services.
- Implement observability for the storage platform, including metrics and alerting.
- Manage cluster provisioning and upgrades with zero downtime.
- Drive performance and cost optimization across the storage data path.
- Collaborate with product engineering on scalability and load testing.
- Contribute to incident response and lead post-mortems.
Requirements
- Significant experience in building autonomous platform/infrastructure systems.
- Strong software engineering skills in TypeScript/Node.js, Go, or similar languages.
- Deep experience with infrastructure-as-code (Terraform) and AWS services.
- Experience designing autoscaling systems and cluster orchestration.
- Track record of operating data-intensive systems at scale.
- Strong platform engineering fundamentals including SLOs and incident response.
- Comfortable working autonomously in a remote, distributed team.
- Strong understanding of Linux systems and performance profiling.
Benefits
- Generous benefits package including health, dental, and vision insurance.
- Paid holidays and paid time off.
- Fertility treatment benefit.
- 401(k) plan.
- Equity options.