6 months ago
Stockholm, Sweden +2 moreSenior / Staff+
H1B Sponsor
Base Salary
$150k - $350k/yr
Responsibilities
- Identify architectural changes to improve reliability, performance, and availability.
- Foster a culture of reliability across Modal’s engineering organization.
- Design and implement key operational processes such as deployments, upgrades, rollbacks, and postmortem review.
- Join a core engineering team and participate in on-call rotation, responding to production incidents.
- Build monitoring systems that ensure the highest quality service for our customers.
- Debug production issues across all services and levels of the stack.
Requirements
- 5+ years of experience writing high-quality production code.
- 2+ years of on-call experience for critical production services.
- Strong cloud skills, and deep familiarity with at least one hyperscaler cloud (AWS preferred).
- Familiarity with auto scaling, fleet management, and capacity planning at scale.
- Experience owning and scaling Kubernetes clusters to thousands of nodes is a plus.
- Experience with systems safety research (e.g. STAMP) and control theory is a plus.
- Ability to work in-person in our NYC, SF, or Stockholm offices.
- Ability to participate in on-call rotation and respond to production incidents.
