3 days ago
Base Salary
$119k - $170k/yr
Responsibilities
- Own the reliability of a large-scale cloud service by collaborating with engineering and network teams.
- Develop and operate end-to-end observability and incident tooling to manage SLOs and improve system detection.
- Participate in on-call rotations to lead incident response and perform troubleshooting.
- Build and maintain everything-as-code for service lifecycle management.
- Continuously improve platform hygiene through upgrades and performance tuning.
Requirements
- US Citizenship is required due to the nature of assigned customers.
- 5+ years of industry experience in software engineering, infrastructure software, or platform engineering.
- Proficiency in at least one programming language such as Python, Bash, or Go.
- Strong Linux/Unix systems fundamentals and understanding of networking protocols.
- Proven experience operating production services and participating in on-call rotations.
Benefits
- Various health plans.
- Time off plans for vacation and sick time.
- Parental leave options.
- Retirement options.
- Education reimbursement.
- In-office perks, and more.
