
Site Reliability Engineer
Basata Incabout 5 hours ago
Tempe, AZ, USAMid Level / Senior
Responsibilities
- Own the reliability, availability, and performance of the production platform.
- Define service level objectives (SLOs) and build observability to measure them.
- Establish end-to-end incident response practices, including triage and resolution.
- Design and build next-generation infrastructure and deployment systems.
- Reduce operational toil through automation.
- Collaborate with engineers to improve service operability and resilience.
- Set operational culture and engineering standards for reliability.
Requirements
- Strong software engineering fundamentals with experience in Java, Python, and TypeScript.
- Experience running production systems with containerized services and cloud infrastructure.
- Depth in observability and incident response, including monitoring and alerting.
- Ability to understand and analyze unfamiliar codebases for reliability.
- Experience in designing for reliability at the architecture level.
- Calm judgment during incidents and a focus on continuous improvement.
- Comfort with defining practices from the ground up in a greenfield environment.
Benefits
- Drive real impact in healthcare by improving clinic operations.
- Shape meaningful product decisions and user experiences.
- High ownership and trust to lead and innovate.
- Work with purpose on tools that solve real healthcare problems.