about 4 hours ago
Base Salary
$230k - $385k/yr
Responsibilities
- Improve systems for validating inference engine releases.
- Enhance release, validation, branching, and deployment processes.
- Develop canary and large-scale validation workflows.
- Strengthen CI, testing, and validation infrastructure.
- Minimize flaky failures from infrastructure instability.
- Automate failure triage, ownership detection, and debugging.
- Collaborate with teams to improve release quality and safety.
- Reduce developer friction in testing and release workflows.
Requirements
- Strong experience with CI/CD systems and testing infrastructure.
- Excitement for high-impact infrastructure affecting production systems.
- Ability to build trusted systems for engineers.
- Strong developer empathy and a focus on improving workflows.
- Demonstrated high ownership and problem-solving skills.
- Comfortable in Python-heavy environments and debugging distributed systems.
- Experience with automation to enhance operational effectiveness.
- Ability to navigate ambiguous, cross-functional operational problems.