about 1 year ago
Base Salary
$205k - $225k/yr
Responsibilities
- Lead incident response and establish sustainable on-call practices.
- Develop and maintain self-service observability solutions using modern monitoring tools.
- Create and maintain infrastructure as code for scalable and secure cloud environments.
- Partner with feature teams to architect resilient infrastructure for critical components.
- Design and implement robust CI/CD pipelines with advanced deployment strategies.
- Advocate for best practices in feature design to ensure reliability.
Requirements
- Expertise in leading incident response for high-availability production systems.
- Experience designing highly available deployment architectures across multiple targets.
- Track record of implementing effective monitoring and observability solutions.
- Strong knowledge of AWS cloud services and infrastructure-as-code practices.
- Experience with CI/CD pipelines and automation for reliable deployments.
- Excellent communication skills and experience documenting processes.
