17 days ago
Base Salary
$170k - $196k/yr
Responsibilities
- Monitor system performance, application health, and infrastructure metrics.
- Provide on-call support for production uptime and customer escalations.
- Release upgrades and perform maintenance activities including hotfixes.
- Lead incident response and resolution efforts, conducting root cause analysis.
- Implement security best practices and controls in cloud environments.
- Drive continuous improvement initiatives to enhance infrastructure reliability.
Requirements
- Bachelor’s degree in computer science, engineering, or related field.
- 5+ years of experience as a Site Reliability Engineer or similar role.
- Hands-on experience in designing and managing AWS and/or Azure infrastructure.
- Proficiency in scripting languages such as PowerShell, Python, or Go.
- Strong understanding of CI/CD principles and experience with relevant tools.
- Experience with containerization technologies like Docker and Kubernetes is a plus.
- Excellent analytical, problem-solving, and communication skills.
