1 day ago
Base Salary
$170k - $196k/yr
Responsibilities
- Monitor system performance, application health, and infrastructure metrics.
- Provide on-call support for production uptime and customer escalations.
- Manage release upgrades and maintenance activities.
- Lead incident response and resolution efforts, including root cause analysis.
- Implement security best practices and controls in cloud environments.
- Drive continuous improvement initiatives for infrastructure and services.
Requirements
- Bachelor’s degree in computer science, engineering, or related field.
- 5+ years of experience as a Site Reliability Engineer or similar role.
- Hands-on experience with AWS and/or Azure infrastructure management.
- Proficiency in scripting languages such as PowerShell, Python, or Go.
- Strong understanding of CI/CD principles and related tools.
- Experience with containerization technologies like Docker and Kubernetes is a plus.
- Excellent analytical, problem-solving, and communication skills.
