Site Reliability Engineer
Five9
about 2 months ago
Remote, United States
Mid Level / Senior
H1B Sponsor
Base Salary
$72k - $190k/yr
Responsibilities
- Design and implement comprehensive observability dashboards for monitoring.
- Establish and maintain SLIs, SLOs, and error budgets for services.
- Build alerting systems to proactively identify and resolve issues.
- Participate in on-call rotations and lead incident response efforts.
- Maintain CI/CD pipelines for cloud and on-premise deployments.
- Develop infrastructure using tools like Terraform and Ansible.
- Ensure security scanning systems are in place and review vulnerabilities.
- Monitor and optimize cloud resource usage and costs.
- Build and maintain common services like notification systems and caching layers.
Requirements
- 3+ years managing large-scale production environments.
- Comfortable with 24/7 on-call responsibilities and incident response.
- Strong Linux/Unix system administration skills.
- Proficiency in at least two programming languages such as Python or Java.
- Experience with AWS, GCP, or Azure infrastructure and services.
- Hands-on experience with Docker and Kubernetes.
- Experience defining and maintaining service level objectives.
- Understanding of error budget concepts and implementation.
Benefits
- 100% coverage of employee health, dental, and vision insurance.
- Access to a mental health support platform.
- Generous employee stock purchase plan.
- Paid Time Off, company paid holidays, and 12 weeks paid parental leave.
Tech Stack
AnsibleAWSAzureDockerGitGoogle Cloud PlatformGrafanaJavaKubernetesPHPPrometheusPythonTerraform
Categories
DevOpsSecurity