
Production Engineer/Site Reliability Engineer (Shift Basis)
Rubrik
6 days ago
Bengaluru, India
Mid Level / Senior
H1B Sponsor
Responsibilities
- Manage and support critical infrastructure and services in multi-cloud environments.
- Oversee staging and production environments to ensure maximum uptime.
- Implement and maintain observability solutions for monitoring and alerting.
- Lead incident management efforts and coordinate teams for timely resolution.
- Analyze recurring incidents to identify root causes and improve system resilience.
- Design and develop automation tools for proactive issue detection and remediation.
- Maintain and update runbooks to support incident response.
Requirements
- Solid understanding of distributed system concepts.
- Practical experience with production systems in public cloud infrastructures.
- Familiarity with container orchestration platforms, especially Kubernetes.
- Hands-on experience with infrastructure-as-code tools such as CloudFormation and Terraform.
- Strong analytical and problem-solving skills for diagnosing system issues.
- Proficiency in data structures, algorithms, UNIX, networking, and database systems such as MySQL.
- Proficiency in Python programming.
- Excellent verbal and written communication skills.
Tech Stack
Kubernetes, MySQL, Python, Terraform
Categories
Backend, DevOps