Rubrik

Production Engineer/Site Reliability Engineer (Shift Basis)

Rubrik

Apply
6 days ago
Bengaluru, India
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Manage and support critical infrastructure and services in multi-cloud environments.
  • Oversee staging and production environments to ensure maximum uptime.
  • Implement and maintain observability solutions for monitoring and alerting.
  • Lead incident management efforts and coordinate teams for timely resolution.
  • Analyze recurring incidents to identify root causes and improve system resilience.
  • Design and develop automation tools for proactive issue detection and remediation.
  • Maintain and update runbooks to support incident response.

Requirements

  • Solid understanding of distributed system concepts.
  • Practical experience with production systems in public cloud infrastructures.
  • Familiarity with container orchestration platforms, especially Kubernetes.
  • Hands-on experience with infrastructure management tools like CloudFormation and Terraform.
  • Strong analytical and problem-solving skills for diagnosing system issues.
  • Proficient in data structures, algorithms, UNIX, networking, and database systems like MySQL.
  • Proficient in Python programming.
  • Excellent verbal and written communication skills.

Tech Stack

KubernetesMySQLPythonTerraform

Categories

BackendDevOps