GrepJob
PhonePe

Site Reliability Engineer 3 - Cloud

PhonePe
Apply
5 days ago
Bengaluru, India
Mid Level / Senior

Responsibilities

  • Configure, maintain, and manage Ubuntu/Linux Virtual Machines in Azure or AWS.
  • Design and manage cloud-native components for log storage and database management.
  • Configure and maintain critical network components, including Firewalls and Route Tables.
  • Establish and manage high-speed connectivity via Express Route or Direct Connect.
  • Drive automation for all BAU tasks using Terraform.
  • Use Saltstack or Ansible for automated deployment and configuration of services.
  • Set up and manage high availability services like MySQL and Aerospike.
  • Implement and manage monitoring systems like Prometheus and Grafana.

Requirements

  • 5 to 12 years of experience in an SRE or high-level DevOps role.
  • Deep hands-on experience with either Azure or AWS core services.
  • Expert proficiency in Linux (Ubuntu) for system administration.
  • Experience with Nginx and HAProxy for web/proxy management.
  • Mastery of DNS, BGP routing, and private connectivity troubleshooting.
  • Proactive approach to identifying and solving infrastructure challenges.
  • Ability to lead incident response and create Root Cause Analysis documents.

Benefits

  • Medical, Critical Illness, Accidental, and Life Insurance.
  • Employee Assistance Program and Onsite Medical Center.
  • Maternity and Paternity Benefits, Adoption Assistance, and Day-care Support.
  • Relocation benefits and Transfer Support Policy.
  • Employee PF Contribution, Flexible PF Contribution, and Gratuity.
  • Higher Education Assistance and Car Lease options.

Tech Stack

AnsibleAWSAzureDockerGoGrafanaJavaLinuxMySQLPrometheusPythonRabbitMQTerraform

Categories

DevOpsSecurity