GrepJob
Udemy

Senior Staff Site Reliability Engineer

Udemy
Apply
4 days ago
Dublin, Ireland
Senior / Staff+
H1B Sponsor

Responsibilities

  • Lead projects to develop and improve infrastructure and tooling.
  • Mentor other engineers on the SRE team.
  • Champion SRE best practices across the organization.
  • Participate in an on-call rotation.
  • Manage infrastructure components including load balancers and Kubernetes clusters.
  • Develop tools to meet internal customer needs using Python and Golang.
  • Respond to incidents and drive standards of reliability.

Requirements

  • Experience managing Kubernetes clusters and cloud environments.
  • Proficiency in infrastructure as code tools for deployment.
  • Experience writing tools and applications in Python, Golang, and Kotlin.
  • Experience being on call and managing incidents.
  • Ability to guide engineering teams on best practices.
  • Strong communication skills for feedback and collaboration.
  • Extensive knowledge of cloud technologies, particularly AWS.
  • Experience with Terraform and Helm for infrastructure management.

Benefits

  • Full access to Udemy courses for personal development.
  • Monthly UDay to invest in self-improvement.
  • Budget for tools and resources to enhance skills.
  • Collaborative work environment that values diverse ideas.

Tech Stack

AWSGoHelmKotlinKubernetesPythonTerraform

Categories

BackendDevOps