Bumble

Senior Site Reliability Engineer

3 months ago
London, United Kingdom
Senior

Responsibilities

  • Manage and optimize large-scale production environments with over 5,000 hosts.
  • Independently troubleshoot incidents and lead post-incident service recovery.
  • Drive improvements to system stability, performance, and observability.
  • Use technologies such as Kafka, Redis, and Kubernetes in day-to-day operations.

Requirements

  • 5+ years of experience in Linux system administration or SRE roles.
  • Proven experience managing large-scale infrastructure environments.
  • Strong troubleshooting and performance tuning skills at the infrastructure level.
  • Basic scripting/automation experience in Bash or Python.
  • Familiarity with Infrastructure as Code (IaC) tools like Ansible or Puppet.
  • Knowledge of distributed systems and container orchestration technologies.

Benefits

  • Own meaningful projects that impact millions of Bumble users.
  • Learn and grow in a high-performing engineering team committed to mentorship.
  • Be part of a culture that values respect, excellence, curiosity, courage, and joy.
  • Enjoy competitive compensation, equity, and world-class benefits.

Tech Stack

Ansible, Apache Kafka, Bash, Kubernetes, Linux, Puppet, Python, Redis

Categories

DevOps