Senior Site Reliability Engineer
Bumble
3 months ago
London, United Kingdom
Senior
Responsibilities
- Manage and optimize large-scale production environments with over 5,000 hosts.
- Independently troubleshoot incidents and lead post-incident service recovery.
- Drive improvements to system stability, performance, and observability.
- Utilize technologies such as Kafka, Redis, and Kubernetes in operations.
Requirements
- 5+ years of experience in Linux system administration or SRE roles.
- Proven experience managing large-scale infrastructure environments.
- Strong troubleshooting and performance tuning skills at the infrastructure level.
- Basic scripting/automation experience in Bash or Python.
- Familiarity with Infrastructure as Code (IaC) tools like Ansible or Puppet.
- Knowledge of distributed systems and container orchestration technologies.
Benefits
- Own meaningful projects that impact millions of Bumble users.
- Learn and grow in a high-performing engineering team committed to mentorship.
- Be part of a culture that values respect, excellence, curiosity, courage, and joy.
- Enjoy competitive compensation, equity, and world-class benefits.
Tech Stack
AnsibleApache KafkaBashKubernetesLinuxPuppetPythonRedis
Categories
DevOps