Senior Site Reliability Engineer
GetYourGuide
3 months ago
Berlin, Germany
Senior
Responsibilities
- Build and scale cloud-based infrastructure, managing Kubernetes clusters and the AWS environment.
- Ensure high availability, autoscaling, and failure recovery of production systems.
- Develop custom controllers for automating cluster management.
- Leverage Istio and Envoy for service communication and network observability.
- Drive initiatives for better system design and new technology implementation.
- Participate in infrastructure on-call rotations.
- Champion an operations culture that delivers highly available services.
Requirements
- Availability from 13:00 to 17:00 Central European Time (CET) for team collaboration.
- Experience with Kubernetes and running containers at scale.
- Good understanding of the Linux operating system.
- Strong coding skills in at least one programming language, preferably Go.
- Understanding of distributed systems, networking, and container technology.
- Familiarity with public cloud environments like AWS.
- Proactive team player passionate about helping the team succeed.
- Strong problem-solving skills for diagnosing production issues.
- Excellent written and verbal communication skills in English.
Benefits
- Annual personal growth budget and mentorship programs.
- Work from anywhere in the world for 40 days per year.
- Flexible working arrangements for work-life balance.
- Opportunities for team collaboration and social events.
- Monthly transportation and fitness budget.
- Discounts on GetYourGuide activities for you and your family.
- Language reimbursement program.
- Health and wellness benefits.
Tech Stack
Ambassador, AWS, Go, Istio, Kubernetes, Linux
Categories
Backend, DevOps