GetYourGuide

Senior Site Reliability Engineer

GetYourGuide

Apply
3 months ago
Berlin, Germany
Senior

Responsibilities

  • Build and scale cloud-based infrastructure managing Kubernetes clusters and AWS environment.
  • Ensure high availability, autoscaling, and failure recovery of production systems.
  • Develop custom controllers for automating cluster management.
  • Leverage Istio and Envoy for service communication and network observability.
  • Drive initiatives for better system design and new technology implementation.
  • Participate in infrastructure on-call rotations.
  • Champion operations culture to deliver highly available services.

Requirements

  • Availability from 13:00 to 17:00 Central European Standard Time for team collaboration.
  • Experience with Kubernetes and running containers at scale.
  • Good understanding of the Linux operating system.
  • Strong coding skills in at least one programming language, preferably Go.
  • Understanding of distributed systems, networking, and container technology.
  • Familiarity with public cloud environments like AWS.
  • Proactive team player passionate about helping the team succeed.
  • Strong problem-solving skills for diagnosing production issues.
  • Excellent written and verbal communication skills in English.

Benefits

  • Annual personal growth budget and mentorship programs.
  • Work from anywhere in the world for 40 days per year.
  • Flexible working arrangements for work-life balance.
  • Opportunities for team collaboration and social events.
  • Monthly transportation and fitness budget.
  • Discounts on GetYourGuide activities for you and your family.
  • Language reimbursement program.
  • Health and wellness benefits.

Tech Stack

AmbassadorAWSGoIstioKubernetesLinux

Categories

BackendDevOps