GrepJob
Doctolib

Senior Site Reliability Engineer - Observability (x/f/m)

Doctolib
Apply
about 3 hours ago
Berlin, GermanySenior / Mid Level

Responsibilities

  • Lead the observability strategy across the platform.
  • Build scalable, developer-friendly logging and tracing capabilities.
  • Identify and lead large-scale cross-cutting reliability initiatives.
  • Improve incident detection, response, and postmortem analysis capabilities.
  • Participate in the on-call rotation and enhance the on-call experience.

Requirements

  • 3+ years of hands-on experience on a large-scale production platform.
  • Proven experience with cloud platforms such as AWS, Azure, or Google Cloud.
  • Solid understanding of containerization and orchestration technologies like Docker and Kubernetes.
  • Strong understanding of Helm and ArgoCD for GitOps workflows.
  • Deep expertise in observability tooling and architecture.
  • Proficiency in at least one programming language (Ruby, Python, Go, Java, etc.).
  • Experience with monitoring and observability tools.
  • Fluency in English.

Benefits

  • Fully paid Deutschlandticket for public transport.
  • 28 vacation days plus an additional day for each full calendar year of employment.
  • Work from abroad for up to 10 days per year.
  • Company health insurance with supplementary benefits.
  • Company pension scheme with employer subsidy.
  • Doctolib Parent Care program with additional parental leave.
  • Enrollment in long-term employee value sharing plan.
  • Free mental health and coaching services.
  • Subsidized sports membership.
  • Flexible workplace policy with hybrid options.
  • Healthy snacks and subsidized meal benefits.
  • Relocation support for international mobility.
  • Access to AI tools for coding and development.

Tech Stack

AWSAzureDatadogDockerElasticsearchGoGoogle CloudHelmJavaKotlinKubernetesLogstashPrometheusPythonReact NativeRubySwift

Categories