Grafana

Senior Software Engineer - Grafana Databases, SRE | Sweden | Remote

Grafana

Apply
6 days ago
Remote, Sweden
Senior

Responsibilities

  • Partner closely with product engineering squads to enhance reliability.
  • Own production reliability for high-SLA customer environments.
  • Design and implement automation to scale reliability practices.
  • Ensure customers meet SLO targets and define per-tenant SLOs.
  • Proactively reduce SLO burn to prevent repeat incidents.
  • Serve as a primary escalation point and on-call for incidents.
  • Lead customer-impacting incident response and post-incident reviews.
  • Contribute to design docs and code reviews.
  • Influence feature design for production scalability and operability.
  • Build automation to eliminate toil and improve alert quality.

Requirements

  • 6+ years of engineering experience, with 3+ in SRE/CRE/production engineering.
  • Strong Kubernetes experience in AWS, GCP, or Azure.
  • Familiarity with infrastructure-as-code tooling like Helm and Terraform.
  • Experience operating multi-tenant systems in production.
  • Strong experience designing and implementing SLOs.
  • Proficiency in one or more programming languages (e.g., Go, Python, Java).
  • Knowledge of Linux internals and cloud storage scaling.
  • Excellent problem-solving and troubleshooting skills.
  • Experience in blame-free incident response and writing high-quality PIRs.
  • Ability to reason about performance, scaling, and failure modes.

Benefits

  • 100% remote work with a global culture.
  • Opportunities for career growth and development.
  • Transparent communication and open decision-making.
  • Access to modern AI coding assistants and tools.
  • 30 days of annual leave, including Grafana Shutdown Days.
  • In-person onboarding to foster team connections.

Tech Stack

AWSAzureGoGoogle Cloud PlatformGrafanaHelmJavaKubernetesLinuxPythonTerraform

Categories

BackendDevOps