about 3 hours ago
Remote, Canada
Staff+
Responsibilities
- Design, build, and operate reconciliation systems for Grafana Cloud stacks.
- Collaborate across teams to ensure reliable stack lifecycle workflows.
- Improve operational efficiency by reducing deployment complexity.
- Manage rollout mechanisms for plugins, dashboards, and configurations.
- Support new region and cluster rollouts for Grafana Cloud.
- Enhance incident response and recovery paths for stack issues.
- Partner with various teams on customer-impacting stack lifecycle work.
- Contribute to roadmap planning and technical design.
Requirements
- At least 1 year of fully remote work experience.
- Experience with a large SaaS platform and distributed systems.
- Professional experience with Golang and backend service development.
- Strong focus on developer and user experience.
- Experience delivering projects from requirements gathering to shipping.
- Ability to write clean, robust, and well-tested software.
- Experience mentoring junior engineers in a collaborative environment.
- Strong Kubernetes experience in AWS, GCP, or Azure.
- Experience with incident response and post-incident reviews.
Benefits
- 100% remote work with a global culture.
- Opportunities for career growth and development.
- Transparent communication and decision-making.
- Access to modern AI coding assistants.
- 30 days of annual leave with Grafana Shutdown Days.
Tech Stack
Argo CDAWSAzureGoGoogle Cloud PlatformGrafanaHelmKubernetesNode.jsTerraformTypeScript
Categories
BackendDevOps