about 3 hours ago
Responsibilities
- Enhance the scalability and reliability of Grafana Cloud's observability platform.
- Collaborate with engineers to deliver large distributed systems.
- Define SLOs/SLIs and perform capacity planning.
- Participate in on-call rotations to maintain system health.
- Work with cloud-native architectures and operational practices.
Requirements
- Proven experience in delivering large distributed systems.
- Deep understanding of system design tradeoffs.
- Hands-on experience with cloud-native architectures.
- Strong coding skills in Go, Python, or similar languages.
- Comfort with AI-assisted development tools.
- Excellent communication skills for cross-functional collaboration.
Benefits
- 100% remote work with a global culture.
- Career growth pathways and opportunities for development.
- Transparent communication and open decision-making.
- 30 days of annual leave with Grafana Shutdown Days.
- Access to modern AI coding assistants and tools.