
Sr. Staff Site Reliability Engineer
Obsidian Security5 days ago
Palo Alto, CA, USASenior / Staff+
H1B Sponsor
Base Salary
$232k - $263k/yr
Responsibilities
- Define and lead long-term reliability strategy across services.
- Establish end-to-end system visibility frameworks.
- Partner across teams to embed reliability and standardize SLI/SLOs.
- Build intelligent detection systems and enable self-service observability.
- Define and evolve a tiered incident communication strategy.
- Contribute hands-on to system design, monitoring, and debugging.
Requirements
- 5+ years in SRE, Production Engineering, or related roles.
- 3+ years operating at a senior or technical leadership level.
- Deep expertise in AWS and/or GCP.
- Experience with Kubernetes and Helm.
- Familiarity with observability stacks like Prometheus and Grafana.
- Proven experience designing reliability systems for multi-tenant SaaS platforms.
- Strong debugging skills across distributed microservices.
Benefits
- Competitive compensation with equity and 401k.
- Comprehensive healthcare with dental and vision coverage.
- Flexible paid time off and paid holiday time off.
- 12 weeks of new parent or family leave.
- Personal and professional development resources.