Obsidian Security

Sr. Staff Site Reliability Engineer

5 days ago
Palo Alto, CA, USA
Senior / Staff+
H1B Sponsor

Base Salary

$232k - $263k/yr

Responsibilities

  • Define and lead long-term reliability strategy across services.
  • Establish end-to-end system visibility frameworks.
  • Partner across teams to embed reliability and standardize SLI/SLOs.
  • Build intelligent detection systems and enable self-service observability.
  • Define and evolve a tiered incident communication strategy.
  • Contribute hands-on to system design, monitoring, and debugging.

Requirements

  • 5+ years in SRE, Production Engineering, or related roles.
  • 3+ years operating at a senior or technical leadership level.
  • Deep expertise in AWS and/or GCP.
  • Experience with Kubernetes and Helm.
  • Familiarity with observability stacks like Prometheus and Grafana.
  • Proven experience designing reliability systems for multi-tenant SaaS platforms.
  • Strong debugging skills across distributed microservices.

Benefits

  • Competitive compensation with equity and 401k.
  • Comprehensive healthcare with dental and vision coverage.
  • Flexible paid time off and paid holidays.
  • 12 weeks of new parent or family leave.
  • Personal and professional development resources.

Tech Stack

AWS, GitLab CI/CD, Google Cloud Platform, Grafana, Helm, Kubernetes, Prometheus
