GrepJob
Okta

Staff Site Reliability Engineer - Observability

Okta
Apply
4 days ago
San Francisco, CA, USA
Staff+
H1B Sponsor

Base Salary

$194k - $267k/yr

Responsibilities

  • Design, build, and maintain scalable observability infrastructure using Terraform.
  • Optimize the collection, processing, and storage of observability data for high reliability and low latency.
  • Participate in on-call rotations and lead post-incident reviews.
  • Automate the deployment and scaling of observability agents and collectors.

Requirements

  • Minimum 5+ years of experience managing observability in Google Cloud.
  • Expertise in creating actionable Splunk or Grafana dashboards.
  • Minimum 3+ years in an SRE, DevOps, or Systems Engineering role focused on high-availability systems.
  • Strong coding skills in Python or Go for building internal tools.
  • Deep understanding of Linux internals, networking, and container orchestration.

Benefits

  • Comprehensive benefits including health, dental, and vision insurance.
  • 401(k) plan and flexible spending account.
  • Paid leave including PTO and parental leave.
  • Opportunities for social impact and community connection.

Tech Stack

AWSGoGoogle CloudGrafanaKubernetesPythonRubySplunkTerraform

Categories

DevOps