Okta

Staff Site Reliability Engineer (SRE), Agile

about 3 hours ago
San Francisco, CA, USA
Staff+
H1B Sponsor

Base Salary

$182k - $249k/yr

Responsibilities

  • Investigate and resolve infrastructure issues reported by internal teams.
  • Provide technical guidance and support across multiple technical domains.
  • Contribute to runbooks, documentation, and knowledge sharing.
  • Mentor junior team members on SRE best practices and troubleshooting methodologies.
  • Identify and implement improvements to monitoring, alerting, and incident response processes.

Requirements

  • 7+ years of Site Reliability Engineering or equivalent systems administration experience.
  • Proficiency with Kubernetes and container orchestration.
  • Strong Linux/Unix systems administration background.
  • Good understanding of CI/CD and deployment strategies.
  • Good grasp of networking concepts.
  • Experience with infrastructure as code, infrastructure troubleshooting, and general architecture.
  • Excellent communication and documentation skills.

Benefits

  • Work on critical infrastructure supporting multiple teams.
  • Opportunity to grow expertise in modern infrastructure tooling.
  • Collaborative environment with strong knowledge-sharing culture.
  • Impact on infrastructure reliability and team efficiency.

Tech Stack

Go, Kubernetes, Python, Terraform

Categories

DevOps