about 3 hours ago
San Francisco, CA, USA
Staff+
H1B Sponsor
Base Salary
$182k - $249k/yr
Responsibilities
- Investigate and resolve infrastructure issues reported by internal teams.
- Provide technical guidance and support across multiple technical domains.
- Contribute to runbooks, documentation, and knowledge sharing.
- Mentor junior team members on SRE best practices and troubleshooting methodologies.
- Identify and implement improvements to monitoring, alerting, and incident response processes.
Requirements
- 7+ years of Site Reliability Engineering or equivalent systems administration experience.
- Proficiency with Kubernetes and container orchestration.
- Strong Linux/Unix systems administration background.
- Good understanding of CI/CD and deployment strategies.
- Good grasp of networking concepts.
- Experience with infrastructure as code, infrastructure troubleshooting, and general architecture.
- Excellent communication and documentation skills.
Benefits
- Work on critical infrastructure supporting multiple teams.
- Opportunity to grow expertise in modern infrastructure tooling.
- Collaborative environment with strong knowledge-sharing culture.
- Impact on infrastructure reliability and team efficiency.
Tech Stack
GoKubernetesPythonTerraform
Categories
DevOps