
Site Reliability Engineer
Obsidian Securityabout 4 hours ago
Palo Alto, CA, USAMid Level / Senior
H1B Sponsor
Base Salary
$165k - $190k/yr
Responsibilities
- Support and maintain the service quality of our customer-facing SaaS security platform.
- Address complex challenges around scalability, reliability, observability, and cost efficiency.
- Collaborate with Engineering teams to maintain and enhance Helm charts, application deployment, monitoring, and CI/CD pipelines.
- Embed into the engineering team to gain a deep understanding of the application.
- Define service verification strategies and implement them as part of the CI/CD process to meet SLAs.
- Improve developer experience by optimizing CI/CD workflows and performance.
- Participate in the on-call rotation, providing 24/7 support in coordination with the global SRE team.
- Monitor, debug, and optimize production infrastructure and services on AWS/GCP.
Requirements
- 3+ years of experience in a DevOps or SRE role supporting SaaS services on GCP and/or AWS.
- Bachelor's degree in Computer Science or related field.
- Strong proficiency in Kubernetes, microservices architecture, Helm, GitLab CI/CD, and ArgoCD, Prometheus, Grafana.
- Programming experience in at least one language; Golang or Python preferred.
- Deep understanding of autoscaling, version upgrades, and cloud service optimization.
- Bonus if familiar with technologies like Kafka, Elasticsearch, PostgreSQL, ScyllaDB, Databricks, Dagster, Sentry, Kong.
Benefits
- Competitive compensation with equity and 401k.
- Comprehensive healthcare with dental and vision coverage.
- Flexible paid time off and paid holiday time off.
- 12 weeks of new parent or family leave.
- Personal and professional development resources.
Tech Stack
Apache KafkaAWSDatabricksElasticsearchGitLab CI/CDGoGoogle Cloud PlatformGrafanaHelmKubernetesPostgreSQLPrometheusPython