Site Reliability Engineer (Auth0)
Okta
3 months ago
Barcelona, Spain
Mid Level / Senior
H1B Sponsor
Responsibilities
- Design and build custom software in Go to enhance platform reliability.
- Partner with engineering teams to improve service availability and performance.
- Identify opportunities for improvement in product infrastructure and observability.
- Contribute to on-call rotation and respond to critical incidents.
- Develop and refine SRE tooling and processes for operational efficiency.
- Define and document reliability best practices across the organization.
Requirements
- Proactive and systematic problem-solving approach with high ownership.
- Experience in a production environment supporting large-scale applications.
- Proficiency in at least one programming language, preferably Go.
- Experience with infrastructure as code (Terraform) and container orchestration (Kubernetes, Docker).
- Expertise in a major cloud provider (Azure, AWS, or GCP).
- Strong understanding of microservices architecture and databases.
- Knowledge of core SRE principles, including SLIs, SLOs, and error budgets.
- Experience in an on-call rotation for a 24/7 cloud-based environment.
- Exceptional communication and collaboration skills in a remote team.
Benefits
- Comprehensive benefits package.
- Opportunities for social impact initiatives.
- Support for talent development and community building.
Tech Stack
AWSAzureDockerGoGoogle Cloud PlatformKubernetesSQLTerraform
Categories
DevOps