Responsibilities
- Partner with engineers to build developer tools and deployment infrastructure.
- Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
- Implement metrics, logging, analytics, and alerting for performance and security.
- Develop Infrastructure-as-Code deployment tooling across multiple cloud providers.
- Automate operations and engineering processes.
- Build machine learning infrastructure for AI teams to work with large-scale datasets.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or a related field.
- Deep proficiency in programming languages such as Go or Python.
- Familiarity with container-related security best practices.
- Production experience with Kubernetes and its ecosystem.
- Experience with Kubernetes templating tools like Helm or Kustomize.
- Experience with Infrastructure-as-Code tools such as Terraform or CloudFormation.
- Experience with AWS services like IAM, S3, EC2, and EKS.
- Experience with other cloud providers like Google Cloud and Azure is a bonus.
- Production experience with database software such as PostgreSQL.
- Experience with GitOps tooling like Flux or Argo.
- Experience with CI/CD tools such as GitHub Actions.
Benefits
- A variety of medical benefits designed to fit your stage of life.
- Flexible vacation time to promote a healthy work-life blend.
- Paid parental leave to support you and your family.
