20 days ago
Base Salary
$205k - $270k/yr
Responsibilities
- Partner with engineers to build developer tools that enhance workflows and deployment infrastructure.
- Ensure the reliability of multi-cloud Kubernetes clusters and pipelines.
- Implement metrics, logging, analytics, and alerting for performance and security.
- Develop Infrastructure-as-Code deployment tooling across multiple cloud providers.
- Automate operations and engineering processes so the team can focus on impactful work.
- Build machine learning infrastructure for AI teams to work with large-scale datasets.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or a related field.
- Deep proficiency in programming languages such as Go or Python.
- Strong familiarity with container-related security best practices.
- Production experience with Kubernetes and its ecosystem, including tools like cert-manager or external-dns.
- Experience with Kubernetes templating tools such as Helm or Kustomize.
- Proficient in Infrastructure-as-Code tools like Terraform or CloudFormation.
- Experience with AWS services such as IAM, S3, EC2, and EKS.
- Familiarity with other cloud providers like Google Cloud and Azure is a plus.
- Experience with database software such as PostgreSQL.
- Knowledge of GitOps tooling like Flux or Argo CD.
- Experience with CI/CD tools such as GitHub Actions.
Benefits
- Comprehensive medical, dental, and vision coverage for you and your family.
- Flexible PTO to take time off as needed.
- Paid parental leave for new parents.
- Retirement savings plan to help you plan for the future.
- Remote work setup budget for a productive home office.
- Monthly wellness and communication stipend.
- In-office meal program and commuter benefits for onsite employees.
