20 days ago
Responsibilities
- Partner with engineers to build developer tools that empower workflows and deployment infrastructure.
- Ensure reliability of multi-cloud Kubernetes clusters and pipelines.
- Implement metrics, logging, analytics, and alerting for performance and security.
- Develop Infrastructure-as-code deployment tooling and supporting services on multiple cloud providers.
- Automate operations and engineering processes.
- Build machine learning infrastructure for AI teams to work with large-scale datasets.
Requirements
- 5+ years of experience in DevOps, Site Reliability Engineering, or Production Engineering.
- Deep proficiency in coding languages such as Golang or Python.
- Familiarity with container-related security best practices.
- Production experience with Kubernetes and its ecosystem, including tools like cert-manager or external-dns.
- Experience with Kubernetes templating tools such as Helm or Kustomize.
- Experience with Infrastructure-as-Code tools like Terraform or CloudFormation.
- Experience with AWS services such as IAM, S3, EC2, and EKS.
- Experience with other cloud providers like Google Cloud and Azure is a bonus.
- Production experience with database software such as PostgreSQL.
- Experience with GitOps tooling like Flux or Argo.
- Experience with CI/CD tools such as GitHub Actions.
Benefits
- Paid parental leave to support you and your family.
- Monthly Health & Wellness allowance.
- 28 days of PTO in Berlin.
