
AI infrastructure Engineer (SRE) Amsterdam
Together AI4 days ago
Amsterdam, NetherlandsSenior / Staff+
H1B Sponsor
Responsibilities
- Participate in an on-call rotation to address incidents affecting service availability.
- Utilize Ansible, Terraform, and Kubernetes to build and manage scalable infrastructure.
- Develop monitoring systems to ensure high-quality service delivery.
- Design and implement operational processes for deployments and upgrades.
- Troubleshoot production issues across various services and stack levels.
- Identify architectural improvements for reliability, performance, and availability.
- Plan the growth of Together AI’s infrastructure.
Requirements
- 7+ years of professional SRE or related experience.
- Bachelor's degree in Computer Science or a related field or equivalent work experience.
- Expert knowledge of Ansible, Terraform, and Kubernetes.
- Proficiency in programming and scripting languages.
- Direct experience in monitoring and observability practices.
- Advanced knowledge of cloud services.
- Ability to collaborate effectively with diverse stakeholders.