
Together Cloud Infrastructure Engineer
Together AI4 days ago
Responsibilities
- Design, build, and maintain backend services for data centers.
- Automate hardware management tasks such as VM provisioning.
- Develop the IaaS software layer for a new GB200 data center.
- Work on a global high-performance object store for massive datasets.
- Build observability stacks for automated node lifecycle management.
- Conduct architecture and research for decentralized AI workloads.
- Contribute to the core open-source Together AI platform.
- Create services, tools, and developer documentation.
- Develop testing frameworks for robustness and fault-tolerance.
Requirements
- 5+ years of professional software development experience.
- Proficiency in at least one backend programming language, preferably Golang.
- Experience writing high-performance, production-quality code.
- Demonstrated experience with high-performance micro-service architectures.
- Excellent communication skills for technical documentation and collaboration.
- Deep experience with Kubernetes internals is a plus.
- Experience with VMs/hypervisors and DC networking technologies is a plus.
- Familiarity with infrastructure automation tools and CI/CD pipelines.
- Experience building IaaS or PaaS systems at scale is a plus.
- Knowledge of GPU programming and related technologies is a plus.
Benefits
- Hybrid working model with two days a week in the Amsterdam office.