GrepJob
Together AI

Together Cloud Infrastructure Engineer

Together AI
Apply
4 days ago
Amsterdam, NetherlandsSenior
H1B Sponsor

Responsibilities

  • Design, build, and maintain backend services for data centers.
  • Automate hardware management tasks such as VM provisioning.
  • Develop the IaaS software layer for a new GB200 data center.
  • Work on a global high-performance object store for massive datasets.
  • Build observability stacks for automated node lifecycle management.
  • Conduct architecture and research for decentralized AI workloads.
  • Contribute to the core open-source Together AI platform.
  • Create services, tools, and developer documentation.
  • Develop testing frameworks for robustness and fault-tolerance.

Requirements

  • 5+ years of professional software development experience.
  • Proficiency in at least one backend programming language, preferably Golang.
  • Experience writing high-performance, production-quality code.
  • Demonstrated experience with high-performance micro-service architectures.
  • Excellent communication skills for technical documentation and collaboration.
  • Deep experience with Kubernetes internals is a plus.
  • Experience with VMs/hypervisors and DC networking technologies is a plus.
  • Familiarity with infrastructure automation tools and CI/CD pipelines.
  • Experience building IaaS or PaaS systems at scale is a plus.
  • Knowledge of GPU programming and related technologies is a plus.

Benefits

  • Hybrid working model with two days a week in the Amsterdam office.

Tech Stack

AnsibleAWSAzureGitHub ActionsGoGoogle Cloud PlatformGrafanaKubernetesPrometheusTerraform

Categories

AI & MLBackendData EngineeringDevOps