7 months ago
Toronto, Canada +2 moreEntry Level / Mid Level / Senior / Staff+
H1B Sponsor
Responsibilities
- Build and operate Kubernetes compute superclusters across multiple clouds.
- Partner with cloud providers to optimize infrastructure costs, performance, and reliability for AI workloads.
- Work closely with research teams to understand their infrastructure needs.
- Design and build resilient, scalable systems for training AI models.
- Encourage software best practices and participate in team processes.
Requirements
- Deep experience running Kubernetes clusters at scale.
- Strong programming skills in Go or Python.
- Preference for contributing to Open Source solutions.
- Self-directed and adaptable with problem-solving skills.
- Excellent communication skills and ability to thrive in fast-paced environments.
Benefits
- Open and inclusive culture and work environment.
- Weekly lunch stipend, in-office lunches, and snacks.
- Full health and dental benefits, including mental health support.
- 100% Parental Leave top-up for up to 6 months.
- Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
- Remote-flexible work options and co-working stipend.
- 6 weeks of vacation (30 working days).
