about 1 year ago
Toronto, Canada +5 more
Mid Level / Senior
H1B Sponsor
Responsibilities
- Design and write high-performance, scalable software for training.
- Improve the performance of the training setup across both infrastructure and codebase.
- Design and implement tools that speed up training cycles and make the training infrastructure more efficient.
- Research, implement, and experiment with ideas on supercompute and data infrastructure.
- Collaborate with leading researchers in the AI field.
Requirements
- Extremely strong software engineering skills.
- Proficiency in Python, ML frameworks such as JAX and PyTorch, and compiler infrastructure such as XLA/MLIR.
- Experience with distributed training infrastructure, including cluster orchestration and scheduling tools such as Kubernetes and Slurm.
- Hands-on experience in training large models at scale.
- Bonus: published papers at top-tier venues in AI.
Benefits
- Open and inclusive culture and work environment.
- Weekly lunch stipend, in-office lunches, and snacks.
- Full health and dental benefits, including mental health support.
- 100% Parental Leave top-up for up to 6 months.
- Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
- Remote-flexible work arrangements with offices in major cities.
- 6 weeks of vacation (30 working days).
