Cohere

Member of Technical Staff, Post-Training

11 months ago
Toronto, Canada +5 more
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Design and write high-performance, scalable software for training models.
  • Post-train models to achieve state-of-the-art performance.
  • Coordinate with specialist teams to enhance model performance.
  • Implement techniques to improve training cycle results.
  • Research and experiment with ideas on supercompute and data infrastructure.

Requirements

  • Extremely strong software engineering skills.
  • Proficiency in Python and ML frameworks like JAX and PyTorch.
  • Experience with distributed training infrastructure such as Kubernetes and Slurm.
  • Hands-on experience with large-scale distributed training strategies.
  • Experience in the post-training phase with a focus on performance optimization.

Benefits

  • Open and inclusive culture and work environment.
  • Weekly lunch stipend, in-office lunches, and snacks.
  • Full health and dental benefits, including mental health support.
  • 100% parental leave top-up for up to 6 months.
  • Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
  • Remote-flexible work options with offices in major cities.
  • 6 weeks of vacation (30 working days).

Tech Stack

Kubernetes, Python, PyTorch

Categories

AI & ML, Data Engineering