GrepJob
Cohere

Staff Research Engineer, Model Efficiency

Cohere
Apply
8 months ago
Toronto, Canada +3 moreStaff+
H1B Sponsor

Responsibilities

  • Develop and prototype techniques to improve model efficiency in production.
  • Optimize LLM architecture and inference processes.
  • Enhance decoding and inference-time algorithms.
  • Collaborate on software/hardware co-design for GPU acceleration.
  • Contribute to performance optimization without compromising model quality.

Requirements

  • PhD in Machine Learning or a related field.
  • Understanding of LLM architecture and optimization under resource constraints.
  • Significant experience with techniques that enhance model efficiency.
  • Strong software engineering skills.
  • Ability to thrive in a fast-paced, high-ambiguity start-up environment.
  • Publications at top-tier conferences (ICLR, ACL, NeurIPS) are preferred.
  • Passion for mentoring others.

Benefits

  • Open and inclusive culture and work environment.
  • Work closely with a cutting-edge AI research team.
  • Weekly lunch stipend, in-office lunches, and snacks.
  • Full health and dental benefits, including mental health support.
  • 100% Parental Leave top-up for up to 6 months.
  • Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
  • Remote-flexible work options and co-working stipend.
  • 6 weeks of vacation (30 working days).

Categories

AI & MLData Science