Staff Research Engineer, Model Efficiency

8 months ago

Toronto, Canada +3 moreStaff+

H1B Sponsor

Responsibilities

Develop and prototype techniques to improve model efficiency in production.
Optimize LLM architecture and inference processes.
Enhance decoding and inference-time algorithms.
Collaborate on software/hardware co-design for GPU acceleration.
Contribute to performance optimization without compromising model quality.

Requirements

PhD in Machine Learning or a related field.
Understanding of LLM architecture and optimization under resource constraints.
Significant experience with techniques that enhance model efficiency.
Strong software engineering skills.
Ability to thrive in a fast-paced, high-ambiguity start-up environment.
Publications at top-tier conferences (ICLR, ACL, NeurIPS) are preferred.
Passion for mentoring others.

Benefits

Open and inclusive culture and work environment.
Work closely with a cutting-edge AI research team.
Weekly lunch stipend, in-office lunches, and snacks.
Full health and dental benefits, including mental health support.
100% Parental Leave top-up for up to 6 months.
Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
Remote-flexible work options and co-working stipend.
6 weeks of vacation (30 working days).

Categories

AI & MLData Science