Responsibilities
- Develop and prototype techniques to improve model efficiency in production.
- Optimize LLM architecture and inference processes.
- Enhance decoding and inference-time algorithms.
- Collaborate on software/hardware co-design for GPU acceleration.
- Contribute to performance optimization without compromising model quality.
Requirements
- PhD in Machine Learning or a related field.
- Understanding of LLM architecture and optimization under resource constraints.
- Significant experience with techniques that enhance model efficiency.
- Strong software engineering skills.
- Ability to thrive in a fast-paced, high-ambiguity start-up environment.
- Publications at top-tier conferences (ICLR, ACL, NeurIPS) are preferred.
- Passion for mentoring others.
Benefits
- Open and inclusive culture and work environment.
- Work closely with a cutting-edge AI research team.
- Weekly lunch stipend, in-office lunches, and snacks.
- Full health and dental benefits, including mental health support.
- 100% parental leave top-up for up to 6 months.
- Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
- Remote-flexible work options and co-working stipend.
- 6 weeks of vacation (30 working days).
Categories
AI & ML, Data Science
