6 months ago
San Francisco, CA, USAMid Level / Senior
Responsibilities
- Advance the capabilities of small, high-performance language models.
- Work on training methodology, optimization, evaluation, and model architecture.
- Collaborate with infrastructure and product teams to deploy models quickly.
- Push the limits of machine learning research and engineering.
Requirements
- Strong background in machine learning, deep learning, or related fields.
- 2+ years of experience in ML research or production systems.
- Fluency in Python and frameworks like PyTorch or JAX.
- Experience with training and optimizing large or efficient models.
- Strong understanding of applied optimization, distributed training, or model evaluation.
- Familiarity with code models, retrieval systems, or language modeling is a plus.
- Advanced degree (MS or PhD) in a quantitative field, or equivalent industry experience.
- Willingness to work in-person from our SF office in FiDi.
