
Research Engineer, Frontier Speculative Decoding
Together AI
Posted 4 days ago
San Francisco, CA, USA or New York, NY, USA
Mid Level / Senior
H1B Sponsor
Base Salary
$190k - $270k/yr
Responsibilities
- Design and iterate on novel speculator algorithms to enhance accuracy and efficiency.
- Serve as the link between raw data and production-ready models.
- Work in a fast-paced, high-impact role at the cutting edge of generative AI.
- Collaborate with experts to solve real-world, high-performance challenges.
- Engage directly with customers to understand their needs and integrate solutions.
Requirements
- A genuine love for data curation and processing with meticulous attention to detail.
- Demonstrated ability to perform effective hyperparameter searches.
- Experience working with and improving existing training codebases.
- Strong attention to detail in evaluating model checkpoints for quality and performance.
- Proficiency in Python and PyTorch.
- Familiarity with high-performance computing clusters managed with SLURM and/or Kubernetes.
- Knowledge of modern LLMs and generative models.
- Basic understanding of distributed training frameworks like FSDP and DeepSpeed.
- Bachelor’s, Master’s, or Ph.D. degree in Computer Science or a related field, or equivalent experience.
Benefits
- Competitive compensation and startup equity.
- Health insurance and other competitive benefits.