Research Engineer, Frontier Speculative Decoding

about 2 months ago

San Francisco, CA, USA or New York, NY, USA

Mid Level / Senior

H-1B Sponsor

Base Salary

$190k - $270k/yr

Responsibilities

Design and iterate on novel speculator algorithms to enhance accuracy and efficiency.
Serve as the link between raw data and production-ready models.
Work in a fast-paced, high-impact role at the cutting edge of generative AI.
Collaborate with experts to solve real-world, high-performance challenges.
Engage directly with customers to understand their needs and integrate solutions.

Requirements

A genuine love for data curation and processing, with meticulous attention to detail.
Demonstrated ability to perform effective hyperparameter searches.
Experience working with and improving existing training codebases.
Strong attention to detail in evaluating model checkpoints for quality and performance.
Proficiency in Python and PyTorch.
Familiarity with SLURM and/or Kubernetes clusters in high-performance computing environments.
Knowledge of modern LLMs and generative models.
Basic understanding of distributed training frameworks such as FSDP and DeepSpeed.
Bachelor’s degree, Master’s degree, or Ph.D. in Computer Science or a related field, or equivalent experience.

Benefits

Competitive compensation and startup equity.
Health insurance and other competitive benefits.

Tech Stack

Kubernetes, Python, PyTorch

Categories

AI & ML, Data Science