Research Engineer, Core ML

about 2 months ago

San Francisco, CA, USAMid Level / Senior / Staff+

H1B Sponsor

Base Salary

$200k - $280k/yr

Responsibilities

Design and prototype algorithms for low-latency, high-throughput inference.
Implement and maintain changes in high-performance inference engines.
Profile and optimize performance across GPU, networking, and memory layers.
Design and operate RL and post-training pipelines.
Make RL and post-training workloads more efficient with inference-aware training loops.
Train, evaluate, and iterate on frontier models using inference stack.
Profile, debug, and optimize inference and post-training services under production workloads.
Drive roadmap items requiring engine modifications.
Establish metrics and experimentation frameworks for validation.
Set technical direction for cross-team efforts.
Mentor engineers and researchers on full-stack ML systems work.