
Research Engineer, Core ML
Together AI4 days ago
Base Salary
$200k - $280k/yr
Responsibilities
- Design and prototype algorithms for low-latency, high-throughput inference.
- Implement and maintain changes in high-performance inference engines.
- Profile and optimize performance across GPU, networking, and memory layers.
- Design and operate RL and post-training pipelines.
- Make RL and post-training workloads more efficient with inference-aware training loops.
- Train, evaluate, and iterate on frontier models using inference stack.
- Profile, debug, and optimize inference and post-training services under production workloads.
- Drive roadmap items requiring engine modifications.
- Establish metrics and experimentation frameworks for validation.
- Set technical direction for cross-team efforts.
- Mentor engineers and researchers on full-stack ML systems work.
Requirements
- 3+ years of experience in ML systems or large-scale model training.
- Advanced degree in Computer Science, EE, or related field.
- Demonstrated experience owning complex technical projects end-to-end.
- Strong coding ability in Python.
- Expertise in large-scale inference systems or RL/post-training for LLMs.
- Comfortable working from algorithms to engines.
- Solid research foundation in ML systems or RL.
- Ability to operate as a full-stack problem solver.
Benefits
- Competitive compensation and startup equity.
- Health insurance and other competitive benefits.