GrepJob
Together AI

Research Engineer, Core ML

Together AI
Apply
4 days ago
San Francisco, CA, USAMid Level / Senior / Staff+
H1B Sponsor

Base Salary

$200k - $280k/yr

Responsibilities

  • Design and prototype algorithms for low-latency, high-throughput inference.
  • Implement and maintain changes in high-performance inference engines.
  • Profile and optimize performance across GPU, networking, and memory layers.
  • Design and operate RL and post-training pipelines.
  • Make RL and post-training workloads more efficient with inference-aware training loops.
  • Train, evaluate, and iterate on frontier models using inference stack.
  • Profile, debug, and optimize inference and post-training services under production workloads.
  • Drive roadmap items requiring engine modifications.
  • Establish metrics and experimentation frameworks for validation.
  • Set technical direction for cross-team efforts.
  • Mentor engineers and researchers on full-stack ML systems work.

Requirements

  • 3+ years of experience in ML systems or large-scale model training.
  • Advanced degree in Computer Science, EE, or related field.
  • Demonstrated experience owning complex technical projects end-to-end.
  • Strong coding ability in Python.
  • Expertise in large-scale inference systems or RL/post-training for LLMs.
  • Comfortable working from algorithms to engines.
  • Solid research foundation in ML systems or RL.
  • Ability to operate as a full-stack problem solver.

Benefits

  • Competitive compensation and startup equity.
  • Health insurance and other competitive benefits.

Tech Stack

Categories

AI & MLBackendData Science