GrepJob
OpenAI

Software Engineer, ML Systems & Training Architecture

OpenAI
Apply
about 3 hours ago

Base Salary

$295k - $380k/yr

Responsibilities

  • Review, improve, and clean up code across training frameworks and adjacent infrastructure.
  • Identify risky or low-quality changes before they land and raise the code quality bar.
  • Debug issues across ML training systems, GPUs, clusters, networking, and related infrastructure.
  • Help researchers and engineers unblock broken training jobs and flaky workflows.
  • Improve the reliability, maintainability, and usability of the robotics team’s training framework.
  • Move quickly on practical engineering problems that directly affect team velocity.

Requirements

  • Strong software engineering fundamentals and excellent code review judgment.
  • Experience with ML systems, training frameworks, GPUs, distributed systems, or complex technical environments.
  • Ability to read and debug unfamiliar codebases quickly.
  • Proven track record of shipping high-quality code with strong velocity.
  • Low-ego, responsive, and motivated by helping others move faster.
  • Experience reviewing messy, fast-moving, or AI-generated codebases.

Benefits

  • Relocation assistance for new employees.
  • In-office work environment in San Francisco, CA.

Categories