OpenAI

Tech Lead, Model Serving Efficiency

10 months ago
San Francisco, CA, USA
Staff+ / Senior / Mid Level

Base Salary

$380k/yr

Responsibilities

  • Lead engineering efforts to improve model serving and inference performance.
  • Provide technical mentorship and oversight for junior engineers.
  • Drive optimizations for system throughput and reliability.
  • Collaborate with research and product teams to ensure effective model performance.
  • Design and improve serving infrastructure to support growth and reliability.

Requirements

  • Deep expertise in model performance optimization at the inference layer.
  • Strong background in kernel-level systems and low-level performance tuning.
  • Enthusiasm for mentoring junior engineers without formal management responsibilities.
  • Excitement about scaling high-performing AI systems for multimodal workloads.
  • Ability to navigate ambiguity and drive complex initiatives to completion.

Benefits

  • Hybrid work model with 3 days in the office per week.
  • Relocation assistance for new employees.

Categories

AI & ML, Backend