Tech Lead, Model Serving Efficiency
OpenAI
10 months ago
San Francisco, CA, USA
Staff+ / Senior / Mid Level
Base Salary
$380k/yr
Responsibilities
- Lead engineering efforts to improve model serving and inference performance.
- Provide technical mentorship and oversight for junior engineers.
- Drive optimizations for system throughput and reliability.
- Collaborate with research and product teams to ensure effective model performance.
- Design and improve serving infrastructure to support growth and reliability.
Requirements
- Deep expertise in model performance optimization at the inference layer.
- Strong background in kernel-level systems and low-level performance tuning.
- Enthusiasm for mentoring junior engineers without formal management responsibilities.
- Excitement about scaling high-performing AI systems for multimodal workloads.
- Ability to navigate ambiguity and drive complex initiatives to completion.
Benefits
- Hybrid work model with 3 days in the office per week.
- Relocation assistance for new employees.
Categories
AI & ML, Backend