Tech Lead, Model Serving Efficiency

about 1 year ago

San Francisco, CA, USAStaff+ / Senior / Mid Level

H1B Sponsor

Base Salary

$380k - $380k/yr

Responsibilities

Lead engineering efforts to improve model serving and inference performance.
Provide technical mentorship and oversight for junior engineers.
Drive optimizations for system throughput and reliability.
Collaborate with research and product teams to ensure effective model performance.
Design and improve serving infrastructure to support growth and reliability.

Requirements

Deep expertise in model performance optimization at the inference layer.
Strong background in kernel-level systems and low-level performance tuning.
Enjoy mentoring junior engineers without formal management responsibilities.
Excited about scaling high-performing AI systems for multimodal workloads.
Ability to navigate ambiguity and drive complex initiatives to completion.

Benefits

Hybrid work model with 3 days in the office per week.
Relocation assistance for new employees.

Categories

AI & ML Backend