Staff Software Engineer, Ads ML Inference Infrastructure
17 days ago
Palo Alto, CA, USA (+2 more locations)
Staff+
H1B Sponsor
Base Salary
$208k - $365k/yr
Responsibilities
- Lead efforts to build next-generation model inference and feature serving systems.
- Design and optimize low-latency, high-throughput inference pipelines.
- Partner with Ads ML and product teams to productionize new model architectures.
- Evolve the online feature platform to improve coverage and consistency for Ads models.
- Evaluate and integrate new technologies to advance the inference stack.
- Build partnerships with other teams to improve end-to-end reliability and developer velocity.
- Mentor and coach other engineers in technical decisions and system design.
Requirements
- BS (or higher) degree in Computer Science or a related field.
- 8+ years of experience designing and operating large-scale ML or distributed infrastructure systems.
- Deep knowledge of at least one programming language (Java, C++, or Python).
- Experience with distributed systems or recommendation/ads serving infrastructure.
- Hands-on experience with a deep learning framework (PyTorch or TensorFlow).
- Preferred: experience with model/hardware accelerator libraries (e.g., CUDA).
- Preferred: experience with inference optimization and serving frameworks (e.g., Triton).
- Proven track record of leading complex projects and mentoring engineers.
Tech Stack
C++, Java, Python, PyTorch, TensorFlow
Categories
AI & ML, Data Engineering