about 4 hours ago
San Francisco, CA, USA
Mid Level / Senior
Base Salary
$380k - $555k/yr
Responsibilities
- Build and scale retrieval infrastructure across indexing, serving, and query execution.
- Develop low-latency, high-throughput systems for real-time model interaction.
- Partner with research to productionize embedding and retrieval techniques.
- Support dense, sparse, and hybrid retrieval pipelines.
- Own system performance, reliability, and observability at scale.
- Collaborate across Pretraining, Inference, and Product teams to integrate retrieval end-to-end.
- Contribute to model-system interfaces for agentic workflows.
Requirements
- Experience building and scaling distributed systems.
- Background in search, retrieval, or indexing systems.
- Familiarity with embedding-based or ML-powered systems.
- Experience with performance optimization and production reliability.
- Ability to work across ML and systems boundaries.
- First-principles thinking in ambiguous problem spaces.
Categories
AI & MLBackendData Engineering