about 1 month ago
San Francisco, CA, USAMid Level / Senior
Base Salary
$183k - $310k/yr
Responsibilities
- Design, deploy, and operate production inference infrastructure including model serving and autoscaling.
- Own the platform architecture for embedding and retrieval pipelines for multimodal robotics data.
- Build and maintain training and evaluation infrastructure for rapid model iteration.
- Drive cloud infrastructure decisions impacting latency, throughput, and reliability.
- Define platform abstractions and tooling for product engineers to ship ML features easily.
- Evaluate and integrate third-party ML infrastructure components.
Requirements
- Deep experience in production ML infrastructure including inference serving and model optimization.
- Strong foundation in distributed systems and cloud infrastructure (AWS/GCP).
- Experience architecting retrieval systems at scale with vector databases.
- A platform engineer's mindset with a focus on system reliability and ownership.
- Proven ability to make infrastructure tradeoffs independently and communicate effectively.
Benefits
- $300 monthly budget for commuter benefits or workspace setup.
- Competitive equity grant in a Series B company.
- 100% medical, dental, vision, and term life insurance for employees; 75% for dependents.
- 401(k) matching up to 4%.
- 4 weeks vacation plus holidays and winter break.
- All expenses paid company off-sites twice a year.
