about 1 month ago
New York, NY, USAMid Level / Senior
Base Salary
$130k - $230k/yr
Responsibilities
- Design and maintain cloud infrastructure to support real-time model serving.
- Deploy and optimize APIs for low-latency access to ML models.
- Manage and optimize the end-to-end training data flow.
- Build observability tooling for production ML pipelines.
- Automate model deployment, retraining, and evaluation pipelines.
- Work with ML engineers to package models for serving.
- Help manage vector databases and semantic search infrastructure.
- Ensure security, compliance, and uptime of infrastructure.
Requirements
- 3–8 years of experience deploying machine learning systems or high-availability backend systems.
- Experience with production infrastructure at scale supporting ML workflows.
- Familiarity with GCP, AWS, or similar platforms.
- Proficiency in Terraform, Docker, Kubernetes, or similar tools.
- Understanding of performance tradeoffs in serving models.
- Ability to work cross-functionally with ML, security, and product teams.
- A builder's mindset and bias for ownership in ambiguous environments.
Benefits
- Salary range of $130K–$230K, depending on experience and location.
- Performance-based annual bonus.
- Support for continuing education, conferences, or training.
- Fully remote, U.S.-based work environment.
- Comprehensive health, dental, and vision coverage.
- Generous PTO and paid holiday schedule.
- 401(k) plan.
