about 1 month ago
Remote, United StatesSenior / Staff+
H1B Sponsor
Base Salary
$180k - $200k/yr
Responsibilities
- Design, build, and maintain ML infrastructure across training, evaluation, serving, and monitoring.
- Own data pipelines including generation, cleaning, validation, and versioning.
- Build and improve experiment tracking, orchestration, and reproducibility tooling.
- Implement and maintain CI/CD pipelines and A/B testing infrastructure.
- Lead and formalize design review processes across the team.
- Identify architectural risks early and guide the team toward sustainable systems decisions.
Requirements
- 5+ years of industry experience building and operating ML systems in production.
- Proven track record as a key player in the success of a production-grade ML system end-to-end.
- Deep familiarity with training pipelines, serving infrastructure, and experiment management.
- Strong software engineering fundamentals and systems design sensibility.
- Experience driving design reviews and improving engineering processes within a team.
- Comfort operating with high autonomy in ambiguous problem spaces.
- Experience with GPU-accelerated workloads and orchestration.
- Strong communication and collaboration skills.
Benefits
- Interesting and challenging work.
- Competitive salary and equity benefits.
- Health, dental, and vision insurance.
- Regular team events and offsites (~4x / year).
- Unlimited paid time off.
- Paid parental leave.
