Staff Machine Learning Engineer, Training Runtime Performance
Nuro
5 months ago
Mountain View, CA, USA
Staff+
H1B Sponsor
Base Salary
$235k - $352k/yr
Responsibilities
- Collaborate with ML practitioners to integrate optimized input pipelines into workflows.
- Detect, diagnose, and resolve performance bottlenecks in training and evaluation workflows.
- Optimize training performance and resource utilization for consistent model outcomes.
- Enhance input data pipelines to maximize runtime goodput.
- Champion best practices for robust and debuggable ML experimentation.
Requirements
- B.S./M.S./Ph.D. in Computer Science, Electrical Engineering, or related field.
- 4+ years of experience in ML infrastructure or systems engineering.
- Understanding of workflows for billion-parameter models.
- Expert-level knowledge in distributed systems and Python.
- Strong skills in profiling and optimizing quantized workloads.
- Experience with ML compilers and reducing startup overhead.
Benefits
- Eligible for an annual performance bonus.
- Equity options available.
- Competitive benefits package.
Tech Stack
C++Python
Categories
AI & MLData Engineering