5 days ago
Remote, United States
Mid Level / Senior
Base Salary
$144k - $160k/yr
Responsibilities
- Utilize profiling tools to identify bottlenecks in data loading, gradient computation, and communication.
- Implement optimizations like kernel fusion, sharding, and tiling to improve step time.
- Optimize distributed training pipelines using frameworks such as PyTorch Distributed.
- Design and maintain high-performance GPU kernels in Triton or CUDA.
- Optimize robust data loading pipelines to maximize training throughput.
Requirements
- Bachelor’s, Master’s degree, or PhD in Computer Science, Computer Engineering, or a related technical discipline.
- Strong proficiency in Python.
- Extensive hands-on experience with PyTorch.
- Experience optimizing machine learning model execution during training and inference.
- Exceptional analytical and problem-solving skills with a data-driven approach.
Benefits
- Medical, dental, and vision insurance.
- 401k with a company match.
- Health savings accounts.
- Life insurance and pet insurance.
- Flexible hybrid work schedule or fully remote option.
Tech Stack
PythonPyTorch
Categories
AI & MLData Engineering