2 months ago
Toronto, Canada
Intern
H1B Sponsor
Responsibilities
- Implement high-performance kernels in low-level languages.
- Develop, test, and tune kernels for machine learning models.
- Create and automate reference implementations and unit tests.
- Analyze scalability and performance, collect metrics, and troubleshoot bottlenecks.
- Package and share implementations with partner teams.
Requirements
- Ability to implement high-performance kernels in low-level languages.
- Proficiency in Python and/or C++.
- Solid background in machine learning model architectures.
- Experience with ML frameworks such as PyTorch and ML packages such as NumPy.
- General understanding of computer architecture.
- Currently enrolled in a graduate program in a relevant discipline.
