7 days ago
Responsibilities
- Design and implement kernels for linear algebra and tensor operations in C++.
- Own performance and correctness by adding microbenchmarks and regression tests.
- Profile and optimize for next-generation AI hardware.
- Debug issues and improve product quality and functionality.
- Engage in Agile practices within the team.
- Mentor colleagues and share knowledge.
Requirements
- Excellent programming and scripting skills in C++ and Python.
- Understanding of processor architectures and profiling on Linux.
- Experience testing numerical and performance-sensitive code.
- Hands-on experience with reproducibility, tolerance design, and benchmarking.
- Strong communication skills and a collaborative work ethic.
- Desirable: Strong command of algorithmic performance and experience with BLAS/DNN stacks.
Benefits
- Competitive salary and annual leave policy.
- Medical and dental health plans.
- Gym card and employee pension matched up to 4%.
- Commitment to building an inclusive work environment.