3 days ago
Responsibilities
- Design and implement kernels for linear algebra and tensor operations in C++.
- Own performance and correctness by adding microbenchmarks and regression tests.
- Profile and optimize for next-generation AI hardware focusing on threading and memory layout.
- Debug issues and improve the quality and functionality of the product.
- Engage in Agile practices within the team.
- Mentor colleagues and share knowledge.
Requirements
- Excellent programming and scripting skills in C++ and Python.
- Understanding of processor architectures and profiling on Linux.
- Strong written and oral communication skills.
- Ability to work well in a team and produce quality work.
- Desirable: Strong command of algorithmic performance and experience with BLAS/DNN stacks.
Benefits
- Competitive salary and annual leave policy.
- Medical and dental health plans.
- Gym card and employee pension matched up to 4%.
- Commitment to building an inclusive work environment.