over 2 years ago
Remote, Worldwide or San Francisco, CA, USAMid Level / Senior
H1B Sponsor
Base Salary
$225k - $550k/yr
Responsibilities
- Design and implement kernels for high-performance long-context behavior.
- Own the design, implementation, deployment, and production reliability of kernels.
- Focus on robustness, extensive testing, and functional correctness while enhancing performance.
- Evaluate porting compute kernels to alternative hardware options.
- Co-design kernels in collaboration with training, inference, and RL teams.
Requirements
- Low-level programming experience targeting AI accelerators like NVIDIA Blackwell or Google TPUs.
- Experience developing and optimizing GPU kernels in frameworks such as NCCL, Triton, and Flash-Attention.
- Familiarity with other kernel authoring frameworks like Pallas and Mojo.
- Deep expertise in computer architecture, low-level machine optimizations, and code generation.
- Agility, ownership mindset, and grit.
Benefits
- Annual salary ranges between $225K - $550K based on experience.
- Equity is a significant part of total compensation.
- 401(k) plan with 6% salary matching.
- Generous health, dental, and vision insurance for you and your dependents.
- Unlimited paid time off.
- Visa sponsorship and relocation stipend available.
