GrepJob
Pragmatike

CUDA Kernel Engineer (Remote US)

Pragmatike
Apply
17 days ago
Cambridge, MA, USA +5 moreMid Level / Senior

Responsibilities

  • Design, implement, and optimize custom CUDA kernels for NVIDIA GPUs.
  • Profile GPU workloads using tools like Nsight Compute and nvprof.
  • Analyze and eliminate performance bottlenecks in GPU workloads.
  • Improve GPU memory pipelines and ensure proper memory coalescing.
  • Collaborate with AI systems and backend distributed systems teams.
  • Contribute to GPU architecture decisions and performance-engineering best practices.

Requirements

  • Proven experience building NVIDIA CUDA kernels from scratch.
  • Strong optimization skills for kernels, including tiling strategies and warp scheduling.
  • Deep understanding of CUDA threads, warps, and GPU memory hierarchy.
  • Experience diagnosing PCIe bottlenecks and optimizing host-device transfers.
  • Familiarity with C++, CUDA runtime APIs, and GPU debugging tools.

Benefits

  • Competitive salary & equity options
  • Sign-on bonus
  • Health, Dental, and Vision insurance
  • 401k plan

Tech Stack

C++

Categories