Databricks

Senior GenAI Research Engineer - Optimization and Kernels

Databricks

Apply
3 months ago
San Francisco, CA, USA
Senior
H1B Sponsor

Base Salary

$166k - $225k/yr

Responsibilities

  • Drive performance improvements through advanced optimization techniques.
  • Design, implement, and optimize high-performance GPU kernels for training workloads.
  • Create distributed training frameworks for large language models.
  • Profile, debug, and optimize end-to-end training workflows.

Requirements

  • BS/MS/PhD in Computer Science or related field.
  • Hands-on experience writing and tuning CUDA kernels for ML training applications.
  • Strong understanding of NVIDIA GPU architecture and proficiency with CUDA debugging tools.
  • Deep understanding of parallelism techniques and memory optimization strategies.
  • Strong software engineering skills in Python and PyTorch.

Tech Stack

PythonPyTorch

Categories

AI & MLBackendData Science