GrepJob
DigitalOcean

Senior Engineer 2: Inference Optimizations

DigitalOcean
Apply
about 3 hours ago
Boston, MA, USA
Senior
H1B Sponsor

Base Salary

$167k - $209k/yr

Responsibilities

  • Lead the technical strategy for benchmarking and performance optimizations at the inference engine and GPU kernel layers.
  • Engineer solutions for complex performance issues, including attention layer optimizations and memory management.
  • Implement cutting-edge optimization techniques to enhance DigitalOcean's AI capabilities.
  • Act as a subject matter expert on modern GPU families and their software stacks.
  • Mentor team members through code and design reviews, elevating the technical standards.
  • Collaborate with Product Management to translate hardware limits into product features.
  • Maintain a presence in the GPU infrastructure and model performance optimization communities.

Requirements

  • 5+ years of experience in high-performance computing or AI infrastructure.
  • Deep familiarity with the Gen AI landscape, including major model families.
  • Hands-on experience with attention-layer optimizations and parallelization strategies.
  • Comprehensive understanding of NVIDIA and AMD GPU architectures and software ecosystems.
  • Extensive experience with open-source software projects.
  • Excellent system design skills related to low-level GPU programming.
  • Experience acting as a technical lead and driving cross-functional alignment.

Benefits

  • Competitive array of benefits including Employee Assistance Program and flexible time off policy.
  • Reimbursement for relevant conferences, training, and education.
  • Access to LinkedIn Learning's 10,000+ courses for continued growth.
  • Potential for bonuses based on performance and equity compensation options.

Categories

AI & MLData Engineering