GrepJob
DigitalOcean

Senior Engineer 2: Inference Optimizations

DigitalOcean
Apply
about 4 hours ago
Austin, TX, USA
Senior
H1B Sponsor

Base Salary

$167k - $209k/yr

Responsibilities

  • Lead the technical strategy for benchmarking and performance optimizations.
  • Engineer solutions for complex performance issues in AI inference.
  • Implement cutting-edge optimization techniques for AI models.
  • Act as a subject matter expert on modern GPU families and software stacks.
  • Conduct high-quality code and design reviews to mentor team members.
  • Collaborate with product management to translate hardware limits into product features.
  • Contribute to the GPU infrastructure and model performance optimization communities.

Requirements

  • 5+ years of experience in high-performance computing or AI infrastructure.
  • Deep familiarity with the Gen AI landscape and major model families.
  • Hands-on experience with attention-layer optimizations and parallelization strategies.
  • Comprehensive understanding of NVIDIA and AMD GPU architectures.
  • Extensive experience with open-source software projects.
  • Excellent system design skills related to low-level GPU programming.
  • Experience acting as a technical lead and driving cross-functional alignment.

Benefits

  • Competitive array of benefits including Employee Assistance Program and flexible time off.
  • Reimbursement for relevant conferences, training, and education.
  • Access to LinkedIn Learning's 10,000+ courses for continued growth.
  • Potential for bonuses and equity compensation based on performance.

Categories

AI & MLData Engineering