Senior Engineer 2: Inference Optimizations

5 months ago

H1B Sponsor

Base Salary

$167k - $209k/yr

Responsibilities

Lead the technical strategy for benchmarking and performance optimizations at the inference engine and GPU kernel layers.
Engineer solutions for complex performance issues, including attention layer optimizations and memory management.
Implement cutting-edge optimization techniques to enhance DigitalOcean's AI capabilities.
Act as a subject matter expert on modern GPU families and their software stacks.
Conduct high-quality code and design reviews to elevate the technical standards of the team.
Collaborate with Product Management to translate hardware limits into shippable product features.
Maintain a strong presence in the GPU infrastructure and model performance optimization communities.

Qualifications

5+ years of experience in high-performance computing or AI infrastructure.
Deep familiarity with the generative AI landscape, including major model families.
Hands-on experience with attention-layer optimizations and parallelization strategies.
Comprehensive understanding of NVIDIA and AMD GPU architectures and software ecosystems.
Extensive experience with open-source software projects.
Excellent system design skills related to low-level GPU programming.
Experience acting as a technical lead and driving cross-functional alignment.

Benefits

Competitive array of benefits, including an Employee Assistance Program and a flexible time-off policy.
Reimbursement for relevant conferences, training, and education.
Access to LinkedIn Learning's 10,000+ courses for continued growth.
Potential for bonuses based on company and individual performance.
Equity compensation options for eligible employees.