
Senior Engineer 2: Inference Optimizations
DigitalOceanabout 3 hours ago
Seattle, WA, USA
Senior
H1B Sponsor
Base Salary
$167k - $209k/yr
Responsibilities
- Lead the technical strategy for benchmarking and performance optimizations at the inference engine and GPU kernel layers.
- Engineer solutions for complex performance issues, including attention layer optimizations and memory management.
- Implement cutting-edge optimization techniques to enhance DigitalOcean's AI capabilities.
- Act as a subject matter expert on modern GPU families and their software stacks.
- Conduct high-quality code and design reviews to elevate the technical standards of the team.
- Collaborate with Product Management to translate hardware limits into shippable product features.
- Maintain a strong presence in the GPU infrastructure and model performance optimization communities.
Requirements
- 5+ years of experience in high-performance computing or AI infrastructure.
- Deep familiarity with the Gen AI landscape, including major model families.
- Hands-on experience with attention-layer optimizations and parallelization strategies.
- Comprehensive understanding of NVIDIA and AMD GPU architectures and software ecosystems.
- Extensive experience with open-source software projects.
- Excellent system design skills related to low-level GPU programming.
- Experience acting as a technical lead and driving cross-functional alignment.
Benefits
- Competitive array of benefits including an Employee Assistance Program and flexible time off policy.
- Reimbursement for relevant conferences, training, and education.
- Access to LinkedIn Learning's 10,000+ courses for continued growth.
- Potential for bonuses based on company and individual performance.
- Equity compensation options for eligible employees.
Categories
AI & MLData Science