
Senior Engineer 2: Inference Optimizations
DigitalOceanabout 4 hours ago
Austin, TX, USA
Senior
H1B Sponsor
Base Salary
$167k - $209k/yr
Responsibilities
- Lead the technical strategy for benchmarking and performance optimizations.
- Engineer solutions for complex performance issues in AI inference.
- Implement cutting-edge optimization techniques for AI models.
- Act as a subject matter expert on modern GPU families and software stacks.
- Conduct high-quality code and design reviews to mentor team members.
- Collaborate with product management to translate hardware limits into product features.
- Contribute to the GPU infrastructure and model performance optimization communities.
Requirements
- 5+ years of experience in high-performance computing or AI infrastructure.
- Deep familiarity with the Gen AI landscape and major model families.
- Hands-on experience with attention-layer optimizations and parallelization strategies.
- Comprehensive understanding of NVIDIA and AMD GPU architectures.
- Extensive experience with open-source software projects.
- Excellent system design skills related to low-level GPU programming.
- Experience acting as a technical lead and driving cross-functional alignment.
Benefits
- Competitive array of benefits including Employee Assistance Program and flexible time off.
- Reimbursement for relevant conferences, training, and education.
- Access to LinkedIn Learning's 10,000+ courses for continued growth.
- Potential for bonuses and equity compensation based on performance.
Categories
AI & MLData Engineering