
Senior Engineer 2: Inference Data Plane
DigitalOceanabout 5 hours ago
Seattle, WA, USA
Senior / Staff+
H1B Sponsor
Base Salary
$167k - $209k/yr
Responsibilities
- Act as a technical leader on the team, driving the design and delivery of data plane components.
- Architect and refine system design proposals for a multi-tenant AI inference cloud ecosystem.
- Implement and optimize distributed inference hosting using advanced techniques.
- Collaborate with Product Managers and other teams to align technical roadmaps with customer needs.
- Coach and mentor junior engineers to foster a culture of technical excellence.
- Maintain and operate critical high-scale services using observability tools.
Requirements
- Strong experience with distributed systems, microservices, and infrastructure as code.
- Hands-on experience with large language or multimodal models using inference engines.
- Familiarity with distributed inference serving frameworks like llm-d or NVIDIA Dynamo.
- Understanding of GPU-level optimization and interconnect technologies.
- Knowledge of common LLM architectures and optimization techniques.
- Expert-level proficiency in GoLang or Python and familiarity with gRPC.
- Proven experience in shipping customer-facing software products in a high-scale environment.
- Experience integrating and building with open-source software.
Benefits
- Competitive array of benefits including an Employee Assistance Program and flexible time off policy.
- Reimbursement for relevant conferences, training, and education.
- Access to LinkedIn Learning's 10,000+ courses for continued growth.
- Potential for bonuses and equity compensation based on performance.
Tech Stack
GogRPCPython
Categories
AI & MLBackendData Engineering