Staff Machine Learning Engineer, Voice AI

H1B Sponsor

Base Salary

$220k - $280k/yr

Responsibilities

Own the voice inference roadmap end-to-end, defining and executing the technical strategy for optimizing models.
Drive best-in-class inference performance by architecting systems for optimal latency and throughput.
Lead the productionization of voice models at scale, designing serving architecture for real-time audio.
Build a rigorous evaluation platform for model selection and roadmap decisions.
Shape architecture for next-generation model support, anticipating emerging paradigms.
Serve as the technical DRI for model partner integrations, managing the full lifecycle.
Diagnose and resolve performance issues through systematic profiling and analysis.
Influence platform architecture to meet real-time voice API demands.
Define and scale voice fine-tuning capabilities for customer differentiation.
Lay technical foundations for future voice products with minimal rework.

Qualifications

8+ years of ML engineering experience focused on model serving and inference optimization.
Deep expertise in LLM serving engines and experience modifying engine internals.
Expert-level proficiency in Python and PyTorch with strong GPU optimization skills.
Proven system design judgment, demonstrated by architectural decisions that held up at scale.
Strong technical leadership, with the ability to define problems and raise engineering quality.
Sharp product intuition for developer tooling in voice applications.
Ability to thrive in ambiguous environments and early-stage teams.
Strong foundation in speech and audio ML, with relevant experience preferred.
Familiarity with audio codec and tokenization schemes is a plus.
Experience training or fine-tuning speech models at scale is advantageous.
Bachelor's or Master's in Computer Science, Electrical Engineering, or related field.