
Staff Machine Learning Engineer, Voice AI
Together AI (posted 1 day ago)
Base Salary
$220k - $280k/yr
Responsibilities
- Own the voice inference roadmap end-to-end, defining and executing the technical strategy for optimizing models.
- Drive best-in-class inference performance by architecting systems for optimal latency and throughput.
- Lead the productionization of voice models at scale, designing serving architecture for real-time audio.
- Build a rigorous evaluation platform for model selection and roadmap decisions.
- Shape architecture for next-generation model support, anticipating emerging paradigms.
- Serve as the technical DRI for model partner integrations, managing the full lifecycle.
- Diagnose and resolve performance issues through systematic profiling and analysis.
- Influence platform architecture to meet real-time voice API demands.
- Define and scale voice fine-tuning capabilities for customer differentiation.
- Lay technical foundations that future voice products can build on with minimal rework.
Requirements
- 8+ years of ML engineering experience focused on model serving and inference optimization.
- Deep expertise in LLM serving engines and experience modifying engine internals.
- Expert-level proficiency in Python and PyTorch with strong GPU optimization skills.
- Proven system design judgment with architectural decisions that held up at scale.
- Strong technical leadership and ability to define problems and raise engineering quality.
- Sharp product intuition for developer tooling in voice applications.
- Ability to thrive in ambiguous environments and early-stage teams.
- Strong foundation in speech and audio ML, with relevant experience preferred.
- Familiarity with audio codec and tokenization schemes is a plus.
- Experience training or fine-tuning speech models at scale is advantageous.
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or a related field.
Benefits
- Competitive compensation and startup equity.
- Health insurance and other competitive benefits.