
Senior Machine Learning Engineer, Voice AI
Together AI4 days ago
Base Salary
$200k - $260k/yr
Responsibilities
- Optimize inference performance for voice models targeting best-in-class TTFB, throughput, and GPU utilization.
- Productionize voice models on serverless and dedicated endpoints, including batching strategies and streaming inference.
- Build and maintain a voice model evaluation framework measuring WER and naturalness across various conditions.
- Enable new model architectures in the serving stack as the field evolves.
- Collaborate with model partners to integrate and optimize their models on Together's infrastructure.
- Profile and debug performance across the full inference stack and implement measurable improvements.
- Work with the platform engineering team to meet latency and reliability requirements for real-time voice APIs.
- Contribute to voice model fine-tuning capabilities for customer differentiation.
Requirements
- 5+ years of experience in ML engineering with a focus on model serving and inference optimization.
- Hands-on experience with LLM serving engines and comfortable modifying engine internals.
- Strong proficiency in Python and PyTorch, with experience in GPU profiling and optimization.
- Track record of shipping ML systems to production with measurable performance improvements.
- Strong product sense focused on developer needs in voice applications.
- Comfortable working in a small, fast-paced, early-stage team environment.
- Experience with speech and audio ML is a strong plus but not required.
- Familiarity with audio codecs and tokenization schemes is a plus.
- Experience training or fine-tuning speech models is a plus.
- Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field.
Benefits
- Competitive compensation and startup equity.
- Health insurance and other competitive benefits.