Senior Machine Learning Engineer, Voice AI

about 2 months ago

H1B Sponsor

Base Salary

$200k - $260k/yr

Responsibilities

Optimize inference performance for voice models targeting best-in-class TTFB, throughput, and GPU utilization.
Productionize voice models on serverless and dedicated endpoints, including batching strategies and streaming inference.
Build and maintain a voice model evaluation framework measuring WER and naturalness across various conditions.
Enable new model architectures in the serving stack as the field evolves.
Collaborate with model partners to integrate and optimize their models on Together's infrastructure.
Profile and debug performance across the full inference stack and implement measurable improvements.
Work with the platform engineering team to meet latency and reliability requirements for real-time voice APIs.
Contribute to voice model fine-tuning capabilities for customer differentiation.

5+ years of experience in ML engineering with a focus on model serving and inference optimization.
Hands-on experience with LLM serving engines and comfortable modifying engine internals.
Strong proficiency in Python and PyTorch, with experience in GPU profiling and optimization.
Track record of shipping ML systems to production with measurable performance improvements.
Strong product sense focused on developer needs in voice applications.
Comfortable working in a small, fast-paced, early-stage team environment.
Experience with speech and audio ML is a strong plus but not required.
Familiarity with audio codecs and tokenization schemes is a plus.
Experience training or fine-tuning speech models is a plus.
Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field.