ML Research Engineer

about 1 year ago

New York, NY, USAMid Level / Senior

Base Salary

$120k - $250k/yr

Responsibilities

Optimize speech recognition, large language models, and text-to-speech for real-world use.
Fine-tune LLMs with retrieval-augmented generation, reinforcement learning, and prompt engineering.
Integrate AI components into autonomous agents for complex tasks like scheduling and order-taking.
Create systems to monitor performance and improve models from real-world feedback.
Develop pipelines to construct knowledge graphs from business data.
Work with infrastructure teams to scale models across GPU/TPU clusters and edge devices.
Manage rapid experimentation, training, and production inference.
Lead evaluations and iterative improvements for robustness and scalability.
Balance research innovation with practical usability by collaborating with product teams.
Publish research and present at industry conferences.

3-7+ years of experience deploying impactful ML models, preferably in voice or NLP.
Deep knowledge in speech recognition, language models, and agent systems.
Proficiency in PyTorch or JAX; CUDA/Triton optimization experience preferred.
Proven ability to minimize latency and resource use on GPUs/TPUs.
Strong data-driven approach with measurable improvements.
Passion for creating intuitive AI experiences.
BS, MS, or PhD in Computer Science, Electrical Engineering, Mathematics, or equivalent.