about 1 year ago
New York, NY, USAMid Level / Senior
Base Salary
$120k - $250k/yr
Responsibilities
- Optimize speech recognition, large language models, and text-to-speech for real-world use.
- Fine-tune LLMs with retrieval-augmented generation, reinforcement learning, and prompt engineering.
- Integrate AI components into autonomous agents for complex tasks like scheduling and order-taking.
- Create systems to monitor performance and improve models from real-world feedback.
- Develop pipelines to construct knowledge graphs from business data.
- Work with infrastructure teams to scale models across GPU/TPU clusters and edge devices.
- Manage rapid experimentation, training, and production inference.
- Lead evaluations and iterative improvements for robustness and scalability.
- Balance research innovation with practical usability by collaborating with product teams.
- Publish research and present at industry conferences.
Requirements
- 3-7+ years of experience deploying impactful ML models, preferably in voice or NLP.
- Deep knowledge in speech recognition, language models, and agent systems.
- Proficiency in PyTorch or JAX; CUDA/Triton optimization experience preferred.
- Proven ability to minimize latency and resource use on GPUs/TPUs.
- Strong data-driven approach with measurable improvements.
- Passion for creating intuitive AI experiences.
- BS, MS, or PhD in Computer Science, Electrical Engineering, Mathematics, or equivalent.
Benefits
- Competitive salary and meaningful equity.
- Strong in-person culture with fast feedback loops.
- Full health, dental, vision, 401k, life insurance, and unlimited PTO.
- Tools budget and resources to support your work.
Tech Stack
PyTorch
Categories
AI & MLData Science
