
Senior Machine Learning Engineer
Hippocratic AI
4 days ago
Responsibilities
- Design and build evaluation frameworks for LLM safety, clinical accuracy, and conversational quality.
- Develop synthetic data generation pipelines to stress-test models across diverse clinical scenarios.
- Build automated and human-in-the-loop evaluation pipelines at scale.
- Create benchmarks, metrics, and LLM-as-judge systems for healthcare tasks and conversational experience.
- Analyze failure modes and work with the LLM post-training team to translate findings into actionable model improvements.
- Collaborate with research, engineering, and clinical teams to define and raise the quality bar.
Requirements
- MS or PhD in CS or a related field.
- 5+ years in ML engineering, evaluation systems, or applied ML.
- Strong software engineering skills: Python, PyTorch, and production-quality code.
- Hands-on experience with LLM evaluation, benchmarking, or synthetic data generation.
- Comfort building robust data analysis and evaluation infrastructure, not just running experiments.
- Experience with UI/UX and front-end development toolkits such as Streamlit, Gradio, or React.