Senior Machine Learning Engineer

2 months ago

Palo Alto, CA, USASenior

H1B Sponsor

Responsibilities

Design and build evaluation frameworks for LLM safety, clinical accuracy, and conversational quality.
Develop synthetic data generation pipelines to stress-test models across diverse clinical scenarios.
Build automated and human-in-the-loop evaluation pipelines at scale.
Create benchmarks, metrics, and LLM-as-judge systems for healthcare tasks and conversational experience.
Analyze failure modes and translate findings into actionable model improvements by collaborating with the LLM post-training team.
Collaborate with research, engineering, and clinical teams to define and raise the quality bar.

MS or PhD in CS or related field.
5+ years in ML engineering, evaluation systems, or applied ML.
Strong software engineering skills — Python, PyTorch, and production-quality code.
Hands-on experience with LLM evaluation, benchmarking, or synthetic data generation.
Comfort building robust data analysis and evaluation infrastructure, not just running experiments.
Experience with UI/UX and front-end development toolkits such as Streamlit, Gradio, React, etc.