GrepJob
Hippocratic AI

Senior Machine Learning Engineer

Hippocratic AI
Apply
4 days ago
Palo Alto, CA, USASenior
H1B Sponsor

Responsibilities

  • Design and build evaluation frameworks for LLM safety, clinical accuracy, and conversational quality.
  • Develop synthetic data generation pipelines to stress-test models across diverse clinical scenarios.
  • Build automated and human-in-the-loop evaluation pipelines at scale.
  • Create benchmarks, metrics, and LLM-as-judge systems for healthcare tasks and conversational experience.
  • Analyze failure modes and translate findings into actionable model improvements by collaborating with the LLM post-training team.
  • Collaborate with research, engineering, and clinical teams to define and raise the quality bar.

Requirements

  • MS or PhD in CS or related field.
  • 5+ years in ML engineering, evaluation systems, or applied ML.
  • Strong software engineering skills — Python, PyTorch, and production-quality code.
  • Hands-on experience with LLM evaluation, benchmarking, or synthetic data generation.
  • Comfort building robust data analysis and evaluation infrastructure, not just running experiments.
  • Experience with UI/UX and front-end development toolkits such as Streamlit, Gradio, React, etc.

Tech Stack

PythonPyTorchReact

Categories

AI & MLData ScienceFrontend