GrepJob
Garner Health

Staff Machine Learning Operations Engineer

Garner Health
Apply
about 2 hours ago

Base Salary

$298k - $351k/yr

Responsibilities

  • Own the reliability, performance, functionality, and cost-efficiency of production ML systems.
  • Architect the ML platform including data infrastructure and standardized service patterns.
  • Implement ML-specific CI/CD pipelines for automated deployment processes.
  • Drive down cost and latency through improved architecture and model optimization.
  • Lay the foundation for a future MLOps team with workflows, standards, and KPIs.
  • Design and implement automated data drift monitoring systems.

Requirements

  • 7+ years of software engineering experience with ML or data-intensive systems.
  • Deep experience with the modern ML production stack including model serving and CI/CD.
  • Strong fundamentals in infrastructure and platform engineering, including Kubernetes and AWS.
  • Experience designing ML platforms or significant components of one.
  • Ability to collaborate with various engineering teams and set technical direction.
  • Healthcare or regulated-data experience is a plus.

Benefits

  • Flexible PTO.
  • Medical/Dental/Vision plan options.
  • 401(k) plan.
  • Equity incentive participation.

Tech Stack

Apache AirflowAWSDatadogKubernetesPythonSnowflakeTerraform

Categories

AI & MLData EngineeringDevOps