GrepJob
Deepgram

ML Ops Infrastructure Engineer

Deepgram
Apply
3 months ago
Remote, Worldwide or New York, NY, USAMid Level / Senior
H1B Sponsor

Base Salary

$160k - $220k/yr

Responsibilities

  • Design and build CI/CD pipelines for ML model development and deployment.
  • Architect and maintain model deployment pipelines from research to production.
  • Build A/B testing infrastructure for controlled rollouts of new models.
  • Implement comprehensive monitoring for model performance in production.
  • Develop automated retraining pipelines based on data changes or performance issues.
  • Create and maintain build and test environments that mirror production.
  • Establish model versioning and rollback capabilities for safe deployments.
  • Collaborate with research engineers to define model quality gates.
  • Build observability dashboards for real-time model health insights.
  • Optimize model serving infrastructure for latency and cost efficiency.

Requirements

  • 4+ years of experience in MLOps, DevOps, or infrastructure engineering focused on ML systems.
  • Strong proficiency in Python and experience with ML workflow automation.
  • Deep experience with CI/CD systems for software and model delivery.
  • Hands-on experience with Docker and Kubernetes for workload management.
  • Practical experience deploying and serving ML models in production.
  • Familiarity with model evaluation and quality assurance processes.
  • Understanding of monitoring and observability principles for ML systems.
  • Strong problem-solving skills with a bias toward automation.

Benefits

  • Medical, dental, and vision benefits.
  • Annual wellness stipend and mental health support.
  • Unlimited PTO and generous paid parental leave.
  • Flexible schedule and 12 paid US company holidays.
  • 401(k) plan with company match and tax savings programs.
  • Learning and education stipend for continuous learning opportunities.

Tech Stack

DatadogDockerGrafanaKubernetesPrometheusPythonTerraform

Categories