Senior DevOps Engineer (Infrastructure & MLOps)

15 days ago
Remote, Worldwide
Senior

Base Salary

$180k - $225k/yr

Responsibilities

  • Design, implement, and manage highly available infrastructure for cloud-based platforms.
  • Architect and support AI/ML infrastructure, managing AWS Lambda and SageMaker environments.
  • Create and automate deployment pipelines using CI/CD tools for web applications and machine learning models.
  • Build, maintain, and scale containerized applications with Docker and ECS/Fargate.
  • Implement MLOps best practices for moving models from development to production.
  • Ensure system scalability and reliability through proactive monitoring and automated alerting.
  • Collaborate with Product Engineers and Data Scientists to optimize performance and costs.
  • Manage and evolve the Infrastructure as Code (IaC) footprint.

Requirements

  • 5+ years of experience in a DevOps or infrastructure role.
  • Expert knowledge of cloud platforms such as AWS, GCP, and Azure.
  • Strong experience with containerization technologies like Docker and Kubernetes.
  • Proven track record of designing and managing complex CI/CD pipelines.
  • Experience with MLOps workflows including model versioning and retraining pipelines.
  • Hands-on experience with monitoring and logging tools like Datadog and Prometheus.
  • Expertise in scripting languages, particularly Python.
  • Proficiency with infrastructure automation tools such as Terraform or Ansible.

Benefits

  • Competitive salaries.
  • Remote/hybrid work environment.
  • Potential equity compensation for outstanding performance.
  • Flexible PTO.
  • Company-wide sponsored lunches.
  • Company-paid disability and life insurance benefits.
  • Company-paid family and medical leave.
  • Medical, dental, and vision insurance benefits.
  • Discounted pet insurance.
  • FSA/DCA and commuter benefits.
  • 401(k).
  • Complimentary subscription to digital fitness classes and wellness content.
  • Recovery suite at HQ with cold plunge, sauna, and shower.

Tech Stack

Ansible, AWS, Azure, Bash, Datadog, Docker, GitHub Actions, Go, Google BigQuery, Google Cloud Platform, Grafana, Kubernetes, MLflow, Prometheus, Python, SQL, Terraform

Categories