GrepJob
Graphcore

Infrastructure and MLOps Engineer

Graphcore
Apply
about 3 hours ago
Cambridge, United KingdomMid Level / Senior
H1B Sponsor

Responsibilities

  • Develop, own, and maintain tools and services to support AI research and engineering teams.
  • Deploy and maintain services with Kubernetes and Docker.
  • Manage Cloud Infrastructure using tools such as Terraform.

Requirements

  • Knowledge of Python.
  • Familiarity with cloud services (e.g. AWS).
  • Experience managing or developing in Linux environments.
  • Understanding of CI/CD principles.
  • Experience using Kubernetes (k8s).
  • Experience maintaining machine learning applications.
  • Experience deploying ML orchestration tools (e.g. NV Ray, KFP, SkyPilot).
  • Experience managing ML accelerator hardware (e.g. DCGM).
  • Experience with Infrastructure as Code (IaC) tools (e.g. Terraform/OpenTofu).
  • Experience with GitHub Actions.
  • Experience with modern observability tooling (e.g. Prometheus).
  • Experience with Grafana.
  • Knowledge of Go/Java/C++ (or similar language).

Benefits

  • Flexible working arrangements.
  • Generous annual leave policy.
  • Private medical insurance and health cash plan.
  • Dental plan and pension matched up to 5%.
  • Life assurance and income protection.
  • Generous parental leave policy.
  • Employee assistance programme for health and mental wellbeing support.
  • Healthy food and snacks at the office.

Categories

AI & MLData EngineeringDevOps