1 day ago
Vancouver, CanadaMid Level / Senior
Responsibilities
- Support the development and execution of the infrastructure roadmap for DevOps and MLOps.
- Partner with engineering and data teams to ensure scalability, reliability, and automation.
- Evaluate and adopt tools to improve system efficiency and operational reliability.
- Build and manage infrastructure for deploying ML models using CI/CD pipelines.
- Automate training pipelines and model deployment processes.
- Monitor model performance and resource usage using various tools.
- Collaborate with teams to streamline CI/CD workflows and deployment processes.
- Stay current with trends in cloud-native, DevOps, and MLOps practices.
Requirements
- 2–5 years of hands-on DevOps or cloud engineering experience.
- Expertise with Kubernetes (EKS), Helm, and microservices.
- 1-2 years of experience with AWS and SageMaker.
- Experience creating data pipelines and deploying LLMs on Kubernetes.
- Proven track record using Terraform for Infrastructure as Code.
- A collaborative mindset with a passion for automation and continuous improvement.