
Member of Technical Staff - ML Engineering
Latent Labs3 months ago
London, United KingdomMid Level / Senior
Responsibilities
- Deploy, maintain, and optimize production and research compute clusters.
- Design and implement scalable and efficient ML inference solutions.
- Develop dynamic compute solutions for balancing research and production needs.
- Contribute to productizing model APIs for external use.
- Develop infrastructure observability and monitoring solutions.
Requirements
- Deep experience with Kubernetes and containerized workflows.
- Experience with major cloud platforms (AWS, GCP, Azure).
- Knowledge of DevOps and related tools (Terraform, etc).
- Knowledge of HPC frameworks (Slurm, Ray, etc).
- Production engineering and reliability experience.
- Experience with PyTorch and distributed computing.
Benefits
- Private health insurance.
- Pension/401(K) contributions.
- Generous leave policies including gender neutral parental leave.
- Hybrid working.
- Travel opportunities and more.