about 4 hours ago
Mountain View, CA, USA
Mid Level / Senior
H1B Sponsor
Base Salary
$160k - $241k/yr
Responsibilities
- Scale automated infrastructure-as-code (IaC) pipelines to manage thousands of GPU/CPU nodes.
- Design and optimize workload orchestration for maximum hardware utilization.
- Create robust pipelines for extracting and transforming petabyte-scale data.
- Implement feature caching and storage solutions for low-latency access.
- Contribute to a unified ML platform that abstracts complex cloud infrastructure.
Requirements
- 3+ years of professional experience in ML Infrastructure, Backend Platform Engineering, or Distributed Systems.
- Deep familiarity with Infrastructure-as-Code tools like Terraform or Pulumi.
- Hands-on experience with large-scale workload orchestrators such as Kubernetes.
- Proficiency in distributed processing frameworks like Apache Spark.
- Experience with feature stores and caching layers.
- Strong understanding of distributed systems and high-performance computing.
Benefits
- Eligible for an annual performance bonus.
- Equity options available.
- Competitive benefits package.
Tech Stack
Apache BeamApache SparkAWSAzureGoogle Cloud PlatformKubernetesRedisTerraform
Categories
AI & MLBackendData Engineering