Software Engineer, Fleet Infrastructure
OpenAI
about 1 year ago
San Francisco, CA, USA
Mid Level / Senior
Base Salary
$230k - $490k/yr
Responsibilities
- Design, implement, and operate components of the compute fleet.
- Build job scheduling, cluster management, snapshot delivery, and CI/CD systems.
- Interface with researchers and product teams to understand workload requirements.
- Collaborate with hardware, infrastructure, and business teams to maintain high service reliability.
Requirements
- Experience with hyperscale compute systems.
- Strong programming skills.
- Experience working in public clouds, especially Azure.
- Familiarity with Kubernetes.
- Execution-focused mentality with rigorous attention to user requirements.
- Understanding of AI/ML workloads is a bonus.
Benefits
- Hybrid work model of 3 days in the office per week.
- Relocation assistance for new employees.
Tech Stack
Azure, Kubernetes
Categories
AI & ML, Backend, DevOps