Software Engineer, Frontier Systems
OpenAI
10 months ago
San Francisco, CA, USA
Senior
Base Salary
$250k - $445k/yr
Responsibilities
- Own and improve system health checks for hyperscale supercomputers.
- Lead investigations into hardware failures and system-level bugs.
- Build automation to monitor and fix issues across thousands of machines.
- Ensure minimal disruptions during model training.
Requirements
- 7+ years of industry experience in software engineering.
- Proficiency with Python and shell scripting.
- Comfort with SQL, PromQL, and Pandas for data analysis.
- Experience developing reproducible analyses.
- Strengths in building and operationalizing systems.
Tech Stack
LinuxPandasPythonSQL
Categories
AI & MLData EngineeringDevOps