San Francisco, CA, USA or Seattle, WA, USA
Mid Level / Senior
Base Salary
$293k - $455k/yr
Responsibilities
- Develop systems and tooling to measure, monitor, and improve token throughput across compute environments.
- Support performance benchmarking, tokenomics analysis, and model porting.
- Build tooling to integrate external infrastructure into OpenAI’s internal systems.
- Develop and monitor operational metrics including billing, usage, and reliability.
- Identify bottlenecks across hardware, networking, and software.
- Collaborate across teams to turn raw capacity into usable workload-serving capacity.
- Build dashboards and reporting systems for visibility into TaaS capacity and performance.
Requirements
- Strong software engineering background with experience in systems and tooling.
- Experience with compute infrastructure, distributed systems, or performance engineering.
- Ability to analyze token throughput and infrastructure efficiency.
- Comfortable integrating external systems into internal infrastructure.
- Strong analytical and debugging skills across multiple domains.
Categories
AI & ML, Backend, Data Engineering, DevOps