Software Engineer, Distributed Systems
OpenAI
4 months ago
San Francisco, CA, USA
Senior
Base Salary
$293k - $490k/yr
Responsibilities
- Architect and build the gateway/network load balancer for research jobs.
- Design traffic stickiness and routing strategies for reliability and throughput.
- Instrument and debug complex distributed systems with observability tools.
- Collaborate with researchers and ML engineers on infrastructure decisions.
- Own the end-to-end system lifecycle from design to operation and scaling.
- Contribute across layers of the stack, focusing on performance tuning.
Requirements
- Deep experience in designing and operating large-scale distributed systems.
- 5+ years of experience in software engineering and systems architecture.
- Strong debugging skills with a focus on distributed failures.
- Proficiency in writing and reviewing production code in Rust or similar languages.
- Experience in big tech or high-growth environments.
Tech Stack
Ambassador, C, C++, Go, gRPC, Java, Rust
Categories
AI & ML, Backend