about 2 hours ago
Base Salary
$230k - $385k/yr
Responsibilities
- Design and build the core agent harness and execution loop for Codex agents.
- Develop sandboxing, isolation, orchestration, and workflow infrastructure for agents.
- Create evaluation and debugging systems to identify issues across the agent stack.
- Run experiments to improve solve rate, reliability, latency, and cost.
- Enhance observability and diagnostics across the agent stack.
- Collaborate with research to make the harness trainable and useful for model improvement.
- Build shared primitives to enhance Codex's performance and usability.
Requirements
- Experience building or operating production systems in distributed systems or ML.
- Proficiency in Rust systems code and Python configuration layers.
- Hands-on experience with LLM applications and model deployment.
- Strong debugging skills and ability to address production failures.
- Desire to work closely with research while delivering production changes.
- Ability to write meaningful code and lead AI systems projects.