about 4 hours ago
London, United Kingdom
Mid Level / Senior
H1B Sponsor
Responsibilities
- Build and improve the RL training infrastructure that researchers depend on day-to-day.
- Identify and remove bottlenecks across the RL stack: debugging, profiling, and rearchitecting where needed.
- Partner closely with researchers and adjacent engineering teams to understand pain points and ship tooling that makes them faster.
- Own the reliability and performance of research runs end-to-end.
- Contribute to design decisions that shape how Anthropic does RL at scale.
Requirements
- Have strong software engineering fundamentals and a track record of building performant, reliable systems.
- Have worked on ML infrastructure, distributed systems, or research tooling.
- Care about enabling other people's work and find leverage through platforms rather than individual experiments.
- Are comfortable operating across the stack, from low-level performance work to RL algorithms.
- Have a bias toward shipping and iterating quickly, with a mix of high agency and low ego.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- A lovely office space for collaboration.
Tech Stack
PyTorch
Categories
AI & MLData Engineering