about 4 hours ago
London, United KingdomMid Level / Senior
H1B Sponsor
Responsibilities
- Design, run, and interpret large-scale RL experiments.
- Investigate improvements in RL as horizon, compute, and model size grow.
- Build and maintain benchmarks for long-horizon RL.
- Translate validated findings into production training recipes.
- Debug complex issues at the intersection of research and infrastructure.
- Collaborate with adjacent RL teams to enhance the overall RL stack.
Requirements
- Strong empirical research skills in Reinforcement Learning or large-scale ML training.
- Demonstrated ability to manage large experiments from design to interpretation.
- Proficiency in Python and experience with large-scale or distributed ML systems.
- Comfort operating at the research/systems boundary, including debugging.
- A commitment to the societal impacts of AI and responsible scaling.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office space.