GrepJob
Anthropic

Research Engineer, RL Scaling Science

Anthropic
Apply
about 4 hours ago
London, United KingdomMid Level / Senior
H1B Sponsor

Responsibilities

  • Design, run, and interpret large-scale RL experiments.
  • Investigate improvements in RL as horizon, compute, and model size grow.
  • Build and maintain benchmarks for long-horizon RL.
  • Translate validated findings into production training recipes.
  • Debug complex issues at the intersection of research and infrastructure.
  • Collaborate with adjacent RL teams to enhance the overall RL stack.

Requirements

  • Strong empirical research skills in Reinforcement Learning or large-scale ML training.
  • Demonstrated ability to manage large experiments from design to interpretation.
  • Proficiency in Python and experience with large-scale or distributed ML systems.
  • Comfort operating at the research/systems boundary, including debugging.
  • A commitment to the societal impacts of AI and responsible scaling.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

Categories

AI & MLData Science