Anthropic

Research Engineer, Code RL (Reinforcement Learning)

Posted about 2 hours ago
San Francisco, CA, USA or New York, NY, USA
Mid Level / Senior
H1B Sponsor

Base Salary

$500k - $850k/yr

Responsibilities

  • Advance models' ability to write, edit, test, debug, and ship software.
  • Design reinforcement learning environments and coding tasks.
  • Build reward signals and verifiers to define 'good code'.
  • Run training experiments on frontier models.
  • Diagnose model performance on software engineering tasks.
  • Improve the speed and reliability of development pipelines.

Requirements

  • Strong software engineering skills with deep Python expertise.
  • Experience in async/concurrent programming.
  • Ability to own systems end to end and debug across the stack.
  • Ability to balance research exploration with engineering implementation.
  • Commitment to code quality, testing, and performance.
  • Passion for developing safe and beneficial AI systems.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Categories

AI & ML, Data Science