Anthropic

Research Engineer, Reward Models Platform

Anthropic

Apply
2 months ago
New York, NY, USA +3 more
Mid Level / Senior
H1B Sponsor

Base Salary

$315k - $340k/yr

Responsibilities

  • Design and build infrastructure for rapid iteration on reward signals.
  • Develop systems for automated quality assessment of rewards.
  • Create tooling for comparing different reward methodologies.
  • Build pipelines that reduce toil in reward development.
  • Implement monitoring systems to track reward signal quality.
  • Collaborate with researchers to translate science requirements into platform capabilities.
  • Optimize existing systems for performance and reliability.
  • Contribute to best practices and documentation for reward workflows.

Requirements

  • Prior research experience is preferred.
  • Strong Python skills are required.
  • Experience with ML workflows and data pipelines is necessary.
  • Comfortable working across the stack from data pipelines to user-facing tooling.
  • Ability to balance building robust systems with the need for speed in research.
  • Results-oriented with a focus on flexibility and impact.
  • Willingness to take on tasks outside of the job description.
  • Motivated by the mission to develop safe AI.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

Apache HiveApache SparkKubernetesPython

Categories

AI & MLData Engineering