Research Engineer, Reward Models Platform
Anthropic
2 months ago
New York, NY, USA +3 more
Mid Level / Senior
H1B Sponsor
Base Salary
$315k - $340k/yr
Responsibilities
- Design and build infrastructure for rapid iteration on reward signals.
- Develop systems for automated quality assessment of rewards.
- Create tooling for comparing different reward methodologies.
- Build pipelines that reduce toil in reward development.
- Implement monitoring systems to track reward signal quality.
- Collaborate with researchers to translate science requirements into platform capabilities.
- Optimize existing systems for performance and reliability.
- Contribute to best practices and documentation for reward workflows.
Requirements
- Prior research experience is preferred.
- Strong Python skills are required.
- Experience with ML workflows and data pipelines is necessary.
- Comfortable working across the stack from data pipelines to user-facing tooling.
- Ability to balance building robust systems with the need for speed in research.
- Results-oriented with a focus on flexibility and impact.
- Willingness to take on tasks outside of the job description.
- Motivated by the mission to develop safe AI.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office space.
Tech Stack
Apache HiveApache SparkKubernetesPython
Categories
AI & MLData Engineering