Trends Sign In Sign Up

Anthropic

Applied Safety Research Engineer, Safeguards

Anthropic

about 2 months ago

New York, NY, USA or San Francisco, CA, USA

Mid Level / Senior

H1B Sponsor

Base Salary

$320k - $405k/yr

Responsibilities

Design and run experiments to improve evaluation quality.
Research factors impacting model safety behavior.
Analyze evaluation coverage to identify measurement gaps.
Productionize successful research into evaluation pipelines.
Collaborate with Policy and Enforcement on measurable evaluations.
Build tooling for policy experts to create evaluations.
Surface findings to drive model improvements.

Requirements

4+ years of software engineering or ML engineering experience.
Proficient in Python and comfortable working across the stack.
Experience building and maintaining data pipelines.
Comfortable with data analysis and drawing insights from large datasets.
Experience with LLMs and understanding their capabilities and failure modes.
Ability to transition between prototyping and production-quality code.
Excited by ambiguous problems and translating them into experiments.
Care deeply about AI safety and desire to make a real impact.

Benefits

Competitive compensation and benefits.
Optional equity donation matching.
Generous vacation and parental leave.
Flexible working hours.
Collaborative office space.

Tech Stack

Python

Categories

AI & MLData Science