Anthropic

Software Engineer, Safeguards

Anthropic

Apply
5 months ago
San Francisco, CA, USA
Senior / Staff+
H1B Sponsor

Base Salary

$320k - $425k/yr

Responsibilities

  • Develop monitoring systems to detect unwanted behaviors from API partners.
  • Build abuse detection mechanisms and infrastructure.
  • Surface abuse patterns to research teams to enhance model training.
  • Create robust multi-layered defenses for real-time safety improvements.
  • Analyze user reports of inappropriate content or accounts.

Requirements

  • Bachelor’s degree in Computer Science, Software Engineering, or comparable experience.
  • 5-10+ years of experience in software engineering, focusing on integrity or abuse detection.
  • Proficiency in Python and Typescript.
  • Ability to work across the software stack.
  • Strong communication skills to explain complex concepts to non-technical stakeholders.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

PythonTypeScript

Categories

AI & MLBackendFull Stack