Anthropic

Machine Learning Engineer, Safeguards

5 months ago
New York, NY, USA or San Francisco, CA, USA
Mid Level / Senior
H1B Sponsor

Base Salary

$315k - $425k/yr

Responsibilities

  • Build machine learning models to detect unwanted or anomalous behaviors from users and API partners.
  • Integrate detection models into the production system.
  • Improve automated detection and enforcement systems as needed.
  • Analyze user reports of inappropriate accounts and build proactive detection models.
  • Surface abuse patterns to research teams to enhance model training.

Requirements

  • 4+ years of experience in research/ML engineering or applied research with a focus on AI safety.
  • Proficiency in Python, LLMs, SQL, and data analysis/data mining tools.
  • Experience in building safe AI/ML systems, such as behavioral classifiers or anomaly detection.
  • Strong communication skills to explain complex technical concepts to non-technical stakeholders.
  • A commitment to understanding the societal impacts of AI work.

Benefits

  • Competitive compensation and benefits package.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

Python, PyTorch, scikit-learn, SQL, TensorFlow

Categories

AI & ML, Data Science