Machine Learning Engineer, Safeguards
Anthropic
5 months ago
New York, NY, USA or San Francisco, CA, USA
Mid Level / Senior
H1B Sponsor
Base Salary
$315k - $425k/yr
Responsibilities
- Build machine learning models to detect unwanted or anomalous behaviors from users and API partners.
- Integrate detection models into the production system.
- Improve automated detection and enforcement systems as needed.
- Analyze user reports of inappropriate accounts and build proactive detection models.
- Surface abuse patterns to research teams to enhance model training.
Requirements
- 4+ years of experience in research/ML engineering or applied research with a focus on AI safety.
- Proficiency in Python, LLMs, SQL, and data analysis/data mining tools.
- Experience in building safe AI/ML systems, such as behavioral classifiers or anomaly detection.
- Strong communication skills to explain complex technical concepts to non-technical stakeholders.
- A commitment to understanding the societal impacts of AI work.
Benefits
- Competitive compensation and benefits package.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office space.
Tech Stack
PythonPyTorchscikit-learnSQLTensorFlow
Categories
AI & MLData Science