Anthropic

Machine Learning Systems Engineer, Research Tools

4 months ago
New York, NY, USA +2 more
Mid Level / Senior
H-1B Sponsor

Base Salary

$320k - $405k/yr

Responsibilities

  • Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows.
  • Optimize encoding techniques to improve model training efficiency and performance.
  • Collaborate closely with research teams to understand their evolving needs around data representation.
  • Build infrastructure that enables researchers to experiment with novel tokenization approaches.
  • Implement systems for monitoring and debugging tokenization-related issues in the model training pipeline.
  • Create robust testing frameworks to validate tokenization systems across diverse languages and data types.
  • Identify and address bottlenecks in data processing pipelines related to tokenization.
  • Document systems thoroughly and communicate technical decisions clearly to stakeholders across teams.

Requirements

  • Significant software engineering experience with demonstrated machine learning expertise.
  • Comfortable navigating ambiguity and developing solutions in rapidly evolving research environments.
  • Ability to work independently while maintaining strong collaboration with cross-functional teams.
  • Results-oriented with a bias towards flexibility and impact.
  • Experience with machine learning systems, data pipelines, or ML infrastructure.
  • Proficient in Python and familiar with modern ML development practices.
  • Strong analytical skills to evaluate the impact of engineering changes on research outcomes.
  • Willingness to take on tasks outside of the job description.
  • Enjoy pair programming.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • A collaborative office space.

Tech Stack

Python

Categories

AI & ML
Data Engineering