Machine Learning Systems Engineer, Research Tools
Anthropic
4 months ago
New York, NY, USA +2 more
Mid Level / Senior
H1B Sponsor
Base Salary
$320k - $405k/yr
Responsibilities
- Design, develop, and maintain tokenization systems used across Pretraining and Finetuning workflows.
- Optimize encoding techniques to improve model training efficiency and performance.
- Collaborate closely with research teams to understand their evolving needs around data representation.
- Build infrastructure that enables researchers to experiment with novel tokenization approaches.
- Implement systems for monitoring and debugging tokenization-related issues in the model training pipeline.
- Create robust testing frameworks to validate tokenization systems across diverse languages and data types.
- Identify and address bottlenecks in data processing pipelines related to tokenization.
- Document systems thoroughly and communicate technical decisions clearly to stakeholders across teams.
Requirements
- Significant software engineering experience with demonstrated machine learning expertise.
- Comfortable navigating ambiguity and developing solutions in rapidly evolving research environments.
- Ability to work independently while maintaining strong collaboration with cross-functional teams.
- Results-oriented with a bias towards flexibility and impact.
- Experience with machine learning systems, data pipelines, or ML infrastructure.
- Proficient in Python and familiar with modern ML development practices.
- Strong analytical skills to evaluate the impact of engineering changes on research outcomes.
- Willingness to take on tasks outside of the job description.
- Enjoy pair programming.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- A collaborative office space.
Tech Stack
Python
Categories
AI & MLData Engineering