Research Engineer, Production Model Post-Training, London
Anthropic
4 months ago
London, United Kingdom
Mid Level / Senior
H1B Sponsor
Responsibilities
- Implement and optimize post-training techniques at scale on frontier models.
- Conduct research to develop and optimize post-training recipes that improve production model quality.
- Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
- Develop tools to measure and improve model performance across various dimensions.
- Collaborate with research teams to translate emerging techniques into production-ready implementations.
- Debug complex issues in training pipelines and model behavior.
- Help establish best practices for reliable, reproducible model post-training.
Requirements
- Strong software engineering skills with experience building complex ML systems.
- Experience with training, fine-tuning, or evaluating large language models.
- Comfortable working with large-scale distributed systems and high-performance computing.
- Ability to balance research exploration with engineering rigor and operational reliability.
- Adept at analyzing and debugging model training processes.
- Enjoy collaborating across research and engineering disciplines.
- Thrive in fast-paced environments and adapt quickly to changing priorities.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office space.
Tech Stack
Python
Categories
AI & MLData Science