Research Engineer, Production Model Post-Training, London

Anthropic

4 months ago

London, United Kingdom

Mid Level / Senior

H1B Sponsor

Responsibilities

Implement and optimize post-training techniques at scale on frontier models.
Conduct research to develop and optimize post-training recipes that improve production model quality.
Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
Develop tools to measure and improve model performance across various dimensions.
Collaborate with research teams to translate emerging techniques into production-ready implementations.
Debug complex issues in training pipelines and model behavior.
Help establish best practices for reliable, reproducible model post-training.

Qualifications

Strong software engineering skills with experience building complex ML systems.
Experience with training, fine-tuning, or evaluating large language models.
Comfortable working with large-scale distributed systems and high-performance computing.
Ability to balance research exploration with engineering rigor and operational reliability.
Adept at analyzing and debugging model training processes.
Enjoy collaborating across research and engineering disciplines.
Thrive in fast-paced environments and adapt quickly to changing priorities.

Python