Anthropic

Research Engineer, Production Model Post-Training, London

4 months ago
London, United Kingdom
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Implement and optimize post-training techniques at scale on frontier models.
  • Conduct research to develop and optimize post-training recipes that improve production model quality.
  • Design, build, and run robust, efficient pipelines for model fine-tuning and evaluation.
  • Develop tools to measure and improve model performance across various dimensions.
  • Collaborate with research teams to translate emerging techniques into production-ready implementations.
  • Debug complex issues in training pipelines and model behavior.
  • Help establish best practices for reliable, reproducible model post-training.

Requirements

  • Strong software engineering skills with experience building complex ML systems.
  • Experience with training, fine-tuning, or evaluating large language models.
  • Comfortable working with large-scale distributed systems and high-performance computing.
  • Ability to balance research exploration with engineering rigor and operational reliability.
  • Adept at analyzing and debugging model training processes.
  • Enjoy collaborating across research and engineering disciplines.
  • Thrive in fast-paced environments and adapt quickly to changing priorities.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

Python

Categories

AI & ML, Data Science