GrepJob
Anthropic

Full-Stack Software Engineer, Reinforcement Learning

Anthropic
Apply
about 4 hours ago
New York, NY, USA or San Francisco, CA, USA
Mid Level / Senior
H1B Sponsor

Base Salary

$1 - $2/yr

Responsibilities

  • Build and extend web platforms for RL environment creation and management.
  • Develop vendor-facing interfaces for external partners to create and iterate on training environments.
  • Design platforms for large-scale human data collection, including quality assurance systems.
  • Create evaluation dashboards for real-time insights into environment quality and training health.
  • Build backend services and APIs connecting various training infrastructure components.
  • Expand scalable code data generation pipelines for diverse programming tasks.
  • Develop onboarding automation and documentation for new users and vendors.
  • Collaborate with researchers and operations to translate requirements into well-scoped products.

Requirements

  • Strong software engineering fundamentals with full-stack capabilities.
  • Proficiency in Python and modern web technologies like React and TypeScript.
  • Experience in shipping systems that solve complex problems effectively.
  • Ability to operate with high agency and drive projects forward independently.
  • Strong UX design skills for creating intuitive interfaces for diverse users.
  • Excellent communication skills to collaborate with researchers and engineers.
  • Ability to thrive in a fast-paced, dynamic work environment.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

AWSDockerGoogle Cloud PlatformPythonReactTypeScript

Categories

AI & MLFull Stack