Anthropic

Software Engineer, Inference Deployment

Posted 21 days ago
New York, NY, USA +2 more
Senior
H-1B Sponsor

Base Salary

$320k - $485k/yr

Responsibilities

  • Own deployment orchestration for validated inference builds across GPU, TPU, and Trainium fleets.
  • Improve capacity-aware deployment scheduling to maximize throughput against constrained resources.
  • Extend deployment observability with dashboards and tools for tracking code in production.
  • Drive down cycle time from code merge to production with optimized pipeline architectures.
  • Optimize fleet rollout strategies for large-scale deployments with minimal service disruption.
  • Evolve self-service model onboarding for continuous deployment without engineering involvement.
  • Collaborate with teams across the Inference organization to integrate deployment automation.

Requirements

  • 5+ years of experience building deployment infrastructure at scale.
  • Strong software engineering skills with experience in complex state machines and multi-stage pipelines.
  • Experience with resource-constrained deployment systems.
  • Proven track record of building automation that improves deployment velocity and reliability.
  • Proficiency with Kubernetes-based deployments and container orchestration.
  • Comfort working across the stack from backend services to web UIs.
  • Strong communication skills for collaborating across teams.

Benefits

  • Competitive compensation and benefits.
  • Optional equity donation matching.
  • Generous vacation and parental leave.
  • Flexible working hours.
  • Collaborative office space.

Tech Stack

Kubernetes, Python, Rust

Categories

AI & ML, DevOps