4 days ago
Dublin, Ireland
Staff+
H1B Sponsor
Responsibilities
- Build and maintain systems that serve the AI model Claude to millions of users.
- Maximize compute efficiency to support customer growth and enable AI research.
- Address key infrastructure blockers for high-performance inference.
- Design intelligent routing algorithms for request distribution.
- Autoscale compute fleets to match supply with demand.
- Build production-grade deployment pipelines for new models.
- Integrate new AI accelerator platforms for competitive advantage.
- Analyze observability data to tune performance based on production workloads.
Requirements
- Significant software engineering experience, particularly with distributed systems.
- Familiarity with performance optimization and large-scale service orchestration.
- Experience with load balancing, request routing, or traffic management systems.
- Knowledge of Kubernetes and cloud infrastructure (AWS, GCP) is preferred.
- Proficiency in Python or Rust is encouraged.
- Bachelor's degree in a related field or equivalent experience.
Benefits
- Competitive compensation and benefits.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office space.
Tech Stack
AWSGoogle Cloud PlatformKubernetesPythonRust
Categories
AI & MLBackendDevOps