Base Salary
$320k - $485k/yr
Responsibilities
- Lead the launch of frontier models and manage inference for new architectures on cloud platforms.
- Collaborate with the core inference team to integrate new features into production.
- Identify and resolve discrepancies in inference behavior across platforms.
- Design and maintain CI/CD infrastructure for inference servers and load balancers.
- Optimize validation processes to reduce cycle times without sacrificing reliability.
- Analyze performance data to identify and remediate bottlenecks and anomalies.
Requirements
- Strong interest in LLM serving; prior inference or ML experience is not required.
- Significant software engineering experience with large-scale distributed systems.
- Proven track record in building automation or test infrastructure.
- Experience with at least one major cloud platform and familiarity with Kubernetes.
- Ability to work effectively in cross-functional teams.
- Fast learner with the ability to quickly adapt to new technologies.
- Highly autonomous with a strong sense of ownership over projects.
Benefits
- Competitive compensation and benefits package.
- Optional equity donation matching.
- Generous vacation and parental leave.
- Flexible working hours.
- Collaborative office environment.