5 days ago
Bellevue, WA, USA +2 more
Staff+
Base Salary
$188k - $275k/yr
Responsibilities
- Provide technical leadership in designing and operating large-scale infrastructure services for GPU servers.
- Build and enhance infrastructure services and automation using open source technologies.
- Drive strategic direction for infrastructure automation and service orchestration.
- Define best practices for API development and mentor engineers.
- Collaborate with hardware, software, and operations teams.
- Contribute to open source communities through collaboration and thought leadership.
- Lead and improve CI/CD pipelines for hardware compliance and data systems.
- Champion reliability and operational excellence through observability and incident response.
Requirements
- B.S., M.S., or PhD in Computer Science or related field, or equivalent experience.
- 8+ years of software engineering experience focused on infrastructure and cloud engineering.
- Expertise in Go and experience building REST/gRPC APIs.
- Strong background in architecting and scaling cloud-native Kubernetes infrastructure.
- Proven success in mentoring engineers and leading technical projects.
- Experience contributing to open source communities.
- Skilled in applying a data-driven approach to reliability and optimization.
- Excellent communication skills for working with technical and non-technical stakeholders.
- Hands-on experience with observability stacks and operating large fleets of GPU servers.
Benefits
- 100% paid medical, dental, and vision insurance.
- Company-paid life insurance and voluntary supplemental life insurance.
- Short- and long-term disability insurance.
- Flexible Spending Account and Health Savings Account.
- Tuition reimbursement and participation in the Employee Stock Purchase Program.
- Mental wellness benefits and family-forming support.
- Paid parental leave and flexible childcare support.
- 401(k) with a generous employer match.
- Flexible PTO and catered lunch each day.
- Casual work environment and a culture focused on innovative disruption.
