1 day ago
Responsibilities
- Own software engineering efforts across the full SDLC for the rack management solution.
- Drive resolution of critical infrastructure issues while collaborating across teams.
- Configure and test new Graphcore AI hardware and systems using Continuous Deployment and Infrastructure-as-Code.
- Work with Datacenter Operations Engineers to maintain peak performance of AI systems.
- Implement corrective actions for systems not operating correctly.
Requirements
- Bachelor's degree or equivalent practical experience in a relevant subject.
- Experience with RESTful API development.
- Experience building, deploying, and operating containerized workloads using Kubernetes.
- Programming experience with Go.
- Hands-on experience with Infrastructure-as-Code and CI/CD automation tools.
- Experience with Redfish for datacenter hardware management.
- Strong Linux systems engineering experience, including automation and scripting.