Senior Software Development Engineer in Test (SDET) - AI Cluster
Cerebras Systems
27 days ago
Sunnyvale, CA, USA or Toronto, Canada
Senior
H1B Sponsor
Responsibilities
- Innovate and execute tests on cutting-edge AI infrastructure.
- Define optimized test strategies and methodologies.
- Adapt to new technologies and bring diverse expertise.
- Understand large-scale distributed ML training and inference.
- Automate tests for efficiency and scalability.
- Champion cluster security and reliability.
- Test all components of the AI cluster, including software and hardware.
Requirements
- Bachelor's or master's degree in engineering, computer science, AI, data science, or a related field.
- 5+ years of experience in testing enterprise software, distributed systems, or datacenter hardware.
- Strong coding skills in Python, Golang, or C/C++.
- Strong debugging skills for large distributed systems.
- Understanding of operating system internals and datacenter layout.
- Experience with cloud technologies like AWS, Kubernetes, and Docker.
- Understanding of ML model training and inference is a plus.
- Familiarity with ML hardware accelerators like GPUs and custom ASICs is a plus.
Benefits
- Opportunity to build a breakthrough AI platform beyond GPU constraints.
- Ability to publish and open-source cutting-edge AI research.
- Work on one of the fastest AI supercomputers in the world.
- Enjoy job stability with startup vitality.
- Experience a simple, non-corporate work culture that respects individual beliefs.
Tech Stack
AWS, C, C++, Docker, Go, Grafana, Kubernetes, Prometheus, Python
Categories
AI & ML, Testing