Cerebras Systems

Senior Software Development Engineer in Test (SDET) - AI Cluster

27 days ago
Sunnyvale, CA, USA or Toronto, Canada
Senior
H1B Sponsor

Responsibilities

  • Innovate and execute tests on cutting-edge AI infrastructure.
  • Define optimized test strategies and methodologies.
  • Adapt to new technologies and bring diverse expertise.
  • Understand large-scale distributed ML training and inference.
  • Automate tests for efficiency and scalability.
  • Champion cluster security and reliability.
  • Test all components of the AI cluster, including software and hardware.

Requirements

  • Bachelor's or master's degree in engineering, computer science, AI, data science, or a related field.
  • 5+ years of experience in testing enterprise software, distributed systems, or datacenter hardware.
  • Strong coding skills in Python, Golang, or C/C++.
  • Strong debugging skills for large distributed systems.
  • Understanding of operating system internals and datacenter layout.
  • Experience with cloud technologies like AWS, Kubernetes, and Docker.
  • Understanding of ML model training and inference is a plus.
  • Familiarity with ML hardware accelerators like GPUs and custom ASICs is a plus.

Benefits

  • Opportunity to build a breakthrough AI platform beyond GPU constraints.
  • Ability to publish and open source cutting-edge AI research.
  • Work on one of the fastest AI supercomputers in the world.
  • Enjoy job stability with startup vitality.
  • Experience a simple, non-corporate work culture that respects individual beliefs.

Tech Stack

AWS, C, C++, Docker, Go, Grafana, Kubernetes, Prometheus, Python

Categories

AI & ML, Testing