Cerebras Systems

Senior Research Engineer - Inference ML

Cerebras Systems

Apply
3 months ago
Sunnyvale, CA, USA or Toronto, Canada
Senior / Staff+
H1B Sponsor

Responsibilities

  • Design, implement, and optimize transformer architectures for NLP and computer vision on Cerebras hardware.
  • Research and prototype novel inference algorithms and model architectures that leverage Cerebras capabilities.
  • Train models to convergence, perform hyperparameter sweeps, and analyze results.
  • Bring up new models on the Cerebras system, validate correctness, and troubleshoot integration issues.
  • Profile and optimize model code to maximize throughput and minimize latency.
  • Develop diagnostic tools to identify performance bottlenecks and guide optimization strategies.
  • Collaborate across teams to drive projects from inception through delivery.

Requirements

  • Bachelor’s degree in a related technical field and 7+ years of ML software development experience, or equivalent experience.
  • Master’s degree in Computer Science or related field and 4+ years of software development experience, or equivalent experience.
  • PhD in Computer Science or related field with 2+ years of relevant experience, or equivalent practical experience.
  • 4+ years of experience testing, maintaining, or launching software products, including 2+ years in software design and architecture.
  • 3+ years of experience in software development focused on machine learning, such as deep learning or computer vision.
  • Strong programming skills in C++ and/or Python.
  • Experience with Generative AI and Machine Learning systems.

Benefits

  • Opportunity to work on a breakthrough AI platform beyond GPU constraints.
  • Ability to publish and open source cutting-edge AI research.
  • Engagement with one of the fastest AI supercomputers in the world.
  • Job stability with startup vitality.
  • A simple, non-corporate work culture that respects individual beliefs.

Tech Stack

C++PythonPyTorch

Categories

AI & MLData Science