Cerebras Systems

Full Stack LLM Engineer

Cerebras Systems

Apply
7 months ago
Toronto, Canada
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Contribute to the end-to-end bring up of ML models on Cerebras CSX systems.
  • Work across the stack: model architecture translation, graph lowering, compiler optimizations, runtime integration, and performance tuning.
  • Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
  • Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Requirements

  • Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field.
  • Comfort navigating the full AI toolchain: Python modeling code, compiler IRs, performance profiling, etc.
  • Strong debugging skills across performance, numerical accuracy, and runtime integration.
  • Experience with deep learning frameworks (e.g., PyTorch, TensorFlow) and familiarity with model internals (e.g., attention, MoE, diffusion).
  • Proficiency in C/C++ programming and experience with low-level optimization.
  • Proven experience in compiler development, particularly with LLVM and/or MLIR.
  • Strong background in optimization techniques, particularly those involving NP-hard problems.

Benefits

  • Competitive salary and benefits package.
  • Opportunities for professional growth and career advancement.
  • A dynamic and innovative work environment.
  • The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Tech Stack

CC++PythonPyTorchTensorFlow

Categories

AI & MLFull Stack