Cerebras Systems

Full Stack LLM Engineer

Cerebras Systems

Apply
5 months ago
Bengaluru, India
Senior / Staff+
H1B Sponsor

Responsibilities

  • Contribute to the end-to-end bring up of frameworks for RL, inference serving, and ML models on Cerebras CSX systems.
  • Work across the stack including model architecture translation, graph lowering, compiler optimizations, and runtime integration.
  • Debug performance and correctness issues spanning model code, compiler IRs, runtime behavior, and hardware utilization.
  • Propose and prototype improvements across tools, APIs, or automation flows to accelerate future bring ups.

Requirements

  • Bachelor’s, Master’s, or PhD in Computer Science, Engineering, or a related field with 8 to 12 years’ experience.
  • Comfort navigating the full AI toolchain including Python modeling code and performance profiling.
  • Strong debugging skills across performance, numerical accuracy, and runtime integration.
  • Experience with deep learning frameworks such as PyTorch and TensorFlow.
  • Proficiency in C/C++ programming and experience with low-level optimization.
  • Strong background in optimization techniques, particularly those involving NP-hard problems.

Benefits

  • Competitive salary and benefits package.
  • Opportunities for professional growth and career advancement.
  • A dynamic and innovative work environment.
  • The chance to work on cutting-edge technologies and make a significant impact on the future of AI.

Tech Stack

CC++PythonPyTorchTensorFlow

Categories

AI & MLData EngineeringFull Stack