Nebius

Senior ML Engineer (Token Factory)

1 day ago
Amsterdam, Netherlands +5 more
Senior

Responsibilities

  • Identify and optimize LLM inference bottlenecks to improve inference speed in production.
  • Implement novel speculative decoding architectures and optimize LLM components.
  • Design and productionize low-precision training and inference pipelines.

Requirements

  • Deep understanding of machine learning and the transformer architecture.
  • Experience profiling GPU workloads with tools such as NVIDIA Nsight or the PyTorch Profiler.
  • Understanding of GPU memory hierarchy and compute/memory tradeoffs.
  • Familiarity with LLM concepts such as multi-head attention (MHA) and quantisation.
  • Strong software engineering skills, primarily in Python.
  • Deep experience with modern deep learning frameworks.
  • Proficiency in CI/CD, version control, and unit testing.
  • Strong communication and leadership abilities.

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • Dynamic and collaborative work environment that values initiative and innovation.

Tech Stack

Python, PyTorch

Categories

AI & ML, Data Science