Senior ML Engineer (Token Factory)
Nebius
1 day ago
Amsterdam, Netherlands +5 more
Senior
Responsibilities
- Identify and remove LLM inference bottlenecks to improve production speed.
- Implement novel speculative decoding architectures and optimize LLM components.
- Design and productionize low-precision training and inference pipelines.
Requirements
- Deep understanding of machine learning and the transformer architecture.
- Experience profiling GPU workloads with tools such as Nsight or the PyTorch profiler.
- Understanding of GPU memory hierarchy and compute/memory tradeoffs.
- Familiarity with LLM concepts such as multi-head attention (MHA) and quantisation.
- Strong software engineering skills, primarily in Python.
- Deep experience with modern deep learning frameworks.
- Proficiency in CI/CD, version control, and unit testing.
- Strong communication and leadership abilities.
Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- Dynamic and collaborative work environment that values initiative and innovation.
Tech Stack
Python, PyTorch
Categories
AI & ML, Data Science