Nebius

Senior ML Solutions Architect - Token Factory

Posted: about 15 hours ago
Location: Remote, Worldwide
Level: Senior

Responsibilities

  • Design and implement LLM-based solutions using Nebius Token Factory’s inference services.
  • Build production-ready applications leveraging serverless LLM APIs, including multimodal models.
  • Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
  • Collaborate with product and engineering teams to incorporate customer feedback into the platform roadmap.
  • Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.

Requirements

  • 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
  • Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
  • Hands-on experience with prompt engineering and LLM pipeline development.
  • Experience with agentic frameworks such as LangChain or equivalent.
  • Strong Python programming skills.
  • Excellent communication skills, with the ability to explain technical concepts to diverse audiences.

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

Tech Stack

AWS, Azure, Docker, FastAPI, Flask, Google Cloud Platform, Kubernetes, Python

Categories

AI & ML, Data Science, DevOps