Senior ML Solutions Architect - Token Factory

3 months ago

Remote, WorldwideSenior

Responsibilities

Design and implement LLM-based solutions using Nebius Token Factory’s inference services.
Build production-ready applications leveraging serverless LLM APIs, including multimodal models.
Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
Collaborate with product and engineering teams to incorporate customer feedback into the platform roadmap.
Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.

5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
Hands-on experience with prompt engineering and LLM pipeline development.
Experience with agentic frameworks such as Langchain or equivalent.
Strong Python programming skills.
Excellent communication skills to explain technical concepts to diverse audiences.

Competitive salary and comprehensive benefits package.
Opportunities for professional growth within Nebius.
Flexible working arrangements.
A dynamic and collaborative work environment that values initiative and innovation.