about 15 hours ago
Remote, Worldwide
Senior
Responsibilities
- Design and implement LLM-based solutions using Nebius Token Factory’s inference services.
- Build production-ready applications leveraging serverless LLM APIs, including multimodal models.
- Provide technical expertise in prompt engineering, RAG architectures, model selection, and inference optimization.
- Collaborate with product and engineering teams to incorporate customer feedback into the platform roadmap.
- Guide customers in scaling from POC to production with a focus on performance, reliability, and cost efficiency.
Requirements
- 5+ years of experience in ML/AI systems, with at least 2 years focused on LLMs and generative AI.
- Deep knowledge of the LLM ecosystem, including model architectures and fine-tuning approaches.
- Hands-on experience with prompt engineering and LLM pipeline development.
- Experience with agentic frameworks such as Langchain or equivalent.
- Strong Python programming skills.
- Excellent communication skills to explain technical concepts to diverse audiences.
Benefits
- Competitive salary and comprehensive benefits package.
- Opportunities for professional growth within Nebius.
- Flexible working arrangements.
- A dynamic and collaborative work environment that values initiative and innovation.
Tech Stack
AWSAzureDockerFastAPIFlaskGoogle Cloud PlatformKubernetesPython
Categories
AI & MLData ScienceDevOps