about 1 month ago
Tel Aviv-Yafo, IsraelSenior
Responsibilities
- Deploy and manage large-scale models using high-performance inference engines.
- Develop and refine the chatbot's agentic capabilities.
- Design and execute fine-tuning strategies for model accuracy.
- Build comprehensive evaluation frameworks to measure model performance.
Requirements
- Strong proficiency in Python and Bash scripting.
- Deep experience with PyTorch and the Hugging Face ecosystem.
- Proven experience deploying models using vLLM or similar servers.
- Strong understanding of LLM architectures and attention mechanisms.
- Hands-on experience building agentic systems.
- Expertise in fine-tuning strategies and parameter-efficient techniques.
- Deep understanding of classification and retrieval metrics.
- Strong statistical foundation for designing and analyzing A/B tests.