7 months ago
London, United KingdomMid Level / Senior
Responsibilities
- Design, build and refine production-grade LLM agents and proprietary algorithms for expert litigators.
- Lead development of robust evaluation frameworks and pipelines, establishing metrics and benchmarking models on large datasets.
- Drive advanced prompt-engineering practice to maximize model efficacy.
- Apply core NLP fundamentals while ensuring structured, performant, and cost-efficient outputs.
- Own clean, modular Python back-end code for data-intensive systems.
Requirements
- Expert-level Python with a track record of shipping production LLM systems.
- Hands-on experience designing LLM agents and RAG pipelines.
- Proficiency with AI evaluation frameworks and end-to-end evaluation pipelines.
- Solid grounding in NLP fundamentals including tokenization and embeddings.
- Ability to manage token limits, cost, and latency while delivering structured outputs.
Benefits
- Competitive salary and meaningful early-stage equity (£70-90K).
- Huge autonomy and ownership in designing AI systems.
- Budget for learning and professional growth.
- Bi-annual team retreats.
