about 2 hours ago
Remote, United KingdomSenior / Mid Level
Responsibilities
- Lead independent research projects in AI evaluation methodologies, alignment techniques, and synthetic data generation.
- Design and implement novel evaluation frameworks for LLMs and agent systems grounded in human data.
- Contribute to the academic AI community through publications and open-source contributions.
- Stay at the forefront of AI research and pioneer innovative approaches to tackle pressing challenges.
- Design and conduct rigorous experiments to study AI models and systems.
- Develop scalable frameworks for systematic evaluation of model behaviors and capabilities.
- Create tools and frameworks that transform research insights into practical applications.
- Build infrastructure to support large-scale research experiments.
- Apply knowledge of model fine-tuning and optimization techniques to support research goals.
- Work closely with ML engineers, data scientists, and product teams to translate research insights.
- Mentor team members on advanced AI concepts and emerging research directions.
- Communicate complex technical concepts to diverse stakeholders.
Requirements
- 5+ years of engineering experience with significant AI/ML focus.
- Demonstrated research experience through publications or impactful projects.
- Strong engineering fundamentals and experience implementing AI systems in production.
- Deep knowledge of LLM evaluation methodologies and alignment techniques.
- Experience with model fine-tuning, adapters, quantization, and distillation frameworks.
- Self-motivation and ability to define and pursue research directions independently.
- Excellent understanding of current challenges in AI safety, reliability, and alignment.
- Strong communication skills to explain complex research concepts clearly.
- Passion for staying current with the rapidly evolving AI research landscape.
Benefits
- Competitive salary and benefits.
- Remote working opportunities.
- Access to a unique human data platform for groundbreaking research.
Categories
AI & MLData Science