13 days ago
London, United KingdomMid Level / Senior
H1B Sponsor
Responsibilities
- Build robust and scalable distributed RL systems.
- Optimize frameworks to enable complex inference-time reasoning.
- Develop environments and harnesses for agents.
Requirements
- Experienced with large-scale reinforcement learning systems.
- Skilled in designing and implementing distributed systems.
- Knowledgeable about state-of-the-art RL and inference time compute algorithms.
Categories
AI & MLData Engineering