Member of Technical Staff - Reasoning

about 2 months ago

London, United KingdomMid Level / Senior

H1B Sponsor

Responsibilities

Build robust and scalable distributed RL systems.
Optimize frameworks to enable complex inference-time reasoning.
Develop environments and harnesses for agents.

Requirements

Experienced with large-scale reinforcement learning systems.
Skilled in designing and implementing distributed systems.
Knowledgeable about state-of-the-art RL and inference time compute algorithms.

Categories

AI & MLData Engineering