GrepJob
Preference Model

Software Engineer

Preference Model
Apply
21 days ago
San Francisco, CA, USAMid Level / Senior

Responsibilities

  • Design and build RL environments end-to-end, including tasks and reward functions.
  • Develop scalable RL training infrastructure with performance optimization and monitoring.
  • Define and create model evaluations to measure agent performance.
  • Drive architecture decisions and contribute to the engineering culture.

Requirements

  • 4+ years of software engineering experience with strong project ownership.
  • Deep expertise in at least one domain such as infra, distributed systems, or performance.
  • Proficient in Python, Rust, or TypeScript across the full stack.
  • Hands-on experience with Kubernetes, AWS, or GCP.
  • Extensive experience working with coding agents.
  • Ability to work independently on ambiguous, high-ownership problems.

Tech Stack

AWSGoogle Cloud PlatformKubernetesPythonRustTypeScript