over 1 year ago
Base Salary
$200k - $550k/yr
Responsibilities
- Design and build post-training datasets using synthetic generation and targeted data collection.
- Implement filtering, scoring, and mixture strategies for RL and post-training corpora.
- Build and maintain evaluation frameworks to identify long-context failure modes.
- Design reward signals and training environments for targeted capability improvements.
- Run ablations across data sources, reward designs, and long-horizon task structures.
- Improve reliability and observability of post-training data and environment pipelines.
- Collaborate closely with Product and Research to translate capability goals into measurable iteration cycles.
Requirements
- Strong software engineering fundamentals.
- Experience building or operating large-scale data or ML systems.
- Ability to design and interpret experiments that measure model behavior changes.
- Comfort working at the intersection of ML, data systems, and infrastructure.
- Strong attention to data quality and evaluation rigor.
- Track record of owning experimental or production systems end-to-end.
Benefits
- Annual salary range between $200K - $550K based on experience.
- Equity is a significant part of total compensation, in addition to salary.
- 401(k) plan with 6% salary matching.
- Generous health, dental, and vision insurance for you and your dependents.
- Unlimited paid time off.
- Visa sponsorship and relocation stipend to bring you to SF, if possible.
