about 2 months ago
Remote, Worldwide or San Francisco, CA, USA
Mid Level / Senior
Base Salary
$180k - $290k/yr
Responsibilities
- Build training infrastructure and reward pipelines from scratch.
- Design and operate systems for training and evaluating Firecrawl's models.
- Fine-tune models to achieve state-of-the-art results.
- Bridge LLM agents and classical RL techniques.
- Run fast experiments and iterate based on results.
- Communicate findings clearly to non-RL team members.
- Collaborate closely with the engineering team on the product roadmap.
Requirements
- 3+ years of experience in applied RL, ML engineering, or model training.
- Proven ability to build training infrastructure and reward models independently.
- Experience in fine-tuning models to achieve best-in-class performance.
- Fluency in both classical RL and modern LLM techniques.
- Production-minded with experience deploying models that serve real traffic.
- Ability to run fast experiments and communicate results effectively.
Benefits
- Competitive salary based on impact, ranging from $180,000 to $290,000/year.
- Equity options up to 0.15%.
- Generous PTO policy with 15 mandatory days and additional days upon request.
- 12 weeks of fully paid parental leave.
- Wellness stipend of $100/month.
- Learning & Development budget of up to $1,000/year.
- Team offsites and a 3-month paid sabbatical after 4 years.
- Comprehensive medical, dental, and vision coverage for employees.
Categories
AI & ML
Data Science
