GrepJob
7 months ago
New York, NY, USA
Senior / Staff+
H1B Sponsor

Base Salary

$400k - $600k/yr

Responsibilities

  • Design and run experiments across multiple aspects of AI models.
  • Lead evaluations by building datasets and success criteria.
  • Push performance metrics to improve efficiency and reduce failure rates.
  • Collaborate with engineering and product teams for weekly releases.
  • Mentor a senior team and establish standards for quality and documentation.
  • Drive state-of-the-art results in custom agent systems.
  • Research advanced memory techniques for multi-agent systems.

Requirements

  • 5–8+ years of experience in applied ML/AI or research engineering.
  • Deep knowledge of LLMs and agentic systems.
  • Proven ability to transition projects from paper to production.
  • Experience in building evaluation harnesses and datasets for complex tasks.
  • Track record of shipping improvements that impact core business metrics.
  • Bonus: familiarity with orchestration frameworks and real-time state synchronization.

Benefits

  • High autonomy in a fast-paced work environment.
  • Opportunity to work directly with founders and influence product direction.
  • No bureaucracy, allowing for creative freedom and ownership.
  • Small, elite team focused on impactful work.
  • In-person collaboration in New York City.

Categories

AI & ML, Data Science