Arena Intelligence, Inc.

Machine Learning Scientist

5 months ago
San Francisco, CA, USA
Mid Level / Senior
H1B Sponsor

Responsibilities

  • Design and conduct experiments to evaluate AI model behavior across various dimensions.
  • Develop new metrics, methodologies, and evaluation protocols beyond traditional benchmarks.
  • Analyze large-scale human voting and interaction data for insights into model performance.
  • Collaborate with engineers to implement and scale research findings into production systems.
  • Prototype and test research ideas rapidly while balancing rigor with iteration speed.
  • Author internal reports and external publications for the broader ML research community.
  • Partner with model providers to shape evaluation questions and support responsible testing.
  • Contribute to the scientific integrity and transparency of the Arena Intelligence leaderboard.

Requirements

  • PhD or equivalent research experience in Machine Learning, NLP, Statistics, or a related field.
  • Strong understanding of LLMs and modern deep learning architectures.
  • Proficiency in Python and ML research libraries such as PyTorch, JAX, or TensorFlow.
  • Demonstrated ability to design and analyze experiments with statistical rigor.
  • Experience publishing research or working on open-source projects in ML, NLP, or AI evaluation.
  • Comfortable working with real-world usage data and designing metrics beyond standard benchmarks.
  • Ability to translate research questions into practical systems and collaborate across teams.
  • Passion for open science, reproducibility, and community-driven research.

Benefits

  • Competitive compensation and equity, aligned to the local market.
  • Comprehensive health and wellness benefits, including medical, dental, and vision.
  • Opportunity to work on cutting-edge AI with a small, mission-driven team.
  • Culture that values transparency, trust, and community impact.

Tech Stack

Python, PyTorch, TensorFlow

Categories

AI & ML, Data Science