GrepJob
Lila Sciences

ML Research Scientist I/II, Multimodal Data Extraction

Lila Sciences
Apply
12 days ago
Cambridge, MA, USAEntry Level / Mid Level
H1B Sponsor

Base Salary

$176k - $304k/yr

Responsibilities

  • Research and develop AI systems that extract and structure knowledge from diverse scientific sources.
  • Design and fine-tune large language, multimodal, and specialized models for data extraction.
  • Build scalable pipelines for unstructured and heterogeneous scientific data.
  • Collaborate with domain experts to align extracted data with real-world discovery workflows.
  • Publish research that advances the state of the art in multimodal understanding and AI-driven knowledge extraction.

Requirements

  • PhD or equivalent research experience in Computer Science, Chemistry, Materials Science, or related field.
  • Expertise in machine learning, NLP, and vision-language modeling using PyTorch and Hugging Face Transformers.
  • Proven ability to train, fine-tune, and evaluate LLMs and multimodal models for scientific data extraction.
  • Strong understanding of data structures and representations used in the physical sciences.
  • Demonstrated research impact through publications, preprints, or open-source work.

Benefits

  • Comprehensive benefits program including medical, dental, and vision coverage.
  • Employer-paid life and disability insurance.
  • Flexible time off with generous company-wide holidays.
  • Paid parental leave and educational assistance program.
  • Commuter benefits and company subsidized lunch program.

Tech Stack

Hugging Face TransformersPyTorch

Categories

AI & MLData Science