
ML Research Scientist I/II, Multimodal Data Extraction
Lila Sciences12 days ago
Cambridge, MA, USAEntry Level / Mid Level
H1B Sponsor
Base Salary
$176k - $304k/yr
Responsibilities
- Research and develop AI systems that extract and structure knowledge from diverse scientific sources.
- Design and fine-tune large language, multimodal, and specialized models for data extraction.
- Build scalable pipelines for unstructured and heterogeneous scientific data.
- Collaborate with domain experts to align extracted data with real-world discovery workflows.
- Publish research that advances the state of the art in multimodal understanding and AI-driven knowledge extraction.
Requirements
- PhD or equivalent research experience in Computer Science, Chemistry, Materials Science, or related field.
- Expertise in machine learning, NLP, and vision-language modeling using PyTorch and Hugging Face Transformers.
- Proven ability to train, fine-tune, and evaluate LLMs and multimodal models for scientific data extraction.
- Strong understanding of data structures and representations used in the physical sciences.
- Demonstrated research impact through publications, preprints, or open-source work.
Benefits
- Comprehensive benefits program including medical, dental, and vision coverage.
- Employer-paid life and disability insurance.
- Flexible time off with generous company-wide holidays.
- Paid parental leave and educational assistance program.
- Commuter benefits and company subsidized lunch program.
Tech Stack
Hugging Face TransformersPyTorch
Categories
AI & MLData Science