3 months ago
Remote, Worldwide +2 moreMid Level / Senior
H1B Sponsor
Responsibilities
- Act as the technical owner for enterprise customer VLM post-training engagements.
- Translate customer requirements into multimodal post-training specifications and workflows.
- Design and execute visual data generation, filtering, and quality assessment processes.
- Run supervised fine-tuning, preference alignment, and reinforcement learning workflows for VLMs.
- Design task-specific evaluations for multimodal capabilities and interpret results.
Requirements
- Hands-on experience with data generation and evaluation for VLM or multimodal post-training.
- Experience training or fine-tuning vision-language models using SFT, preference alignment, and/or RL.
- Strong intuition for visual data quality, annotation design, and multimodal evaluation.
- Familiarity with vision encoders and image-text architectures.
Benefits
- Competitive base salary with equity in a unicorn-stage company.
- 100% coverage of medical, dental, and vision premiums for employees and dependents.
- 401(k) matching up to 4% of base pay.
- Unlimited PTO plus company-wide Refill Days throughout the year.
Categories
AI & MLData Science
