about 5 hours ago
San Mateo, CA, USASenior / Staff+
H1B Sponsor
Base Salary
$243k - $295k/yr
Responsibilities
- Architect and maintain automated pipelines for multi-modal dataset ingestion and pre-processing.
- Leverage image and video generation models to scale synthetic datasets.
- Partner with research teams to create training data for experiments.
- Build and own evaluation frameworks for training datasets and models.
- Design and optimize high-throughput, low-latency Inference APIs.
- Participate in literature reviews to implement optimizations in generative modeling.
- Implement monitoring for pipeline health and optimize data loading.
Requirements
- 8+ years of experience as a research-focused data systems engineer.
- Expertise in building scalable ML data pipelines for large datasets.
- Versatile with several programming languages and technologies.
- Collaborative team player with technical leadership skills.
- Proficient in Python for automation and infrastructure management.
- Experience with cloud data platforms and distributed processing technologies.
- Passionate about generative AI in creative domains.
- Bachelor's degree or equivalent experience in a technical field.