about 1 year ago
Toronto, Canada +5 moreMid Level / Senior
H1B Sponsor
Responsibilities
- Conduct data ablations to assess data quality and experiment with data mixtures.
- Develop robust data modeling techniques for optimal training efficiency.
- Research and implement innovative data curation methods.
- Collaborate with cross-functional teams to ensure data pipelines meet model demands.
Requirements
- Strong software engineering skills with proficiency in Python.
- Familiarity with curriculum learning, data mixing, and data attribution.
- Experience with data processing frameworks like Apache Spark or Pandas.
- Experience working with large-scale datasets, including web and multilingual data.
- Knowledge of data quality assessment techniques.
Benefits
- Open and inclusive culture and work environment.
- Weekly lunch stipend, in-office lunches, and snacks.
- Full health and dental benefits, including mental health support.
- 100% Parental Leave top-up for up to 6 months.
- Personal enrichment benefits for arts, culture, fitness, and workspace improvement.
- Remote-flexible work options and co-working stipend.
- 6 weeks of vacation (30 working days).
Tech Stack
Categories
AI & MLData Engineering
