10 days ago
Remote, Worldwide +2 moreMid Level / Senior / Staff+
H1B Sponsor
Responsibilities
- Design model evaluation pipelines for development and production models.
- Create user studies for subjective model evaluations.
- Convert requirements into measurable metrics.
- Develop automated evaluation dashboards to track model performance.
- Train new models to capture diverse evaluation metrics.
- Collaborate with model and data teams to improve model performance.
- Ensure product requirements are accurately measured in evaluations.
- Help grow and lead the evaluation team.
Requirements
- Strong experience in designing metrics for model performance.
- Experience designing user studies on platforms like Mechanical Turk.
- Proficient in model training and fine-tuning for evaluation.
- Strong statistical knowledge for comparing evaluation results.
- Excellent engineering and programming skills.
- Experience with training ASR and TTS models.
- Background in large-scale machine learning projects.
Categories
AI & MLData Science
