
AI QA Engineer (Multilingual)
Scaled Cognitionabout 4 hours ago
Boston, MA, USA +2 moreEntry Level / Mid Level
Responsibilities
- Inspect, review, and grade LLM training data and evaluation test cases for quality assurance.
- Maintain local development environments to run test pipelines and investigate edge cases.
- Submit pull requests via Git/GitHub to update training repositories.
- Identify error cases in training data as a technical data detective.
- Collaborate with the engineering team to refine evaluation criteria and improve data pipelines.
Requirements
- Strong technical background with hands-on coding experience, preferably in Python.
- Fluency in English and native or near-native proficiency in at least one other language.
- Deep understanding of Large Language Models and their failure modes.
- Proven experience in Quality Assurance, Data Quality, or Data Engineering.
- Exceptional written communication skills across multiple languages.