AI QA Engineer (Multilingual)

Scaled Cognition

about 2 months ago

Boston, MA, USA +2 moreEntry Level / Mid Level

Responsibilities

Inspect, review, and grade LLM training data and evaluation test cases for quality assurance.
Maintain local development environments to run test pipelines and investigate edge cases.
Submit pull requests via Git/GitHub to update training repositories.
Identify error cases in training data as a technical data detective.
Collaborate with the engineering team to refine evaluation criteria and improve data pipelines.

Requirements

Strong technical background with hands-on coding experience, preferably in Python.
Fluency in English and native or near-native proficiency in at least one other language.
Deep understanding of Large Language Models and their failure modes.
Proven experience in Quality Assurance, Data Quality, or Data Engineering.
Exceptional written communication skills across multiple languages.

Tech Stack

Categories

AI & MLData EngineeringTesting