Scaled Cognition

AI QA Engineer (Multilingual)

about 4 hours ago
Boston, MA, USA +2 more
Entry Level / Mid Level

Responsibilities

  • Inspect, review, and grade LLM training data and evaluation test cases for quality assurance.
  • Maintain local development environments to run test pipelines and investigate edge cases.
  • Submit pull requests via Git/GitHub to update training repositories.
  • Act as a technical data detective, tracking down error cases in training data.
  • Collaborate with the engineering team to refine evaluation criteria and improve data pipelines.

Requirements

  • Strong technical background with hands-on coding experience, preferably in Python.
  • Fluency in English and native or near-native proficiency in at least one other language.
  • Deep understanding of Large Language Models and their failure modes.
  • Proven experience in Quality Assurance, Data Quality, or Data Engineering.
  • Exceptional written communication skills across multiple languages.

Categories

AI & ML, Data Engineering, Testing