GrepJob
Coupa Software

Sr. Lead AI Engineer, Data - 11315

Coupa Software
Apply
about 21 hours ago
Bengaluru, India
Senior / Staff+
H1B Sponsor

Responsibilities

  • Lead the design and implementation of data pipelines for AI model training.
  • Build data curation workflows to transform raw data into validated datasets.
  • Design data quality frameworks including validation and anomaly detection.
  • Extend existing data export pipelines to support AI training workloads.
  • Implement synthetic data generation pipelines.
  • Design schema mappings for feature extraction across enterprise tables.
  • Collaborate with ML engineers on training data format requirements.
  • Establish data catalog and metadata management for AI training artifacts.

Requirements

  • 10+ years of software engineering experience, with 5+ years in data engineering.
  • Strong experience with Apache Spark / PySpark and large-scale data processing.
  • Experience building ETL/ELT pipelines on cloud infrastructure.
  • Knowledge of data quality frameworks and data governance.
  • Experience with data anonymization and privacy-preserving data processing.
  • Understanding of ML training data requirements.
  • Proficiency in Python and SQL.
  • Experience with data catalog tools and metadata management.
  • BS/MS in Computer Science or equivalent experience.
  • Experience in B2B SaaS with multi-tenant data preferred.

Tech Stack

Apache SparkPythonSQL

Categories

AI & MLData Engineering