Sr. Lead AI Engineer, Data - 11315
Coupa Softwareabout 21 hours ago
Bengaluru, India
Senior / Staff+
H1B Sponsor
Responsibilities
- Lead the design and implementation of data pipelines for AI model training.
- Build data curation workflows to transform raw data into validated datasets.
- Design data quality frameworks including validation and anomaly detection.
- Extend existing data export pipelines to support AI training workloads.
- Implement synthetic data generation pipelines.
- Design schema mappings for feature extraction across enterprise tables.
- Collaborate with ML engineers on training data format requirements.
- Establish data catalog and metadata management for AI training artifacts.
Requirements
- 10+ years of software engineering experience, with 5+ years in data engineering.
- Strong experience with Apache Spark / PySpark and large-scale data processing.
- Experience building ETL/ELT pipelines on cloud infrastructure.
- Knowledge of data quality frameworks and data governance.
- Experience with data anonymization and privacy-preserving data processing.
- Understanding of ML training data requirements.
- Proficiency in Python and SQL.
- Experience with data catalog tools and metadata management.
- BS/MS in Computer Science or equivalent experience.
- Experience in B2B SaaS with multi-tenant data preferred.
Tech Stack
Apache SparkPythonSQL
Categories
AI & MLData Engineering