Lead ML Data Engineer, AI Core
Nubank
3 days ago
Durham, NC, USA +3 more
Mid Level / Senior / Staff+
Responsibilities
- Design and build scalable data ingestion pipelines for AI Core.
- Implement data quality monitoring and validation systems.
- Model new types of data into foundation models.
- Analyze the impact of new data sources on existing models.
- Develop and maintain data preparation workflows for model training.
- Tune and optimize machine learning models with new datasets.
- Collaborate with AI Core ML, Platform, and Infra teams for seamless data flow.
- Lead technical initiatives to improve data engineering practices and mentor team members.
Requirements
- Typically 6+ years of experience in machine learning engineering or data engineering.
- Proven experience designing and building data ingestion pipelines at scale.
- Strong background in applied machine learning, including model training and evaluation.
- Experience analyzing data changes and their impact on model performance.
- Proficiency in Python for data engineering and ML workflows.
- Solid understanding of data quality principles and monitoring systems.
- Strong problem-solving skills for complex, ambiguous problems.
- Excellent communication skills for technical and non-technical stakeholders.
- Demonstrated leadership experience in mentoring and technical decision-making.
Benefits
- Opportunity to earn equity at Nu
- Medical, Dental, and Vision Insurance
- Life Insurance and AD&D
- Extended maternity and paternity leave
- Access to learning platforms and language programs
- Mental health and wellness assistance program
- 401(k) and savings plans
- Work-from-home allowance
- Relocation assistance package, if applicable
Tech Stack
Apache Airflow, Apache Spark, MLflow, Python
Categories
AI & ML, Data Engineering