Senior Software Engineer – Training & Registry (AI Platform)
Datadog
8 months ago
Paris, France
Senior
H1B Sponsor
Responsibilities
- Design and implement scalable systems for training orchestration and model registration.
- Streamline ML experimentation workflows by integrating advanced tooling.
- Develop APIs and services for launching and tracking training jobs.
- Build robust version control and metadata systems for model artifacts.
- Collaborate with AI infrastructure teams to ensure integrated user experiences.
- Mentor engineers and influence architectural decisions.
Requirements
- 6+ years of experience in backend, distributed systems, or platform engineering.
- Experience with ML platforms or infrastructure supporting training workflows.
- Proficient in designing APIs and managing data at scale.
- Fluent in Python or Go, with experience in cloud-native tools.
- Ability to translate scientific requirements into reliable systems.
- Bonus: experience with model registries and experiment tracking tools.
Benefits
- New hire stock equity (RSUs) and employee stock purchase plan (ESPP).
- Continuous professional development and career pathing opportunities.
- Intradepartmental mentor and buddy program for networking.
- Inclusive company culture with Community Guilds for employee resource groups.
- Access to internal panel discussions on inclusion.
- Free global mental health benefits for employees and dependents.
Tech Stack
Apache AirflowGoKubernetesMLflowPython
Categories
AI & MLBackend