ConnectHum

Audio | Multimodal ML Engineer

21 days ago
Paris, France
Mid Level / Senior

Responsibilities

  • Train and fine-tune large-scale audio and multimodal models.
  • Design and run experiments for architecture and training strategies.
  • Build and optimize audio data pipelines.
  • Improve inference speed and production readiness.
  • Deploy models end-to-end in low-latency environments.
  • Define meaningful evaluation metrics beyond benchmark scores.
  • Collaborate closely with research and engineering teams.

Requirements

  • 3+ years of experience training deep learning models in audio or speech domains.
  • Strong experience with distributed training frameworks.
  • Solid understanding of audio signal processing fundamentals.
  • Experience shipping models to production with a focus on latency.
  • Experience building and maintaining data pipelines.
  • Strong engineering hygiene including clean code and testing.

Benefits

  • Competitive compensation and equity.
  • Hybrid work setup in Europe with relocation support.
  • Comprehensive health coverage.
  • Access to top-tier hardware and tools.
  • Team off-sites and budget for learning and AI tooling.

Tech Stack

PyTorch

Categories

AI & ML
Data Engineering