Baidu

Machine Learning System Software Architect

Baidu

Apply
about 1 year ago
Sunnyvale, CA, USA
Senior / Staff+
H1B Sponsor

Responsibilities

  • Architect and create high-performance machine learning system software.
  • Build distributed AI training systems connecting Kunlun Accelerators and servers.
  • Develop system-level ML architectures focusing on performance, power, and latency.
  • Monitor AI industry trends and integrate them into product roadmaps.
  • Evaluate power, performance, and cost of architectures and subsystems.
  • Collaborate with software engineers to optimize the hardware/software stack.
  • Work with SoC design, verification, and validation engineers.

Requirements

  • 5+ years of experience architecting software solutions for Machine Learning and distributed training systems.
  • Knowledge of the Machine Learning market and emerging applications.
  • Experience with deep learning frameworks like TensorFlow, PyTorch, or PaddlePaddle.
  • Strong outreach skills to ML researchers and application developers.
  • Familiarity with CPUs, GPUs, memory systems, and accelerators.
  • Experience in performance simulation and modeling using C++.
  • MS or PhD in Electrical or Computer Engineering.
  • Excellent communication skills in English and Chinese.

Tech Stack

C++PyTorchTensorFlow

Categories

AI & MLData Engineering