
Machine Learning System Software Architect
Baidu
about 1 year ago
Sunnyvale, CA, USA
Senior / Staff+
H1B Sponsor
Responsibilities
- Architect and create high-performance machine learning system software.
- Build distributed AI training systems connecting Kunlun Accelerators and servers.
- Develop system-level ML architectures focusing on performance, power, and latency.
- Monitor AI industry trends and integrate them into product roadmaps.
- Evaluate power, performance, and cost of architectures and subsystems.
- Collaborate with software engineers to optimize the hardware/software stack.
- Work with SoC design, verification, and validation engineers.
Requirements
- 5+ years of experience architecting software solutions for Machine Learning and distributed training systems.
- Knowledge of the Machine Learning market and emerging applications.
- Experience with deep learning frameworks like TensorFlow, PyTorch, or PaddlePaddle.
- Strong outreach skills to ML researchers and application developers.
- Familiarity with CPUs, GPUs, memory systems, and accelerators.
- Experience in performance simulation and modeling using C++.
- MS or PhD in Electrical or Computer Engineering.
- Excellent communication skills in English and Chinese.
Tech Stack
C++PyTorchTensorFlow
Categories
AI & MLData Engineering