XPENG

Staff Machine Learning Engineer - AI Foundation

27 days ago
Santa Clara, CA, USA
Staff+

Base Salary

$215k - $364k/yr

Responsibilities

  • Optimize transformer-based LLMs for low-latency and high-throughput inference.
  • Optimize kernels and model graphs using tools like CUDA, Triton, and custom fused operators.
  • Implement and benchmark techniques such as quantization and knowledge distillation.
  • Deploy optimized models across GPUs, CPUs, and edge accelerators.
  • Contribute to internal tooling and documentation for model optimization flows.

Requirements

  • Master's degree in CS/CE/EE or equivalent with 5-8 years of industry experience.
  • Good knowledge of PyTorch.
  • Knowledge of transformer architecture and ways to accelerate training and inference.

Benefits

  • A fun, supportive, and engaging environment.
  • Infrastructure and computational resources to support your work.
  • Opportunity to work on cutting-edge technologies with top talent in the field.
  • Opportunity to make a significant impact on the transportation revolution.
  • Competitive compensation package.
  • Snacks, lunches, dinners, and fun activities.

Tech Stack

C++, Python, PyTorch

Categories

AI & ML, Data Science