Inworld

Staff / Principal Machine Learning Engineer, Serving - USA

Posted about 1 month ago
Mountain View, CA, USA
Staff+ / Senior

Base Salary

$270k - $500k/yr

Responsibilities

  • Develop and optimize real-time multimodal models and serving frameworks.
  • Implement inference optimization techniques and model acceleration strategies.
  • Build high-performance systems in languages such as C++, CUDA, or Rust.
  • Manage distributed systems and scaling for high concurrency.
  • Take ownership of models from research to production deployment.

Requirements

  • Deep understanding of inference optimization and serving frameworks.
  • Hands-on experience with model acceleration techniques like quantization and caching.
  • Proficiency in high-performance programming languages and in profiling code.
  • Experience with distributed systems, Kubernetes, and multi-GPU inference.
  • PhD in CS, Physics, Math, or equivalent practical experience.

Benefits

  • Relocation assistance for new team members.
  • Collaborative in-person work environment.
  • Opportunities for open-source contributions and visibility of work.

Tech Stack

C++, Kubernetes, Python, Rust

Categories

AI & ML, Backend, Data Engineering