GrepJob
Stack AV

Senior Software Engineer, Machine Learning Inference Platform

Stack AV
Apply
about 3 hours ago
Remote, Worldwide or Pittsburgh, PA, USASenior / Mid Level
H1B Sponsor

Responsibilities

  • Own technical design and delivery of subsystems in a high-throughput, low-latency inference platform.
  • Develop robust API layers and developer SDKs for distributed inference orchestration.
  • Build and harden a multi-tenant control plane for accurate metering and tenant isolation.
  • Optimize inference performance across the entire system stack.
  • Build observability and SLOs for insights into system economics and performance.
  • Collaborate with product and infrastructure teams on model onboarding and customer adoption.
  • Mentor engineers and drive issues to closure while maintaining code quality.

Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
  • 4+ years of experience building and operating backend distributed systems end to end.
  • Strong fundamentals in data-intensive distributed systems, concurrency, and performance profiling.
  • Hands-on experience with large-scale inference services on GPUs.
  • Direct experience with inference engines or serving frameworks.
  • Strong programming skills in C++, Go, Rust, or Python.
  • Familiarity with deep learning frameworks and GPU computing primitives.
  • Practical understanding of high-performance networking architectures.
  • Strong analytical and problem-solving skills.
  • Experience with autonomous vehicles is a bonus.

Categories

AI & MLBackendData Engineering