about 3 hours ago
Remote, Worldwide or Pittsburgh, PA, USASenior / Mid Level
H1B Sponsor
Responsibilities
- Own technical design and delivery of subsystems in a high-throughput, low-latency inference platform.
- Develop robust API layers and developer SDKs for distributed inference orchestration.
- Build and harden a multi-tenant control plane for accurate metering and tenant isolation.
- Optimize inference performance across the entire system stack.
- Build observability and SLOs for insights into system economics and performance.
- Collaborate with product and infrastructure teams on model onboarding and customer adoption.
- Mentor engineers and drive issues to closure while maintaining code quality.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
- 4+ years of experience building and operating backend distributed systems end to end.
- Strong fundamentals in data-intensive distributed systems, concurrency, and performance profiling.
- Hands-on experience with large-scale inference services on GPUs.
- Direct experience with inference engines or serving frameworks.
- Strong programming skills in C++, Go, Rust, or Python.
- Familiarity with deep learning frameworks and GPU computing primitives.
- Practical understanding of high-performance networking architectures.
- Strong analytical and problem-solving skills.
- Experience with autonomous vehicles is a bonus.