about 16 hours ago
Responsibilities
- Design and implement load-balancing algorithms for AI workloads.
- Architect and evolve inference infrastructure for seamless model deployment.
- Drive initiatives to minimize tail latency and maximize throughput.
- Build infrastructure-as-code and CI/CD pipelines for dynamic compute fleets.
- Leverage telemetry data to optimize system performance.
- Lead cross-functional initiatives and mentor team members.
Requirements
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
- 8–12 years of progressive software engineering experience.
- Strong proficiency in Go or Python with a focus on networked systems.
- Expert-level knowledge of Kubernetes and containerization.
- Proven experience with load balancing and request routing at scale.
- A strong ownership mindset with experience in high-availability systems.
Benefits
- Hybrid work model.