
Senior HPC Developer - RDMA Networking
Clockwork.io
Posted about 22 hours ago
Base Salary
$150k - $230k/yr
Responsibilities
- Build and optimize high-performance GPU and networking subsystems.
- Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads.
- Debug performance issues across kernel, driver, GPU, and network layers.
- Develop and improve GPU-aware networking solutions.
- Profile, analyze, and tune system performance using low-level tooling.
- Collaborate closely with a small engineering team and take ownership of core systems.
Requirements
- 5+ years of experience in systems, HPC, or performance-critical software development.
- Strong proficiency in low-level C/C++.
- Solid understanding of RDMA networking, including InfiniBand, RoCE, and the IB Verbs API.
- Experience working with multi-node, multi-GPU workloads.
- Familiarity with collective communication libraries and communication algorithms.
- Ability and willingness to debug complex issues across hardware and software boundaries.
- Curiosity and eagerness to learn in a fast-moving startup environment.
Benefits
- Challenging projects.
- A friendly and inclusive workplace culture.
- Competitive compensation.
- A great benefits package.
- Catered lunch.