Clockwork.io

Senior HPC Developer - RDMA Networking

about 22 hours ago
Palo Alto, CA, USA
Senior
H1B Sponsor

Base Salary

$150k - $230k/yr

Responsibilities

  • Build and optimize high-performance GPU and networking subsystems.
  • Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads.
  • Debug performance issues across kernel, driver, GPU, and network layers.
  • Develop and improve GPU-aware networking solutions.
  • Profile, analyze, and tune system performance using low-level tooling.
  • Collaborate closely with a small engineering team and take ownership of core systems.

Requirements

  • 5+ years of experience in systems, HPC, or performance-critical software development.
  • Strong proficiency in low-level C/C++.
  • Solid understanding of RDMA networking, including InfiniBand, RoCE, and IBVerbs.
  • Experience working with multi-node, multi-GPU workloads.
  • Familiarity with collective communication libraries and communication algorithms.
  • Ability and willingness to debug complex issues across hardware and software boundaries.
  • Curiosity and eagerness to learn in a fast-moving startup environment.

Benefits

  • Challenging projects.
  • A friendly and inclusive workplace culture.
  • Competitive compensation.
  • A great benefits package.
  • Catered lunch.

Tech Stack

C, C++

Categories

AI & ML, Backend, Data Engineering