Clockwork Systems builds software-driven fabrics for AI workloads, with the aim of redefining distributed computing. The company is seeking a Senior HPC Developer to build and optimize high-performance GPU and networking subsystems as part of a small engineering team.
Responsibilities:
- Build and optimize high-performance GPU and networking subsystems
- Work with collective communication libraries and algorithms for multi-node, multi-GPU workloads
- Debug performance issues across kernel, driver, GPU, and network layers
- Develop and improve GPU-aware networking solutions
- Profile, analyze, and tune system performance using low-level tooling
- Collaborate closely with a small engineering team and take ownership of core systems