Delos Data is a stealth-mode startup building technology to improve performance and scalability in AI data center clusters. The company is seeking a talented System Software Engineer to design and implement the essential software that lets large-scale AI models run efficiently across GPUs.
Responsibilities:
- Collaborate across the stack to influence the design of our foundational technology, ensuring it meets the needs of next-generation AI models
- Identify and resolve performance bottlenecks in distributed training and inference workloads through deep-dive analysis of the software-hardware interface
- Conduct rigorous performance benchmarking and characterization on multi-node clusters