Lemurian Labs is reimagining the foundations of computing to make AI accessible to everyone. They are seeking a Runtime Engineer to design, develop, and maintain a multi-target runtime, utilizing techniques in parallelization and partitioning while working closely with the product team to enhance performance for ML engineers.
Responsibilities:
- Design, develop, maintain and improve our multi-target runtime
- Use the latest techniques in parallelization and partitioning to automate generation and exploit highly optimized kernels
- Rapid prototyping and data driven exploration of new ideas
- Benchmark and analyze the outputs produced by our optimizing compiler on target hardware
- Work closely with our product team to understand the evolving needs of ML engineers and drive improvements in architecture
- Build tools to collect and analyze performance bottleneck