Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. The GenAI Performance Engineer will optimize the performance of Modular’s MAX product, collaborating with various engineering teams to enhance AI model deployment and performance analysis tooling.
Responsibilities:
- Measure, analyze, and identify opportunities to improve the performance of the MAX product under realistic and relevant usage patterns
- Partner with the product and customer teams to understand the performance of the MAX product in both standard and cutting edge AI applications and design benchmarks to reflect them
- Collaborate with the kernels and GenAI modeling team to bring up new model families
- Collaborate with the kernels and runtime team to bring up and optimize new GPUs and accelerators
- Collaborate with the cloud team to design and benchmark advanced serving features and new serving algorithms
- Build statistical models and tools to operate on benchmarking & telemetry data and help develop key insights for performance, cloud costs, etc
Requirements:
- 5+ years of professional or postgraduate academic experience working on or researching performance analysis, tooling or benchmarking
- Expertise in performance measurement (i.e. benchmarking), modeling, and analysis on real-world workloads
- Extensive experience with Python
- Creativity and curiosity for solving complex problems
- Experience writing production-quality software
- Strong written and verbal communication skills