Modular is on a mission to revolutionize AI infrastructure by rebuilding the AI software stack. The GenAI Performance Engineer will optimize the performance of Modular’s MAX product, collaborating with various engineering teams to enhance AI model deployment and performance analysis tooling.

Responsibilities:

Measure, analyze, and identify opportunities to improve the performance of the MAX product under realistic and relevant usage patterns
Partner with the product and customer teams to understand the performance of the MAX product in both standard and cutting edge AI applications and design benchmarks to reflect them
Collaborate with the kernels and GenAI modeling team to bring up new model families
Collaborate with the kernels and runtime team to bring up and optimize new GPUs and accelerators
Collaborate with the cloud team to design and benchmark advanced serving features and new serving algorithms
Build statistical models and tools to operate on benchmarking & telemetry data and help develop key insights for performance, cloud costs, etc

Requirements:

5+ years of professional or postgraduate academic experience working on or researching performance analysis, tooling or benchmarking
Expertise in performance measurement (i.e. benchmarking), modeling, and analysis on real-world workloads
Extensive experience with Python
Creativity and curiosity for solving complex problems
Experience writing production-quality software
Strong written and verbal communication skills

GenAI Performance Engineer

Key skills

About this role

Responsibilities:

Requirements: