WarpDrive: An Agentic Workflow for Ninja GPU Transformations

Sana Damani

Siva Kumar Sastry Hari

Mark Stephenson

Christos Kozyrakis Stanford

NeurIPS Workshop on Machine Learning for Systems, 2024


Abstract

Performance engineering for GPU-accelerated applications is challenging and time-consuming. We propose WarpDrive, a customizable LLM-driven performance analysis and optimization framework that automatically transforms and tests GPU applications. WarpDrive automates the optimization process using agents that analyze run time performance, create optimization plans, transform the code, and test for correctness.

We demonstrate its effectiveness by customizing it to four different levels of optimization, including compiler options, compiler hints, function-level transformations, and application-level transformations.