This website presents a collection of research projects focused on the performance and portability of compute kernels on WebGPUs, with an emphasis on key operations such as Matrix Multiplication, Matrix-Vector Multiplication, and Flash Attention.
By running a kernel, you consent to the collection of device and performance data for academic research purposes.