Optimizing a WebGPU Matmul Kernel for 1TFLOP+ Performance

Building Surfgrad, a high-performance, WebGPU-powered autograd library.

Read in full here:
