You need to change the recorded kernels to others.

Look for a Go package that interfaces with CUDA. The Go compiler can target WASM, but WASM code still runs on the CPU, so it is unlikely to give you the computational power you are seeking by moving to a GPU.