An extension library of WMMA API (Tensor Core API)
Updated Jul 12, 2024 - Cuda
FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme
Fast SGEMM emulation on Tensor Cores