Skip to yearly menu bar Skip to main content


Talk
in
Workshop: First Workshop on Quantum Tensor Networks in Machine Learning

Invited Talk 7: cuTensor: High-Performance CUDA Tensor Primitives

Paul Springer


Abstract:

This talk discusses cuTENSOR, a high-performance CUDA library for tensor operations that efficiently handles the ubiquitous presence of high-dimensional arrays (i.e., tensors) in today's HPC and DL workloads. This library supports highly efficient tensor operations such as tensor contractions, element-wise tensor operations such as tensor permutations, and tensor reductions. While providing high performance, cuTENSOR also enables users to express their mathematical equations for tensors in a straightforward way that hides the complexity of dealing with these high-dimensional objects behind an easy-to-use API.