This work gives a complete overview of performing dense floating-point matrix multiply-accumulate operations on NVIDIA Tensor Cores. In 2017, NVIDIA unveiled the first generation of Tensor Cores as part of the Volta architecture. Today, Tensor Cores are an essential part of the compute hardware of data centers worldwide. As NVIDIA GPUs and these compute units have evolved, their capabilities have expanded. This work reviews the current capabilities of Tensor Cores and how to leverage their performance. Tensor Cores are well established in numerous applications that are not sensitive to precision loss. This work shows that Tensor Cores are not limited to such applications and should be exploited by any application that performs dense floating-point matrix multiply-accumulate operations. An API for carrying out Tensor Core operations has been implemented as part of this work. The API demonstrates practical Tensor Core programmability using two different approaches. A benchmark assessing Tensor Core performance and precision has also been developed as part of this work.