Document type:
Master's thesis
Author(s):
Jakob Taube
Title:
Analyzing Approximations of Computation in Inference Hardware for Transformers
Translated title (German):
Analyse von Berechnungsapproximationen in Inferenz-Hardware für Transformer
Abstract:
Transformer architectures, such as large language models, have revolutionized the field of deep learning, achieving state-of-the-art performance across a variety of tasks. Despite their success, their computational demands result in substantial operational costs at scale and pose major challenges for deployment in resource-constrained environments. This thesis proposes approximation techniques within inference hardware to enhance the computational efficiency of transformers by leveraging a confi...
Supervisor:
Felix Dietrich
Advisor(s):
Thomas Pfeil; Lukas Wiest
Year:
2024
Quarter:
4th quarter
Year / Month:
2024-12
Month:
Dec
Language:
en
University:
Technical University of Munich
Faculty:
TUM School of Computation, Information and Technology