User: Guest  Login
Document type:
Masterarbeit
Author(s):
Jakob Taube
Title:
Analyzing Approximations of Computation in Inference Hardware for Transformers
Translated title:
Analyse von Berechnungsapproximationen in Inferenz-Hardware für Transformer
Abstract:
Transformer architectures, such as large language models, have revolutionized the field of deep learning, achieving state-of-the-art performance across a variety of tasks. Despite their success, their computational demands result in substantial operational costs at scale and pose major challenges for deployment in resource-constrained environments. This thesis proposes approximation techniques within inference hardware to enhance the computational efficiency of transformers by leveraging a confi...     »
Supervisor:
Felix Dietrich
Advisor:
Thomas Pfeil; Lukas Wiest
Year:
2024
Quarter:
4. Quartal
Year / month:
2024-12
Month:
Dec
Language:
en
University:
Technical University of Munich
Faculty:
TUM School of Computation, Information and Technology
 BibTeX