User: Guest  Login
Title:

Neural Nets with a Newton Conjugate Gradient Method on Multiple GPUs

Document type:
Zeitschriftenaufsatz
Author(s):
Severin Reiz; Tobias Neckel; Hans-Joachim Bungartz
Abstract:
Training deep neural networks consumes increasing computational resource shares in many compute centers. Often, a brute force approach to obtain hyperparameter values is employed. Our goal is (1) to enhance this by enabling second-order optimization methods with fewer hyperparameters for large-scale neural networks and (2) to compare optimizers for specific tasks to suggest users the best one for their problem. We introduce a novel second-order optimization method that requires the effect of the...     »
Keywords:
Numerical methods Machine learning; Deep learning; Second-order optimization; Data-parallelism
Congress title:
14th International Conference on Parallel Processing and Applied Mathematics
Journal title:
In Proceedings of the 14th International Conference on Parallel Processing and Applied Mathematics
Year:
2022
Year / month:
2022-09
Quarter:
3. Quartal
Month:
Sep
Pages contribution:
13
Reviewed:
ja
Language:
en
WWW:
Springer Link
Submitted:
10.05.2022
Accepted:
09.09.2022
Date of publication:
28.04.2023
Semester:
WS 22-23
TUM Institution:
Department of Informatics
 BibTeX