A Deep Reinforcement Learning Model for Combinatorial Optimization and Fleet Dispatching in Mobility On-Demand Services

Hamzehi, Sascha Sirus Jakob

User: Guest

Dissertationen

Back
Back to start of result list
Permanent link for displayed object

If you experience problems opening the document, please try this link.

Original title:: A Deep Reinforcement Learning Model for Combinatorial Optimization and Fleet Dispatching in Mobility On-Demand Services
Translated title:: Ein Tief-Bestärkendes Maschinen-Lern-Modell für kombinatorische Optimierungen der Flottenlogistik und Mobilitätsdienstleistungen nach Anforderung
Author:: Hamzehi, Sascha Sirus Jakob
Year:: 2023
Document type:: Dissertation
Faculty/School:: TUM School of Engineering and Design
Advisor:: Bogenberger, Klaus (Prof. Dr.)
Referee:: Bogenberger, Klaus (Prof. Dr.); Schiffer, Maximilian (Prof. Dr.)
Language:: en
Subject group:: BAU Bauingenieurwesen, Vermessungswesen
Keywords:: Neural Networks, Combinatorial Optimization, Deep Reinforcement Learning, Fleet Dispatching
Translated keywords:: Neuronale Netze, Kombinatorische Optimierung, Tief-Bestärkendes Lernen, Flottenlogistik, automatisierte Flottenzuweisung
TUM classification:: BAU 850; BAU 875
Abstract:: This thesis describes the development and benchmarking of a deep reinforcement learning architecture using a pointer generation neural network with an Actor-Critic Monte-Carlo Policy Gradient training algorithm. The model is tested by solving real-world data-based combinatorial optimization problems for Fleet Dispatching and Minimum Weighted Bipartite Matching graphs by learning a competitive near-optimal and efficient heuristic automatically.
Translated abstract:: Diese Dissertation beschreibt die Entwicklung und den Leistungsvergleich einer „Deep Reinforcement Learning“ Architektur mit Zeiger-Generations-Netzwerk, das mit einem „Actor-Critic Monte-Carlo Policy Gradient“ Algorithmus antrainiert wird. Das Modell wird mit echten Daten anhand kombinatorischer Optimierungsprobleme für Flottenlogistik und gewichteten bipartiten Zuweisungsgraphen getestet. Das Modell erlernt hierbei automatisiert eine kompetitive nah-optimale und sehr effiziente Heuristik.
WWW:: https://mediatum.ub.tum.de/?id=1649715
Date of submission:: 17.06.2022
Oral examination:: 20.04.2023
File size:: 26256574 bytes
Pages:: 191
Urn (citeable URL):: https://nbn-resolving.de/urn/resolver.pl?urn:nbn:de:bvb:91-diss-20230420-1649715-1-2
Last change:: 17.07.2023
BibTeX