: Optimizing the dispatching and rebalancing of autonomous vehicle fleets (e.g., ride-sharing services) to minimize wait times and maximize efficiency.
: Filippos Christianos, Georgios Papoudakis, Aris Filos, and Stefano V. Albrecht.
: A novel deep reinforcement learning (DRL) approach that uses a hierarchical structure to improve sample efficiency, meaning that the system learns effective strategies from significantly less training data than traditional methods.