Full Text: PDF
Volume 3, Issue 1, 30 April 2021, Pages 105-114
Abstract. In this paper, we describe the problem of learning an optimal incentivization strategy that maximizes the service level given a fixed budget constraint for a sharing service such as bike-sharing, car-sharing, etc. in a spatiotemporal environment. The service level can be affected due to an imbalance in supply and demand at different locations during a specific time period. We describe and present our study and comparison of various reinforcement learning algorithms on a 1-D problem setting in a simulated bike-share system with a budget constraint on the incentives. We empirically study the performance of three policy gradient based reinforcement learning algorithms, namely: Proximal Policy Optimization (PPO), Trust Region Policy Optimization (TRPO), and Actor Critic using Kronecker-Factored Trust Region (ACKTR).
How to Cite this Article:
Shen-Shyang Ho, Matthew Schofield, Ning Wang, Learning incentivization strategy for resource rebalancing in shared services with a budget constraint, J. Appl. Numer. Optim. 3 (2021), 105-114.