Markov Decision Processes and Reinforcement Learning for Timely UAV-IoT Data Collection Applications (Studies in Computational Intelligence, 1220, Band 1220)