Roulette Wheel Selection Algorithm and Reinforcement Learning