Introduction
Reinforcement Learning is a part of Machine Learning, where an agent learns to behave in an environment. The agents performs certain actions and observes the rewards and the results which it gets from these actions. Essentially, it is a closed loop problem because the learner system’s actions influence its later inputs and in this case the learner is not guided in terms or which actions to take. Instead, the learner should discover which actions yield the most reward by trying them out. . There are various approaches to solving reinforcement learning problems that will be discussed in the following sections.
The field of reinforcement learning has exploded in the past few years and has led to major advancements in areas such as control and robotics. A few modern day examples of reinforcement learning based advancements are Google DeepMind beating a professional Alpha Go Played and the OpenAI team beating a professional DOTA player.
The basic idea of reinforcement learning is simply to capture the most important aspects of the real problem facing a learning agent interacting with its environment to achieve a goal. The formulation of a reinforcement learning problem is to include three major aspects – sensation, action and goal – in their simplest possible forms without trivializing the problem. Therefore, this model formulation can be highly useful in the transportation engineering area as it allows a real world problem to be formulated and analyzed to solve complex issues of alleviating traffic and improving driving experience. There have multiple transportation based studies that use reinforcement learning for real time optimization of traffic control policies. These studies focus on optimizing traffic control policies to empower Intelligent Transportation Systems (ITS) devices with the functions for faster computing analytics as well as different functionalities for signal processing in connected and automated vehicle environments. Additionally, there is a huge influx of transportation trajectory data and using inverse reinforcement learning, much of this data can be analysed and modelled into useful information. Some of these applications have been explored in the following sections.
Last updated