Traditionally, the study of learning in biological systems was conducted at the expense of overlooking its lesser known counterparts: motivation and emotion. Every creature has innate abilities that accommodate its survival in the world. It can identify food, shelter, partners and danger, but these "simple mappings between stimuli and reactions will not be enough to keep the organisms from encountering problems." For example, if a given animal knows that its predator has qualities A, B and C, it may escape all creatures that have those qualities, and thus waste its energy and resources on non-existent danger.
It has been suggested that learning takes place in two fundamental steps. First, a particular stimulus is correlated with an emotional response. Second, this emotional consequence shapes an association between the stimulus and the response. This analysis is quite influential in part because it was one of the first to suggest that emotions play a key part in learning. In more recent studies, it has been shown that the association between a stimulus and its emotional consequence takes place in the amygdala. "In this region, highly analyzed stimulus representations in the cortex are associated with an emotional value. Therefore, emotions are properties of stimuli".
The task of the amygdala is thus to assign a primary emotional value to each stimulus that has been paired with a primary reinforcer – the reinforcer is the reward and punishment that the mammal receives. This task is aided by the orbitofrontal complex. "In terms of learning theory, the amygdala appears to handle the presentation of primary reinforcement, while the orbitofrontal cortex is involved in the detection of omission of reinforcement."
This image shows that the sensory input enters through the thalamus TH. In biological systems, the thalamus takes on the task of initiating the process of a response to stimuli. It does so by passing the signal to the amygdala and the sensory cortex. This signal is then analyzed in the cortical area – CX. In biological systems, the sensory cortex operates by distributing the incoming signals appropriately between the amygdala and the orbitofrontal cortex. This sensory representation in CX is then sent to the amygdala A through the pathway V. This is the main pathway for learning in this model. Reward and punishment enter the amygdala to strengthen the connection between the amygdala and the pathway. At a later stage if a similar representation is activated in the cortex, E becomes activated and produces an emotional response.
O, the orbitofrontal cortex, operates based on the difference between the perceived (i.e., expected) reward/punishment and the actual received reward/punishment. This perceived reward/punishment is the one that has been developed in the brain over time using learning mechanisms and it reaches the orbitofrontal cortex via the sensory cortex and the amygdala. The received reward/punishment on the other hand, comes courtesy of the outside world and is the actual reward/punishment that the species has just obtained. If these two are identical, the output ( E) is the same. If not, the orbitofrontal cortex inhibits and restrains emotional response to make way for further learning. So the path W is only activated in such conditions.
BELBIC is one such nonlinear controller – a neuromorphic controller based on the computational learning model shown above to produce the control action. This model is employed much like an algorithm in these control engineering applications; intelligence is not given to the system from the outside but is actually acquired by the system itself. This model has been employed as a feedback controller to be applied to control design problems.
BELBIC, which is a model-free controller, suffers from the same drawback of intelligent model-free controllers: it cannot be applied on unstable systems or systems with unstable equilibrium point. This is a natural result of the trial-and-error manner of the learning procedure, i.e., exploration for finding the appropriate control signals can lead to instability. By integrating imitative learning and Fuzzy logic, BELBIC is generalized in order to be capable of controlling unstable systems.
For predicting geomagnetic activity index; the various extended models are proposed by researchers. Babaei et al. presented multi agent model of brain emotional learning and Lotfi and Akbarzadeh proposed supervised learning version of brain emotional learning to forecast Geomagnetic Activity Indices.
|
|