Tags:Hebbian learning, HMM, Sequence memory and Temporal memory
Abstract:
Learning models online in partially observable stochastic environments can still be challenging for artificial intelligent agents. In this paper, we propose an algorithm for the probabilistic modeling of observation sequences based on the neurophysiological model of the human cortex, which is notoriously fit for this task. We argue that each dendritic segment of a pyramidal neuron may be considered an independent naive Bayesian detector of afferent neuron activity patterns. Experiments show that our model can learn the dynamics of the partially observable environments for very few interactions online and reliably predict probabilistic distributions of observations for several future time steps using Monte Carlo sampling. Additionally, we compare our algorithm with a biologically inspired HMM implementation of temporal memory and standard LSTM on both Markov chain-generated character sequences and observation image sequences in a pinball-like environment.
Learning Hidden Markov Model of Stochastic Environment with Bio-Inspired Probabilistic Temporal Memory