Novelty detection improves performance of reinforcement learners in fluctuating, partially observable environments

Publication date: Available online 12 June 2019Source: Journal of Theoretical BiologyAuthor(s): Sarah E. MarzenAbstractEvolved and engineered organisms must adapt to fluctuating environments that are often only partially observed. We show that adaptation to a second environment can be significantly harder after adapting to a first, completely unrelated environment, even when using second-order learning algorithms and a constant learning rate. In effect, there is a lack of fading memory in the organism’s performance. However, organisms can adapt well to the second environment by incorporating a simple novelty detection algorithm that signals when the environment has changed and reinitializing the parameters that define their behavior if so. We propose that it may be fruitful to look for signs of this novelty detection in biological organisms, and to engineer novelty detection algorithms into artificial organisms.
Source: Journal of Theoretical Biology - Category: Biology Source Type: research