Solvi Arnold
Proceedings Papers
isal2024, ALIFE 2024: Proceedings of the 2024 Artificial Life Conference, 12 (July 22–26, 2024), 10.1162/isal_a_00725
Abstract
Advanced biological intelligence learns efficiently from an information-rich stream of stimulus information, even when feedback on behaviour quality is sparse or absent. Such learning exploits implicit assumptions about task domains. We refer to such learning as Domain-Adapted Learning (DAL). In contrast, AI learning algorithms rely on explicit externally provided measures of behaviour quality to acquire fit behaviour. This imposes an information bottleneck that precludes learning from diverse non-reward stimulus information, limiting learning efficiency. We consider the question of how biological evolution circumvents this bottleneck to produce DAL. We propose that species first evolve the ability to learn from reward signals, providing inefficient (bottlenecked) but broad adaptivity. From there, integration of non-reward information into the learning process can proceed via gradual accumulation of biases induced by such information on specific task domains. This scenario provides a biologically plausible pathway towards bottleneck-free, domain-adapted learning. Focusing on the second phase of this scenario, we set up a population of NNs with reward-driven learning modelled as Reinforcement Learning (A2C), and allow evolution to improve learning efficiency by integrating non-reward information into the learning process using a neuromodulatory update mechanism. On a navigation task in continuous 2D space, evolved DAL agents show a 300-fold increase in learning speed compared to pure RL agents. Evolution is found to eliminate reliance on reward information altogether, allowing DAL agents to learn from non-reward information exclusively, using local neuromodulation-based connection weight updates only. Code available at github.com/aislab/dal.
Proceedings Papers
ecal2013, ECAL 2013: The Twelfth European Conference on Artificial Life, 425-430 (September 2–6, 2013), 10.1162/978-0-262-31709-2-ch061
Proceedings Papers
alife2012, ALIFE 2012: The Thirteenth International Conference on the Synthesis and Simulation of Living Systems, 301-308 (July 19–22, 2012), 10.1162/978-0-262-31050-5-ch040