Introduction

An AND program encodes a simple model of behaviour: responding to a sequence of stimuli by generating a sequence of responses. The stimuli and responses are just abstract symbols A to Z. But we imagine that they correspond to real-world sensory inputs and motor outputs. In the case of gene regulation, these symbols correspond to the presence or absence of particular molecular signals that initiate or inhibit gene expression, and to the regulatory and structural proteins produced as a result.

In many ways, this resembles a policy, in reinforcement learning, which maps states to actions. Add something here about policies and behaviour and references to xxx book.

We can bring these abstract symbols to life by wiring them to a simple agent, and embedding the agent in an environment. The stimuli are generated by the agent’s sensors as it moves about the environment, and the responses control the subsequent actions of the agent. We can think of this cell as a minimal agent, with the AND program encoding its cognition, and the sensorimotor system encoding its body and how it interacts with the world.

We will evolve our AND program so the cell can achieve a task in its environment. Importantly, the sensorimotor architecture remains fixed; only the cognitive wiring, controlled by the AND program, is subject to evolution.

The aim in this section to show that this simple architecture, inspired by gene regulation, can be evolved to respond adaptively to a changing environment. Our AND program effectively encodes the cognition of the cell.