Self-Replication in
Neural Networks
Thomas Gabor∗
LMU Munich
thomas.gabor@ifi.lmu.de
Abstract A key element of biological structures is self-replication.
Neural networks are the prime structure used for the emergent
construction of complex behavior in computers. We analyze how
various network types lend themselves to self-replication.
Backpropagation turns out to be the natural way to navigate the
space of network weights and allows non-trivial self-replicators to
arise naturally. We perform an in-depth analysis to show the
self-replicators’ robustness to noise. We then introduce artificial
chemistry environments consisting of several neural networks and
examine their emergent behavior. In extension to this work’s
previous version (Gabor et al., 2019), we provide an extensive
analysis of the occurrence of fixpoint weight configurations within
the weight space and an approximation of their respective
attractor basins.
Steffen Illium
LMU Munich
Maximilian Zorn
LMU Munich
Cristian Lenta
LMU Munich
Andy Mattausch
LMU Munich
Lenz Belzner
Technische Hochschule Ingolstadt
Claudia Linnhoff-Popien
LMU Munich
Palabras clave
Neural network, self-replication, artificial
chemistry system, soup, weight space
1 Introducción
Dawkins (1976) stressed the importance of self-replication to the origin of life. He argued that
proto-RNA was able to copy its molecule structure within a soup of randomly interacting elements.
This allowed it to reach a stability in concentration that could not be maintained by any other kind
of structure. As the story goes, life evolved more or less as an elaborate means to maintain the
copying of structural information.
Since the early days of computing, the recreation of biological structures has been a target of
investigación, starting from the early formulation of an evolutionary process by Turing (1950) and in-
cluding famous examples like Box (1957), jardinero (1970), or Dorigo and Di Caro (1999). También
see the overviews given by Koza (1994) or Bäck et al. (1997). Although also conceived very early,
(Minsky & Papert, 1972; Rosenblatt, 1958), neural networks have only rather recently found broad
practical application for advanced tasks like image recognition (Krizhevsky et al., 2012), speech
recognition (Hinton et al., 2012), or strategic game playing (Silver et al., 2017). The variety of uses
shows that neural networks are a powerful tool of abstraction for various domains. Sin embargo, en
* Autor correspondiente.
© 2022 Massachusetts Institute of Technology Artificial Life 28: 205–223 (2022) https://doi.org/10.1162/artl_a_00359
yo
D
oh
w
norte
oh
a
d
mi
d
F
r
oh
metro
h
t
t
pag
:
/
/
d
i
r
mi
C
t
.
metro
i
t
.
mi
d
tu
a
r
t
yo
/
/
yo
a
r
t
i
C
mi
–
pag
d
F
/
/
/
/
2
8
2
2
0
5
2
0
3
2
7
7
7
a
r
t
yo
/
_
a
_
0
0
3
5
9
pag
d
.
F
b
y
gramo
tu
mi
s
t
t
oh
norte
0
7
S
mi
pag
mi
metro
b
mi
r
2
0
2
3
t. Gabor et al.
Self-Replication in Neural Networks
all these cases neural networks are used with a certain intent, es decir., equipped with an extrinsic goal
función. Through backpropagation, the distance of the network’s output to the goal function can
be systematically minimized, making the network match the goal to an increasing extent.
The wide variety of application domains shows the power of neural networks as a functional ab-
straction. For other functional abstractions, such as expressions in the λ-calculus (Church, 1932)
or a variety of assembler-like instruction sets and automata (Dittrich et al., 2001; Görnerup &
Crutchfield, 2008), it is known that, when a population of random instances of said functional
abstractions is set up and allowed to interact, self-replicators arise naturally (see Fontana & Buss,
1996, or Dittrich & Banzhaf, 1998, respectivamente). For neural networks, Chang and Lipson (2018)
have shown that self-application (es decir., constructing new neural networks by applying neural networks
to other neural networks) may lead to the formation of a self-replicating structure, albeit a rather
trivial instance of one. In this article, nosotros (a) repeat these results for a broader range of neural network
architectures and (b) extend the interaction model by the notion of self-training, which yields lots of
non-trivial fixpoints. Nosotros entonces (C) subject these fixpoints to various degrees of noise and analyze their
comportamiento, shedding light on how fixpoints occur within the network weight space. The examined
setup allows us to (d) construct an artificial chemistry setup using neural networks as individuals that
(under certain circumstances, por supuesto) reliably produces a variety of non-trivial self-replicators.
This article is an extended version of the article originally published by Gabor et al. (2019) con
the substantial addition of contribution (C) as described above. The additional content can be found
mainly in the new section 3.3, Weight Space Analysis, including Figures 7–10, and the final exper-
iment described in section 3.5, including Figure 14. This article is structured as follows: We first
describe all formal definitions of our approach and then provide a series of experiments examining
the behavior of self-replicating networks; among the latter, we first discuss experiments on single
independent neural networks and then continue with experiments on soups consisting of multiple
interacting networks. Después, we briefly discuss related work and provide a conclusion.
2 Acercarse
We provide a brief introduction to how neural networks function, then we proceed to discuss how
to apply neural networks to other neural networks and how to train neural networks using other
neural networks.
2.1 Basics
Neural networks are most commonly made from layers of neurons that are connected to adjacent
layers of neurons. What all neurons have in common is the base functionality of receiving input
valores (in the form of a matrix or vector), applying weights and biases (given as the network’s
parámetros), and computing output values via a specific activation function (given as part of the
network’s architecture). Note that while neural networks originated as a model of biological neurons,
they cannot accurately fulfill that role with respect to modern knowledge about biological neurons
and instead serve as general function approximators in machine learning.
yo
D
oh
w
norte
oh
a
d
mi
d
F
r
oh
metro
h
t
t
pag
:
/
/
d
i
r
mi
C
t
.
metro
i
t
.
mi
d
tu
a
r
t
yo
/
/
yo
a
r
t
i
C
mi
–
pag
d
F
/
/
/
/
2
8
2
2
0
5
2
0
3
2
7
7
7
a
r
t
yo
/
_
a
_
0
0
3
5
9
pag
d
.
F
b
y
gramo
tu
mi
s
t
t
oh
norte
0
7
S
mi
pag
mi
metro
b
mi
r
2
0
2
3
The most basic form of a feedforward network is a single-layer perceptron, consisting of many
fully connected cells that provide a transformation of the input based on their learned parameters.
Mathematically, each cell in such a network is described by a function
(cid:2)
y = f (
wixi + b)
i
where xi is the value produced by the i-th input cell, wi is the weight assigned to that connection,
b is a cell-specific bias, f is the activation function, and y is the cell’s output.
The recurrent neural network (RNN) structure allows cells to retain some information throughout
multiple executions: The result of the evaluation step t is passed to the evaluation at step t + 1 como
vector ht+1. A recurrent cell’s activation at every timestep t is ht = f (Wxt + Wht−1) where W are
206
Artificial Life Volume 28, Número 2
t. Gabor et al.
Self-Replication in Neural Networks
the network weights (Chung et al., 2014). This allows for more powerful models when processing
sequential inputs.
A neural network thus defines a function N : Rp → Rq for input length p and output length q.
For an input vector x ∈ Rp we write the computation of the corresponding output vector y ∈ Rq as
y = N (X). A neural network N is usually given by (a) its architecture, es decir., the types of neurons used,
their activation function, and their topology and connections, así como (b) its parameters, es decir., el
weights assigned to the connections. Whenever the architecture of a neural network is fixed, podemos
define a neural network by its parameters (cid:5)N ∈ Rr. Note that r =def | (cid:5)norte | depends on the amount of
internal connection and hidden layers, but as all inputs and all outputs must be connected somehow
to other cells in the network it always holds that r > p and r > q.
2.2 Application
In the course of this work, we are interested in having neural networks that can be applied to
other neural networks (and can output other neural networks). It is evident that if we want neural
networks to self-replicate, we need to enable them to output an encoding of a neural network
containing at least as many weights as they contain themselves. We discuss multiple approaches to
do so but first introduce a general notation covering all the approaches: We write O = N (cid:2) METRO
to mean that O is the neural network that is generated as the output of the neural network N
when given the neural network M as input. When M and O are sufficiently smaller than N , es decir.,
si | (cid:5)METRO| (cid:6) | (cid:5)norte | y | (cid:5)oh| (cid:6) | (cid:5)norte |, then we can simply define the output network O via its weights
(cid:5)O = N ( (cid:5)METRO). Sin embargo, these conditions obviously do not allow for self-replication. De este modo, nosotros
introduce several reductions that allow us to define the application operator (cid:2) differently and open
it up for self-replication. Note that for these definitions, we assume that M and O have the same
architecture and that the application of N keeps the size of the input network, es decir., METRO : Rp → Rp
for some p and | (cid:5)METRO| = | (cid:5)oh| = p.
Reduction 1 (Weightwise Application). We define N : R4 → R fixed. Dejar (cid:5)m = (cid:7)vi(cid:8)
then set
0≤i<| (cid:5)M|. We
(cid:5)O = (cid:7)wi(cid:8)
0≤i<| (cid:5)O|
where
wi = N (vi, l(i), c(i), p(i))
the number of
and l(i) is
the weight
i leads into, and p(i) is the positional number of weight i among the weights of its cell. We use (cid:5)O to define
O = N
the number of
the weight
c(i) is
layer of
that
cell
the
the
i,
M.
Note that l, c, p depend purely on the networks’ architectures and the index of the weight i, not
on the value of the weight vi. Theoretically, we could pass on i to the network directly, but it seemed
more reasonable to provide the network with the most semantically rich information we have. Also
note that we normalize l, c, p : N →[ 0; 1] ⊂ R, i.e., the positional values are encoded by real numbers
between 0 and 1 as is common for inputs to neural networks.
Intuitively, weightwise application calls N on every single weight of M and provides the
weight’s value and some information on where in the network the weight lives. N then outputs a
new value for that respective weight. After calling N for | (cid:5)M| = | (cid:5)O| times, we have a new output
net O. This approach is most similar to the one used by Chang and Lipson (2018).
Reduction 2 (Aggregating Application). Let agga : Ra → R be an aggregator function taking in an arbi-
trary number of parameters a. Let deagga : R → Ra be a de-aggregating function returning an arbitrary number
of outputs a. Let (cid:5)M = (cid:7)vi(cid:8)
0≤i<| (cid:5)M|. Let
M ↓agg
b
= (cid:7)aggai
(vi, . . . , vi+ai−1)(cid:8)0≤i