FOKUS-FUNKTION: - Am MIT spezialisierte KI-Forschung

FOKUS-FUNKTION:
Konnektivität, Cognition, and Consciousness

Thalamocortical contribution to flexible
learning in neural systems

Mien Brabeeba Wang1,2 and Michael M. Halassa1

1Department of Brain and Cognitive Science, Massachusetts Institute of Technology, Cambridge, MA, USA
2Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, USA

Schlüsselwörter: Meta-learning, Credit assignment, Continual learning, Thalamocortical interactions,
Basal ganglia, Thalamus

Keine offenen Zugänge

Tagebuch

ABSTRAKT

Animal brains evolved to optimize behavior in dynamic environments, flexibly selecting
actions that maximize future rewards in different contexts. A large body of experimental work
indicates that such optimization changes the wiring of neural circuits, appropriately mapping
environmental input onto behavioral outputs. A major unsolved scientific question is how
optimal wiring adjustments, which must target the connections responsible for rewards, can be
accomplished when the relation between sensory inputs, action taken, and environmental
context with rewards is ambiguous. The credit assignment problem can be categorized into
context-independent structural credit assignment and context-dependent continual learning. In
this perspective, we survey prior approaches to these two problems and advance the notion
that the brain’s specialized neural architectures provide efficient solutions. Within this
Rahmen, the thalamus with its cortical and basal ganglia interactions serves as a systems-
level solution to credit assignment. Speziell, we propose that thalamocortical interaction is
the locus of meta-learning where the thalamus provides cortical control functions that
parametrize the cortical activity association space. By selecting among these control functions,
the basal ganglia hierarchically guide thalamocortical plasticity across two timescales to
enable meta-learning. The faster timescale establishes contextual associations to enable
behavioral flexibility, while the slower one enables generalization to new contexts.

ZUSAMMENFASSUNG DES AUTORS

Deep learning has shown great promise over the last decades, allowing artificial neural
networks to solve difficult tasks. The key to success is the optimization process by which task
errors are translated to connectivity patterns. A major unsolved question is how the brain
optimally adjusts the wiring of neural circuits to minimize task error analogously. In our
Perspektive, we advance the notion that the brain’s specialized architecture is part of the
solution and spell out a path towards its theoretical, rechnerisch, and experimental testing.
Speziell, we propose that the interaction between the cortex, thalamus, and basal ganglia
induces plasticity in two timescales to enable flexible behaviors. The faster timescale
establishes contextual associations to enable behavioral flexibility, while the slower one
enables generalization to new contexts.

Zitat: Wang, M. B., & Halassa, M. M.
(2022). Thalamocortical contribution to
flexible learning in neural systems.
Netzwerkneurowissenschaften, 6(4), 980–997.
https://doi.org/10.1162/netn_a_00235

DOI:
https://doi.org/10.1162/netn_a_00235

Erhalten: 26 September 2021
Akzeptiert: 19 Januar 2022

Konkurrierende Interessen: Die Autoren haben
erklärte, dass keine konkurrierenden Interessen bestehen
existieren.

Korrespondierender Autor:
Michael M. Halassa
mhalassa@mit.edu

Handling-Editor:
Randy McIntosh

Urheberrechte ©: © 2022
Massachusetts Institute of Technology
Veröffentlicht unter Creative Commons
Namensnennung 4.0 International
(CC BY 4.0) Lizenz

Die MIT-Presse

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Reward prediction error:
A quantity represented by the
difference between the expected
reward and actual reward.

Credit assignment:
A computational problem to
determine which stimulus, Aktion,
internal states, and context lead to
outcome.

Continual learning:
A computational problem to learn
tasks sequentially to both learn new
tasks faster and not forget old tasks.

EINFÜHRUNG

Learning to flexibly choose appropriate actions in uncertain environments is a hallmark of
intelligence (Müller & Cohen, 2001; Niv, 2009; Thorndike, 2017). When animals explore unfa-
miliar environments, they tend to reinforce actions that lead to unexpected rewards. A com-
mon notion in contemporary neuroscience is that such behavioral reinforcement emerges from
changes in synaptic connectivity, where synapses that contribute to the unexpected reward are
strengthened (Abbott & Nelson, 2000; Bliss & Lomo, 1973; Dayan & Abbott, 2005; Hebb,
2002; Whittington & Bogacz, 2019). A prominent model for connecting synaptic to behav-
ioral reinforcement is dopaminergic innervation of basal ganglia (BG), where dopamine
(DA) carries the reward prediction error (RPE) signals to guide synaptic learning (Bamford,
Wightman, & Sulzer, 2018; Bayer & Glimcher, 2005; Montague, Dayan, & Sejnowski,
1996; Schultz, Dayan, & Montague, 1997). This circuit motif is thought to implement a basic
form of the reinforcement learning algorithm (Houk, Davis, & Beiser, 1994; Morris, Nevet,
Arkadir, Vaadia, & Bergman, 2006; Roesch, Calu, & Schoenbaum, 2007; Suri & Schultz,
1999; R. Sutton & Barto, 2018; R. S. Sutton & Barto, 1990; Wickens & Kotter, 1994), welche
has had much success in explaining simple Pavlovian and instrumental conditioning (Ikemoto
& Panksepp, 1999; Niv, 2009; R. Sutton & Barto, 2018; R. S. Sutton & Barto, 1990). Jedoch,
it is unclear how this circuit can reinforce the appropriate connections in complex natural
environments where animals need to dynamically map sensory inputs to different action in
a context-dependent way. If one naively credits all synapses with the RPE signals, the learning
will be highly inefficient since different cues, contexts, and actions contribute to the RPE sig-
nals differently. To properly credit the cues, Kontext, and actions that lead to unexpected
reward is a challenging problem, known as the credit assignment problem (Lillicrap, Santoro,
Marris, Akerman, & Hinton, 2020; Minsky, 1961; Rumelhart, Hinton, & Williams, 1986;
Whittington & Bogacz, 2019).

One can roughly categorize the credit assignment into context-independent structural
credit assignment and context-dependent continual learning. In structural credit assignment,
animals may make decisions in a multi-cue environment and should be able to credit those
cues that contribute to the rewarding outcome. Ähnlich, if actions are being chosen based
on internal decision variables, then the underlying activity states must also be reinforced. In
such cases, neurons that are selective to external cues or internal latent variables need to
adjust their downstream connectivity based on its contribution of their downstream targets to
the RPE. This is a challenging computation to implement because, for upstream neurons, Die
RPE will be dependent on downstream neurons that are several connections away. Für
Beispiel, a sensory neuron needs to know the action chosen in the motor cortex to selec-
tively credit the sensory synapses that contribute to the action. In continual learning, Tiere
not only need to appropriately credit the sensory cues and actions that lead to the reward
but also need to credit the sensorimotor combination in the right context to retain the
behaviors learned from different contexts and even to generalize to novel contexts. Dort-
Vordergrund, animals can continually learn and generalize across different contexts while retaining
behaviors in familiar contexts. Zum Beispiel, when one is in the United States, one learns to
first look left before crossing the street, whereas in the United Kingdom, one learns to look
right instead. Jedoch, after spending time in the United Kingdom, someone from the
United States should not unlearn the behavior of looking left first when they return home
because their brain ought to properly assign the credit to a different context. Außerdem,
once one learns how to cross the street in the United States, it is much easier to learn how
to cross the street in the United Kingdom because the brain flexibly generalize behaviors
across contexts.

Netzwerkneurowissenschaften

981

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Backpropagation:
An algorithm to compute the error
gradient of an artificial neural
network through chain rules.

In this perspective, we will first go over common approaches from machine learning to
tackle these two credit assignment problems. Dabei, we highlight the challenge in their
efficient implementation within biological neural circuits. We also highlight some recent pro-
posals that advance the notion of specialized neural hardware that approximate more general
solutions for credit assignment (Fiete & Seung, 2006; Ketz, Morkonda, & O’Reilly, 2013;
Kornfeld et al., 2020; Kusmierz, Isomura, & Toyoizumi, 2017; Lillicrap, Cownden, Tweed,
& Akerman, 2016; Liu, Schmied, Mihalas, Shea-Brown, & Sümbül, 2020; O’Reilly, 1996;
O’Reilly, Russin, Zolfaghar, & Rohrlich, 2021; Richards & Lillicrap, 2019; Roelfsema &
Holtmaat, 2018; Roelfsema & van Ooyen, 2005; Sacramento, Ponte Costa, Bengio, & Senn,
2018; Schiess, Urbanczik, & Senn, 2016; Zenke & Ganguli, 2018). In diesem Sinne, we pro-
pose an efficient systems-level solution involving the thalamus and its interaction with the
cortex and BG for these two credit assignment problems.

COMMON MACHINE LEARNING APPROACHES TO CREDIT ASSIGNMENT

One solution to structural credit assignment in machine learning is backpropagation (Rumelhart
et al., 1986). Backpropagation recursively computes the vector-valued error signal for synapses
based on their contribution to the error signal. There is much empirical success of backpropa-
gation in surpassing human performance in supervised learning such as image recognition
(Er, Zhang, Ren, & Sun, 2016; Krizhevsky, Sutskever, & Hinton, 2012) and reinforcement
learning such as playing the game of Go and Atari (Mnih et al., 2015; Schrittwieser et al.,
2020; Silver et al., 2016; Silver et al., 2017). Zusätzlich, comparing artificial networks trained
with backpropagation with neural responses from the ventral visual stream of nonhuman pri-
mates shows comparable internal representations (Cadieu et al., 2014; Yamins et al., 2014).
Despite its empirical success in superhuman-level performance and matching the internal
representation of actual brains, backpropagation may not be straightforward to implement
in biological neural circuits, as we explain below.

In its most basic form, backpropagation requires symmetric connections between neurons
(forward and backward connections). Mathematically, we can write down the backpropaga-
tion in Equation 1:

δWi ∝

∂E
∂Wi

D
¼ eif ai−1

Þ⊤;

ei ¼ W T

iþ1eiþ1 ∘ f 0 aið

Þ;

(1)

E is the total error, ei is the vector error at layer i, Wi is the synaptic weight connecting layer i − 1
to layer i, and f is the nonlinearity. Intuitively, this is saying that the change of synaptic weight Wi
is computed by a Hebbian learning rule between backpropagation error ei and activity from last
layer f(ai−1), while the backpropagation error is computed by backpropagating the error in the
next layer through symmetric feedback weights W ⊤
iþ1. Wichtig, in this algorithm, error sig-
nals do not alter the activity of neurons in the preceding layers and instead operate indepen-
dently from the feedforward activity. Jedoch, such arrangement is not observed in the brain;
symmetric connections across neurons are not a universal feature of circuit organization, Und
biological neurons may encode both feedforward inputs and errors through changes in spike
output (changes in activity; Crick, 1989; Richards & Lillicrap, 2019). daher, it is hard to
imagine how the basic form of backpropagation (symmetry and error/activity separation) Ist
physically implemented in the brain.

Netzwerkneurowissenschaften

982

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Catastrophic forgetting:
A phenomenon in which the network
forgets about the previous tasks upon
learning new tasks.

Außerdem, while an animal can continually learn to behave across different contexts,
artificial neural networks trained by backpropagation struggle to learn and remember different
tasks in different contexts: a problem known as catastrophic forgetting (French, 1999; Kemker,
McClure, Abitino, Hayes, & Kanan, 2018; Kumaran, Hassabis, & McClelland, 2016; McCloskey
& Cohen, 1989; Parisi, Kemker, Teil, Kanan, & Wermter, 2019). Speziell, dieses Problem
occurs when the tasks are trained sequentially because the weights optimized for former tasks
will be modified to fit the later tasks. One of the common solutions is to interleave the tasks
from different contexts to jointly optimize performance across contexts by using an episodic
memory system and replay mechanism (Kumaran et al., 2016; McClelland, McNaughton, &
O’Reilly, 1995). This approach has received empirical success in artificial neural networks,
including learning to play many Atari games (Mnih et al., 2015; Schrittwieser et al., 2020).
Jedoch, since one needs to store past training data in memory to replay during learning, Das
approach demands a high computational overhead and can be is inefficient as the number of
the contexts increases. Andererseits, humans and animals acquire diverse sensorimotor
skills in different contexts throughout their life span: a feat that cannot be solely explained by
memory replay (M. M. Murray, Lewkowicz, Amedi, & Wallace, 2016; Parisi et al., 2019;
Power & Schlaggar, 2017; Zenke, Gerstner, & Ganguli, 2017). daher, biological neural
circuits are likely to employ other solutions to continual learning in addition to memory replay.

daher, to solve these two credit assignment problems in the brain, one needs to seek
different solutions. One of the pitfalls of backpropagation is that it is a general algorithm that
works on any architecture. Jedoch, actual brains are collections of specialized hardware put
together in a specialized way. It can be conceived that through clever coordination between
different cell types and different circuits, the brains can solve the credit assignment problem by
leveraging its specialized architectures. Along this line of ideas, many investigators have pro-
posed cellular (Fiete & Seung, 2006; Kornfeld et al., 2020; Kusmierz et al., 2017; Liu et al.,
2020; Richards & Lillicrap, 2019; Sacramento et al., 2018; Schiess et al., 2016) and circuit-level
mechanisms (Lillicrap et al., 2016; O’Reilly, 1996; Roelfsema & Holtmaat, 2018; Roelfsema &
van Ooyen, 2005) to assign credit appropriately. In this perspective, we would like to advance
the notion that the specialized hardware arrangement also happens at the system level and pro-
pose that the thalamus and its interaction with basal ganglia and the cortex serve as a system-
level solution for these three types of credit assignment.

A PROPOSAL: THALAMOCORTICAL–BASAL GANGLIA INTERACTIONS ENABLE
META-LEARNING TO SOLVE CREDIT ASSIGNMENT

To motivate the notion of thalamocortical–basal ganglia interactions being a potential solution
for credit assignment, we will start with a brief introduction. The cortex, thalamus, and basal
ganglia are the three major components of the mammalian forebrain—the part of the brain to
which high-level cognitive capacities are attributed to (Alexander, DeLong, & Strick, 1986;
Badre, Kayser, & D'Esposito, 2010; Cox & Witten, 2019; Makino, Hwang, Hedrick, &
Komiyama, 2016; Müller, 2000; Müller & Cohen, 2001; Niv, 2009; Seo, Lee, & Averbeck,
2012; Wolff & Vann, 2019). Each of these components has its specialized internal architec-
tures; the cortex is dominated by excitatory neurons with extensive lateral connectivity profiles
(Fuster, 1997; Rakic, 2009; Singer, Sejnowski, & Rakic, 2019), the thalamus is grossly divided
into different nuclei harboring mostly excitatory neurons devoid of lateral connections (Harris
et al., 2019; Jones, 1985; Sherman & Guillery, 2005), and the basal ganglia are a series of
inhibitory structures driven by excitatory inputs from the cortex and thalamus (Gerfen &
Bolam, 2010; Lanciego, Luquin, & Obeso, 2012; Nambu, 2011) (Figur 1). A popular view
within system neuroscience stipulates that BG and the cortex underwent different learning

Netzwerkneurowissenschaften

983

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Figur 1. Distinct architectures of cortex, thalamus, and basal ganglia. Cortex is largely composed
of excitatory neurons with extensive recurrent connectivity. Thalamus consists of mostly excitatory
neurons without lateral connections. Basal ganglia consist of mostly inhibitory neurons driven by
cortical and thalamic inputs, and the corticostriatal plasticity is modulated by dopamine.

paradigms, where BG is involved in reinforcement learning while the cortex is involved in
unsupervised learning (Doya, 1999, 2000). Speziell, the input structure of the basal ganglia
known as the striatum is thought to be where reward gated plasticity takes place to implement
reinforcement learning (Bamford et al., 2018; Cox & Witten, 2019; Hikosaka, Kim, Yasuda, &
Yamamoto, 2014; Kornfeld et al., 2020; Niv, 2009; Perrin & Venance, 2019). One such evi-
dence is the high temporal precision of DA activity in the striatum. To accurately attribute the
action that leads to positive RPE, DA is released into the relevant corticostriatal synapses.
Jedoch, DA needs to disappear quickly to prevent the next stimulus-response combination
from being reinforced. In the striatum, this elimination process is carried out by dopamine
active transporter (DAT) to maintain a high temporal resolution of DA activity on a timescale
of around 100 ms–1 s to support reinforcement learning (Cass & Gerhardt, 1995; Ciliax et al.,
1995; Garris & Wightman, 1994). Im Gegensatz, although the cortex also has dopaminergic
innervation, cortical DAT expression is low and therefore DA levels may change at a timescale
that is too slow to support reinforcement learning (Cass & Gerhardt, 1995; Garris & Wightman,
1994; Lapish, Kroener, Durstewitz, Lavin, & Seamans, 2007; Seamans & Robbins, 2010) Aber
instead supports other processes related to learning (Badre et al., 2010; Müller & Cohen, 2001).
Tatsächlich, ample evidence indicates that cortical structures undergo Hebbian-like long-term
potentiation (LTP) and long-term depression (LTD; Cooke & Bear, 2010; Feldman, 2009;
Kirkwood, Rioult, & Bear, 1996). Jedoch, despite the unsupervised nature of these processes,
cortical representations are task-relevant and include appropriate sensorimotor mappings that
lead to rewards (Allen et al., 2017; Donahue & Lee, 2015; Enel, Wallis, & Reich, 2020; Jacobs &
Moghaddam, 2020; Petersen, 2019; Tsutsui, Hosokawa, Yamada, & Iijima, 2016). How could
this arise from an unsupervised process? One possible explanation is that basal ganglia acti-
vate the appropriate cortical neurons during behaviors and the cortical network collectively
consolidates high-reward sensorimotor mappings via Hebbian-like learning (Andalman & Fee,
2009; Ashby, Ennis, & Spiering, 2007; Hélie, Ell, & Ashby, 2015; Tesileanu, Olveczky, &
Balasubramanian, 2017; Warren, Tumer, Charlesworth, & Brainard, 2011). Previous computa-
tional accounts of this process have emphasized a consolidation function for the cortex in this
Verfahren, which naively would beg the question of why duplicate a process that seems to func-
tion well in the basal ganglia and perhaps include a lot of details of the associated experience?

Netzwerkneurowissenschaften

984

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Meta-learning:
A learning paradigm in which a
network learns how to learn more
efficiently.

Figur 2. Two views of learning in the cortex. (A) One possible view is that the Hebbian cortical
plasticity consolidates the sensorimotor mapping from BG to learn a stimulus-action mapping at =
F(st). (B) We propose that thalamocortical systems perform meta-learning by consolidating the
teaching signals from BG to learn a context-dependent mapping at = fc(st), where the context c is
computed by past stimulus history and represented by different thalamic activities.

The answer to this question is the core of our proposal. We propose that the learning pro-
cess is not a duplication, but instead that the reinforcement process in the basal ganglia selects
thalamic control functions that subsequently activate cortical associations to allow flexible
mappings across different contexts (Figur 2).

To understand this proposition, we need to take a closer look at the involvement of these
distinct network elements in task learning. Learning in basal ganglia happens in corticostriatal
synapses where the basic form of reinforcement learning is implemented. Speziell, the coac-
tivation of sensory and motor cortical inputs generates eligibility traces in corticostriatal synap-
ses that get captured by the presence or absence of DA (Fee & Goldberg, 2011; Fiete, Fee, &
Seung, 2007; Kornfeld et al., 2020). This reinforcement learning algorithm is fast at acquiring
simple associations but slow at generalization to other behaviors. Andererseits, the cortical
plasticity operates in a much slower timescale but seems to allow flexible behaviors and fast
generalization (Kim, Johnson, Cilles, & Gold, 2011; Mante, Sussillo, Shenoy, & Newsome,
2013; Müller, 2000; Müller & Cohen, 2001). How does the cortex exhibit slow synaptic plasticity
and flexible behaviors at the same time? An explanatory framework is meta-learning (Botvinick
et al., 2019; Wang et al., 2018), where the flexibility arises from network dynamics and the
generalization emerges from slow synaptic plasticity across different contexts. Mit anderen Worten,
synaptic plasticity stores a higher order association between contexts and sensorimotor associ-
ations while the network dynamics switches between different sensorimotor associations based
on this higher order association. Jedoch, properly arbitrating between synaptic plasticity and
network dynamics to store such higher order association is a nontrivial task (Sohn, Meirhaeghe,
Rajalingham, & Jazayeri, 2021). We propose that the thalamocortical system learns these
Dynamik, where the thalamus provides control nodes that parametrize the cortical activity asso-
ciation space. Basal ganglia inputs to the thalamus learn to select between these different control
Knoten, directly implementing the interface between weight adjustment and dynamical controls.
Our proposal rests on the following three specific points.

Erste, building on a line of the literature that shows diverse thalamocortical interaction in
sensory, cognitive, and motor cortex, we propose that thalamic output may be described as
control functions over cortical computations. These control functions can be purely in the
sensory domain like attentional filtering, in the cognitive domain like manipulating working
Erinnerung, or in the motor domain like preparation for movement (Bolkan et al., 2017; W. Guo,
Clause, Barth-Maron, & Polley, 2017; Z. V. Guo et al., 2017; Mukherjee et al., 2020; Rikhye,
Gilra, & Halassa, 2018; Saalmann & Kastner, 2015; Schmitt et al., 2017; Tanaka, 2007;
Wimmer et al., 2015; Zhou, Schafer, & Desimone, 2016). These functions directly relate

Netzwerkneurowissenschaften

985

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

thalamic activity patterns to different cortical dynamical regimes and thus offer a way to estab-
lish higher order association between context and sensorimotor mapping within the thalamo-
cortical pathways. Zweite, based on previous studies on direct and indirect BG pathways that
influence most cortical regions (Hunnicutt et al., 2016; Jiang & Kim, 2018; Nakajima, Schmitt,
& Halassa, 2019; Peters, Fabre, Steinmetz, Harris, & Carandini, 2021), we propose that BG
hierarchically selects these thalamic control functions to influence activities of the cortex
toward rewarding behavioral outcomes. zuletzt, we propose that thalamocortical structure con-
solidates the selection of BG through a two-timescale Hebbian learning process to enable
meta-learning. Speziell, the faster corticothalamic plasticity learns the higher order associ-
ation that enables flexible contextual switching with different thalamic patterns (Marton,
Seifikar, Luongo, Lee, & Sohal, 2018; Rikhye et al., 2018), while the slower cortical plasticity
learns the shared representations that allow generalization to new behaviors. Below, we will
go over the supporting literature that leads us to this proposal.

MORE GENERAL ROLES OF THALAMOCORTICAL INTERACTION AND
BASAL GANGLIA

Classical literature has emphasized the role of the thalamus in transmitting sensory inputs to
the cortex. This is because some of the better studied thalamic pathways are those connected
to sensors on one end and primary cortical areas on another (Hubel & Wiesel, 1961; Lien &
Scanziani, 2018; Reinagel, Godwin, Sherman, & Koch, 1999; Sherman & Spear, 1982; Usrey,
Alonso, & Reid, 2000). From that perspective, thalamic neurons being devoid of lateral
connection transmit their inputs (z.B., from the retina in the case of the lateral geniculate
nucleus, LGN) to the primary sensory cortex ( V1 in this same example case), and the input
transformation (center-surround to oriented edges) occurs within the cortex (Hoffmann, Stein,
& Sherman, 1972; Hubel & Wiesel, 1962; Lien & Scanziani, 2018; Usrey et al., 2000). In vielen
Fälle, these formulations of thalamic “relay” have generalized to how motor and cognitive
thalamocortical interactions may be operating. Jedoch, in contrast to the classical relay view
of the thalamus, more recent studies have shown diverse thalamic functions in sensory, cog-
nitive, and motor processing (Bolkan et al., 2017; W. Guo et al., 2017; Z. V. Guo et al., 2017;
Rikhye et al., 2018; Saalmann & Kastner, 2015; Schmitt et al., 2017; Tanaka, 2007; Wimmer
et al., 2015; Zhou et al., 2016). For example in mice, sensory thalamocortical transmission can
be adjusted based on prefrontal cortex (PFC)-dependent, top-down biasing signals transmitted
through nonclassical basal ganglia pathways involving the thalamic reticular nucleus (TRN;
Nakajima et al., 2019; Phillips, Kambi, & Saalmann, 2016; Wimmer et al., 2015). Interessant,
these task-relevant PFC signals themselves require long-range interactions with the associative
mediodorsal (MD) thalamus to be initiated, maintained, and flexibly switched (Rikhye et al.,
2018; Schmitt et al., 2017; Wimmer et al., 2015). One can also observe nontrivial control
functions in the motor thalamus. Motor preparatory activities in the anterior motor cortex
(ALM) show persistent activities that predicted future actions. Interessant, the motor thalamus
also shows similar preparatory activities that predict future actions and by optogenetically
manipulating the motor thalamus activities, the persistent activities in ALM quickly diminished
(Z. V. Guo et al., 2017). Kürzlich, Mukherjee, Lam, Wimmer, and Halassa (2021) discovered
two cell types within MD thalamus differentially modulate the cortical evidence accumulation
dynamics depending on whether the evidence is conflicting or sparse to boost the signal-to-
noise ratio in decision-making. Based on the above studies, we propose that the thalamus
provides a set of control functions to the cortex. Speziell, cortical computations may be
flexibly switched to different dynamical modes by activating a particular thalamic output that
corresponds to that mode.

Netzwerkneurowissenschaften

986

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Andererseits, the selective role of BG in motor and cognitive control also has dom-
inated the literature because thalamocortical–basal ganglia interaction is the most well studied
in frontal systems (Cox & Witten, 2019; Makino et al., 2016; McNab & Klingberg, 2008;
Monchi, Petride, Strafella, Worsley, & Doyon, 2006; Seo et al., 2012). Jedoch, classical
and contemporary studies have recognized that all cortical areas, including primary sensory
Bereiche, project to the striatum (Hunnicutt et al., 2016; Jiang & Kim, 2018; Peters et al., 2021).
Ähnlich, the basal ganglia can project to the more sensory parts of the thalamus through lesser
studied pathways to influence the sensory cortex (Hunnicutt et al., 2016; Nakajima et al.,
2019; Peters et al., 2021). Speziell, a nonclassical BG pathway projects to TRN, welche
in turn modulates the activities of LGN to influence sensory thalamocortical transmission
(Nakajima et al., 2019). Andererseits, it has also been argued that BG is involved in
gating working memory (McNab & Klingberg, 2008; Voytek & Ritter, 2010). This shows that
BG has a much more general role than classical action and action strategy selection. Dort-
Vordergrund, combining with our proposals on thalamic control functions, we propose that BG hier-
archically selects different thalamic control functions to influence all cortical areas in different
contexts through reinforcement learning.

Außerdem, there are series of the work that indicates the role of BG to guide plasticity in
thalamocortical structures (Andalman & Fee, 2009; Fiete et al., 2007; Hélie et al., 2015;
Mehaffey & Doupe, 2015; Tesileanu et al., 2017). Insbesondere, there is evidence that BG is
critical for the initial learning and less involved in the automatic behaviors once the behaviors
are learned across different species. In zebra finches, the lesion of BG in adult zebra finch has
little effect on song production, but the lesion of BG in juvenile zebra finch prevents the bird
from learning the song (Fee & Goldberg, 2011; Scharff & Nottebohm, 1991; Sohrabji,
Nordeen, & Nordeen, 1990). Similar patterns can be observed in people with Parkinson’s dis-
ease. Parkinson’s patients who have a reduction of DA and striatal defects have troubles in
solving procedural learning tasks but can produce automatic behaviors normally (Asmus,
Huber, Gasser, & Schöls, 2008; Soliveri, Braun, Jahanshahi, Caraceni, & Marsden, 1997;
Thomas-Ollivier et al., 1999). This behavioral evidence suggests that thalamocortical struc-
tures consolidate the learning from BG as the behaviors become more automatic. Außerdem,
on the synaptic level, a songbird learning circuit also demonstrates this cortical consolidation
motif (Mehaffey & Doupe, 2015; Tesileanu et al., 2017). In a zebra finch, the premotor nucleus
HVC (a proper name) projects to the motor nucleus robust nucleus of the arcopallium (RA) Zu
produce the song. Andererseits, RA also receives BG nucleus Area X mediated inputs
from the lateral nucleus of the medial nidopallium (LMAN). The latter pathway is believed to
be a locus of reinforcement learning in the songbird circuit. By burst stimulating both input
pathways in different time lags, one can discover that HVC-RA and LMAN-RA underwent
opposite plasticity (Mehaffey & Doupe, 2015). This suggests that the learning is gradually
transferred from LMAN-RA to HVC-RA pathway (Fee & Goldberg, 2011; Mehaffey & Doupe,
2015; Tesileanu et al., 2017). This indicates a general role of BG as the trainer for cortical
plasticity.

THE THALAMOCORTICAL STRUCTURE CONSOLIDATES THE BG SELECTIONS
ON THALAMIC CONTROL FUNCTIONS IN DIFFERENT TIMESCALES TO
ENABLE META-LEARNING

In diesem Abschnitt, in addition to BG’s role as the trainer for cortical plasticity, we further propose
that BG is the trainer in two different timescales for thalamocortical structures to enable meta-
learning. The faster timescale trainer trains the corticothalamic connections to select the

Netzwerkneurowissenschaften

987

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Figur 3. Two-timescale learning in thalamocortical structures. We propose that one can learn the
thalamocortical structure to enable meta-learning by applying the general network motif in two dif-
ferent timescales. Erste, one can learn the corticothalamic connections by applying the motif on the
blue loop with a faster timescale. This allows the network to consolidate flexible switching behav-
iors. Zweite, one can learn the cortical connections by applying the motif on the orange loop in a
slower timescale. This allows cortical neurons to develop a task-relevant shared representation that
can generalize across contexts.

appropriate thalamic control functions in different contexts, while the slower timescale trainer
trains the cortical connections to form a task-relevant and generalizable representation.

From the songbird example, we see how thalamocortical structures can consolidate sim-
ple associations learned through the basal ganglia. To enable meta-learning, we propose that
this general network consolidation motif operates over two different timescales within
thalamocortical–basal ganglia interactions (Figur 3). Erste, combining the idea of thalamic
outputs as control functions over cortical network activity patterns and the basal ganglia
selecting such functions, we frame learning in basal ganglia as a process that connects con-
textual associations (higher order) with the appropriate dynamical control that maximizes
reward at the sensorimotor level (lower order). Under this framing, corticothalamic plasticity
consolidates the higher order association within a fast timescale. This allows flexible switch-
ing between different thalamic control functions in different contexts. Andererseits, Die
cortical plasticity consolidates the sensorimotor association over a slow timescale to allow
shared representation that can generalize across different contexts. As the thalamocortical
structures learn the higher order association, the behaviors become less BG-dependent
and the network is able to switch between different thalamic control functions to induce
different sensorimotor mappings in different contexts. By having two learning timescales, ani-
mals can conceivably both adapt quickly in changing environments with fast learning of
corticothalamic connections and maintain the important information across the environment
in the cortical connections. One should note that this separation of timescales is indepen-
dent from different timescales across cortex (Gao, van den Brink, Pfeffer, & Voytek, 2020;
J. D. Murray et al., 2014). While different timescales across cortex allows animals to process
information differentially, the separation of corticothalmic and cortical plasticity allows the
thalamocortical system to learn the higher contextual association to modulate cortical
dynamics flexibly.

Some anatomical observations support this idea. The thalamostriatal neurons have a more
modulatory role to the cortical dynamics in a diffusive projection, while thalamocortical neu-
rons have a more driver role to the cortical dynamic in a topographically restricted dense pro-
jection (Sherman & Guillery, 2005). This indicates that thalamostriatal neurons might serve as
the role of control functions in the faster consolidation loop with the feedback to striatum to
conduct credit assignment. Andererseits, thalamocortical neurons might be more
involved in the slower consolidation loop with the feedback to striatum coming from the cor-
tex to train the common cortical representation across contexts.

Netzwerkneurowissenschaften

988

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Zusammenfassend, this two-timescale network consolidation scheme provides a general way for
BG to guide plasticity in the thalamocortical architecture to enable meta-learning and thus
solves structural credit assignment as a special case. In diesem Sinne, experimental evidence
supports the notion that when faced with multisensory inputs, the BG can selectively disinhibit
a modality-specific subnetwork of the thalamic reticular nucleus (TRN) to filter out the sensory
inputs that are not relevant to the behavior outcomes and thus solve the structural credit
assignment problem.

In the discussion above, we discuss our proposal under a general formulation of thalamic
control functions. In the next section, we will specify other thalamic control functions sug-
gested by recent studies and observe how they can solve continual learning under this
framework as well.

THE THALAMUS SELECTIVELY AMPLIFIES FUNCTIONAL CORTICAL CONNECTIVITY
AS A SOLUTION TO CONTINUAL LEARNING AND CATASTROPHIC FORGETTING

One of the pitfalls of the artificial neural network is catastrophic forgetting. If one trains an
artificial neural network on a sequence of tasks, the performance on the older task will quickly
deteriorate as the network learns the new task (French, 1999; Kemker et al., 2018; Kumaran
et al., 2016; McCloskey & Cohen, 1989; Parisi et al., 2019). Andererseits, the brain can
achieve continual learning, the ability to learn different tasks in different contexts without cat-
astrophic forgetting and even generalize the performance to novel context (Lewkowicz, 2014;
M. M. Murray et al., 2016; Power & Schlaggar, 2017; Zenke, Gerstner, & Ganguli, 2017).
There are three main approaches in machine learning to deal with catastrophic forgetting.
Erste, one can use the regularization method to mostly update the weights that are less impor-
tant to the prior tasks (Fernando et al., 2017; Jung, Ju, Jung, & Kim, 2018; Kirkpatrick et al.,
2017; Li & Hoiem, 2018; Maltoni & Lomonaco, 2019; Zenke, Poole, & Ganguli, 2017). Das
idea is inspired by experimental and theoretical studies on how synaptic information is selec-
tively protected in the brain (Benna & Fusi, 2016; Cichon & Gan, 2015; Fusi, Drew, & Abbott,
2005; Hayashi-Takagi et al., 2015; Yang, Pan, & Gan, 2009). Jedoch, it is unclear how to
biologically compute the importance of each synapse to prior tasks nor how to do global reg-
ularization locally. Zweite, one can also use a dynamic architecture in which the network
expands the architecture by allocating a subnetwork to train with the new information while
preserving old information (Cortes, Gonzalvo, Kuznetsov, Mohri, & Yang, 2017; Draelos et al.,
2017; Rusu et al., 2016; Xiao, Zhang, Yang, Peng, & Zhang, 2014). Jedoch, this type of
method is not scalable since the number of neurons needs to scale linearly with the number
of tasks. zuletzt, one can use a memory buffer to replay past tasks to avoid catastrophic forget-
ting by interleaving the experience of the past tasks with the experience of the present task
(Kemker & Kanan, 2018; Kumaran et al., 2016; McClelland et al., 1995; Schienbein, Lee, Kim, &
Kim, 2017). Jedoch, this type of method cannot be the sole solution, as the memory buffer
needs to scale linearly with the number of tasks and potentially the number of trials.

We propose that the thalamus provides another way to solve continual learning and cata-
strophic forgetting via selectively amplifying parts of the cortical connections in different con-
texts (Figur 4). Speziell, we propose that a population of thalamic neurons topographically
amplify the connectivity of cortical subnetworks as their control functions. During a behavioral
Aufgabe, BG selects subsets of the thalamus that selectively amplify the connectivity of cortical
subnetworks. Because of the reinforcement learning in BG, the subnetwork that is the most
relevant to the current task will be more preferentially activated and updated. By selecting only
the relevant subnetwork to activate in one context, the thalamus protects other subnetworks

Netzwerkneurowissenschaften

989

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Figur 4. A thalamocortical architecture with interaction with BG for continual learning. Während
task execution, BG selects thalamic neurons that amplify the relevant cortical subnetwork. This pro-
tects other parts of the network that are important for another context from being overwritten. Wann
the other task comes, BG selects other thalamic neurons and since the synapses are protected from
the last task, animals can freely switch from different tasks without forgetting the previous tasks.
Außerdem, as the corticothalamic synapses learn how to select the right thalamic neurons in a
different context (blue dashed line), task execution can become less BG dependent.

that can have useful information in another context from being overwritten. The corticothala-
mic structures can then consolidate these BG-guided flexible switching behaviors via our
proposed network motif, and the switching becomes less BG-dependent. Außerdem, unser
proposed solution has implications on generalization as well. Different tasks can have princi-
ples in common that can be transferred. Zum Beispiel, although the rules of chess and Go are
very different, players in both games all need to predict what the other players are going to do
and counterattack based on the prediction. Since BG selects the subnetwork at each hierarchy
that is most relevant to the current tasks, in addition to selecting different subnetworks to pre-
vent catastrophic forgetting, BG can also select subnetworks that are beneficial to both tasks as
well to achieve generalization. daher, the cortex can develop a modular hierarchical rep-
resentation of the world that can be easily generalized.

The idea of protecting relevant information from the past tasks to be overwritten has been
applied before computationally and has decent success in combating catastrophic forgetting in
deep learning (Kirkpatrick et al., 2017). Experimentally, we also have found that thalamic neu-
rons selectively amplify the cortical connectivity to solve the continual learning problem. In einem
task where the mice need to switch between different sets of task cues that guided the attention
to the visual or auditory target, the performance of the mice does not deteriorate much after
switching to the original context, which is an indication of continual learning (Rikhye et al.,
2018). Through electrophysiological recording of PFC and mediodorsal thalamic nucleus
(MD) Neuronen, we discovered that PFC neurons preferentially code for the rule of the attention,
while MD neurons preferentially code for the contexts of different sets of the cues. Thalamic
neurons that encode the task-relevant context translate this neural representation into the
amplification of cortical activity patterns associated with that context (despite the fact that cor-
tical neurons themselves only encode the context implicitly). These experimental observations
are consistent with our proposed solution: By incorporating the thalamic population that can
selectively amplify connectivity of cortical subnetworks, the thalamus and its interaction with
cortex and BG solve the continual learning problem and prevent catastrophic forgetting.

Netzwerkneurowissenschaften

990

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

CONCLUSION

Zusammenfassend, in contrast to the traditional relay view of the thalamus, we propose that thala-
mocortical interaction is the locus of meta-learning where the thalamus provides cortical con-
trol functions, such as sensory filtering, working memory gating, or motor preparation, Das
parametrize the cortical activity association space. Außerdem, we propose a two-timescale
learning consolidation framework in which BG hierarchically selects these thalamic control
functions to enable meta-learning, solving the credit assignment problem. The faster plasticity
learns contextual associations to enable rapid behavioral flexibility, while the slower plasticity
establishes cortical representation that generalizes. By considering the recent observation of
the thalamus selectively amplifying functional cortical connectivity, the thalamocortical–basal
ganglia network is able to flexibly learn context-dependent associations without catastrophic
forgetting while generalizing to the new contexts. This modular account of the thalamocortical
interaction may seem to be in contrast with the recent proposed dynamical perspectives
(Barack & Krakauer, 2021) on thalamocortical interaction in which the thalamus shapes and
constrains the cortical attractor landscapes (Shine, 2021). We would like to argue that both
the modular and the dynamical perspectives are compatible with our proposal. The crux of
the perspectives is that the thalamus provides control functions that parametrize cortical
Dynamik, and these control functions can be of modular nature or of dynamical nature
depending on their specific input-output connectivity. Flexible behaviors can be induced by
selecting either the control functions that amplify the appropriate cortical subnetworks or those
that adjust the cortical dynamics to the appropriate regimes.

BEITRÄGE DES AUTORS

Mien Wang: Konzeptualisierung; Untersuchung; Methodik; Writing – original draft; Writing –
Rezension & Bearbeitung. Michael M. Halassa: Konzeptualisierung; Akquise von Fördermitteln; Methodik;
Aufsicht; Writing – review & Bearbeitung.

FUNDING INFORMATION

Michael M. Halassa, National Institute of Mental Health (https://dx.doi.org/10.13039
/100000025), Award ID: 5R01MH120118-02.

VERWEISE

Abbott, L. F., & Nelson, S. B. (2000). Synaptic plasticity: Taming the
beast. Naturneurowissenschaften, 3, 1178–1183. https://doi.org/10
.1038/81453, PubMed: 11127835

Alexander, G. E., DeLong, M. R., & Strick, P. L. (1986). Parallel
organization of functionally segregated circuits linking basal
ganglia and cortex. Jahresrückblick auf die Neurowissenschaften, 9, 357–381.
https://doi.org/10.1146/annurev.ne.09.030186.002041, PubMed:
3085570

Allen, W. E., Kauvar, ICH. V., Chen, M. Z., Richman, E. B., Yang, S. J.,
Chan, K., … Deisseroth, K. (2017). Global representations of
goal-directed behavior in distinct cell types of mouse neocortex.
Neuron, 94(4), 891–907. https://doi.org/10.1016/j.neuron.2017
.04.017, PubMed: 28521139

Andalman, A. S., & Fee, M. S. (2009). A basal ganglia-forebrain
circuit in the songbird biases motor output to avoid vocal errors.
Verfahren der Nationalen Akademie der Wissenschaften, 106(30),

12518–12523. https://doi.org/10.1073/pnas.0903214106,
PubMed: 19597157

Ashby, F. G., Ennis, J. M., & Spiering, B. J. (2007). A neurobiological
theory of automaticity in perceptual categorization. Psychologi-
cal Review, 114(3), 632–656. https://doi.org/10.1037/0033-295X
.114.3.632, PubMed: 17638499

Asmus, F., Huber, H., Gasser, T., & Schöls, L. (2008). Kick and rush:
Paradoxical kinesia in Parkinson disease. Neurologie, 71(9), 695.
https://doi.org/10.1212/01.wnl.0000324618.88710.30, PubMed:
18725599

Badre, D., Kayser, A. S., & D'Esposito, M. (2010). Frontal cortex
and the discovery of abstract action rules. Neuron, 66(2),
315–326. https://doi.org/10.1016/j.neuron.2010.03.025,
PubMed: 20435006

Bamford, N. S., Wightman, R. M., & Sulzer, D. (2018). Dopamine’s
effects on corticostriatal synapses during reward-based

Netzwerkneurowissenschaften

991

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

behaviors. Neuron, 97(3), 494–510. https://doi.org/10.1016/j
.neuron.2018.01.006, PubMed: 29420932

Barack, D. L., & Krakauer, J. W. (2021). Two views on the cognitive
Gehirn. Nature Reviews Neurowissenschaften, 22(6), 359–371. https://doi
.org/10.1038/s41583-021-00448-6, PubMed: 33859408

Bayer, H. M., & Glimcher, P. W. (2005). Midbrain dopamine neurons
encode a quantitative reward prediction error signal. Neuron,
47(1), 129–141. https://doi.org/10.1016/j.neuron.2005.05.020,
PubMed: 15996553

Benna, M. K., & Fusi, S. (2016). Computational principles of synaptic
memory consolidation. Naturneurowissenschaften, 19(12), 1697–1706.
https://doi.org/10.1038/nn.4401, PubMed: 27694992

Bliss, T. V., & Lomo, T. (1973). Long-lasting potentiation of synaptic
transmission in the dentate area of the anaesthetized rabbit
following stimulation of the perforant path. Journal of Physiology,
232(2), 331–356. https://doi.org/10.1113/jphysiol.1973
.sp010273, PubMed: 4727084

Bolkan, S. S., Stujenske, J. M., Parnaudeau, S., Spellman, T. J.,
Rauffenbart, C., Abbas, A. ICH., … Kellendonk, C. (2017). Thalamic
projections sustain prefrontal activity during working memory
Wartung. Naturneurowissenschaften, 20(7), 987–996. https://doi
.org/10.1038/nn.4568, PubMed: 28481349

Botvinick, M., Ritter, S., Wang, J. X., Kurth-Nelson, Z., Blundell, C.,
& Hassabis, D. (2019). Reinforcement learning, fast and slow.
Trends in den Kognitionswissenschaften, 23(5), 408–422. https://doi.org/10
.1016/j.tics.2019.02.006, PubMed: 31003893

Cadieu, C. F., Hong, H., Yamins, D. L. K., Pinto, N., Ardila, D.,
Solomon, E. A., … DiCarlo, J. J. (2014). Deep neural networks
rival the representation of primate IT cortex for core visual object
recognition. PLoS Computational Biology, 10(12), 1–18. https://
doi.org/10.1371/journal.pcbi.1003963, PubMed: 25521294
Cass, W. A., & Gerhardt, G. A. (1995). In vivo assessment of dopa-
mine uptake in rat medial prefrontal cortex: Comparison with
dorsal striatum and nucleus accumbens. Journal of Neurochem-
istry, 65(1), 201–207. https://doi.org/10.1046/j.1471-4159.1995
.65010201.X, PubMed: 7790861

Cichon, J., & Gan, W. B. (2015). Branch-specific dendritic Ca(2+)
spikes cause persistent synaptic plasticity. Natur, 520(7546),
180–185. https://doi.org/10.1038/nature14251, PubMed:
25822789

Ciliax, B. J., Heilman, C., Demchyshyn, L. L., Pristupa, Z. B., Ince, E.,
Hersch, S. M., … Levey, A. ICH. (1995). The dopamine transporter:
Immunochemical characterization and localization in brain.
Zeitschrift für Neurowissenschaften, 15(3 Pt. 1), 1714–1723. https://doi.org
/10.1523/JNEUROSCI.15-03-01714.1995, PubMed: 7534339
Cooke, S. F., & Bear, M. F. (2010). Visual experience induces
long-term potentiation in the primary visual cortex. Zeitschrift für
Neurowissenschaften, 30(48), 16304–16313. https://doi.org/10.1523
/JNEUROSCI.4333-10.2010, PubMed: 21123576

Cortes, C., Gonzalvo, X., Kuznetsov, V., Mohri, M., & Yang, S.
(2017). AdaNet: Adaptive structural learning of artificial neural
Netzwerke. In Proceedings of the 34th international conference
on machine learning (Bd. 70, S. 874–883). Retrieved from
https://proceedings.mlr.press/v70/cortes17a.html

Cox, J., & Witten, ICH. B. (2019). Striatal circuits for reward learning
and decision-making. Nature Reviews Neurowissenschaften, 20(8),
482–494. https://doi.org/10.1038/s41583-019-0189-2, PubMed:
31171839

Crick, F. (1989). The recent excitement about neural networks.
Natur, 337(6203), 129–132. https://doi.org/10.1038/337129a0,
PubMed: 2911347

Dayan, P., & Abbott, L. F. (2005). Theoretical neuroscience:
Computational and mathematical modeling of neural systems.
MIT Press.

Donahue, C. H., & Lee, D. (2015). Dynamic routing of task-
relevant signals for decision making in dorsolateral prefrontal
Kortex. Naturneurowissenschaften, 18(2), 295–301. https://doi.org/10
.1038/nn.3918, PubMed: 25581364

Doya, K. (1999). What are the computations of the cerebellum, Die
basal ganglia and the cerebral cortex? Neural Networks, 12(7–8),
961–974. https://doi.org/10.1016/S0893-6080(99)00046-5,
PubMed: 12662639

Doya, K. (2000). Complementary roles of basal ganglia and cere-
bellum in learning and motor control. Current Opinion in Neu-
robiology, 10(6), 732–739. https://doi.org/10.1016/S0959-4388
(00)00153-7, PubMed: 11240282

Draelos, T. J., Miner, N. E., Lamb, C. C., Cox, J. A., Vineyard, C. M.,
Carlson, K. D., … Aimone, J. B. (2017). Neurogenesis deep learn-
ing: Extending deep networks to accommodate new classes. In
2017 international joint conference on neural networks (IJCNN)
(S. 526–533). https://doi.org/10.1109/IJCNN.2017.7965898
Enel, P., Wallis, J. D., & Reich, E. L. (2020). Stable and dynamic
representations of value in the prefrontal cortex. eLife, 9, e54313.
https://doi.org/10.7554/eLife.54313, PubMed: 32628108

Fee, M. S., & Goldberg, J. H. (2011). A hypothesis for basal ganglia–
dependent reinforcement learning in the songbird. Neurowissenschaften,
198, 152–170. https://doi.org/10.1016/j.neuroscience.2011.09
.069, PubMed: 22015923

Feldman, D. E. (2009). Synaptic mechanisms for plasticity in neocortex.
Jahresrückblick auf die Neurowissenschaften, 32, 33–55. https://doi.org/10.1146
/annurev.neuro.051508.135516, PubMed: 19400721

Fernando, C., Banarse, D., Blundell, C., Zwols, Y., Ha, D., Rusu,
A. A., … Wierstra, D. (2017). Pathnet: Evolution channels gradi-
ent descent in super neural networks. CoRR, abs/1701.08734.
Retrieved from https://arxiv.org/abs/1701.08734. https://doi.org
/10.48550/arXiv.1701.08734

Fiete, ICH. R., Fee, M. S., & Seung, H. S. (2007). Model of birdsong
learning based on gradient estimation by dynamic perturbation
of neural conductances. Journal of Neurophysiology, 98(4),
2038–2057. https://doi.org/10.1152/jn.01311.2006, PubMed:
17652414

Fiete, ICH. R., & Seung, H. S. (2006). Gradient learning in spiking neu-
ral networks by dynamic perturbation of conductances. Physical
Review Letters, 97, 048104. https://doi.org/10.1103/PhysRevLett
.97.048104, PubMed: 16907616

French, R. M. (1999). Catastrophic forgetting in connectionist net-
funktioniert. Trends in den Kognitionswissenschaften, 3(4), 128–135. https://doi
.org/10.1016/S1364-6613(99)01294-2, PubMed: 10322466

Fusi, S., Drew, P. J., & Abbott, L. F. (2005). Cascade models of syn-
aptically stored memories. Neuron, 45(4), 599–611. https://doi
.org/10.1016/j.neuron.2005.02.001, PubMed: 15721245

Fuster, J. (1997). The prefrontal cortex: Anatomy, physiology, Und
neuropsychology of the frontal lobe. Lippincott-Raven. Retrieved
from https://books.google.com/books?id=YupqAAAAMAAJ

Gao, R., van den Brink, R. L., Pfeffer, T., & Voytek, B. (2020). Neu-
ronal timescales are functionally dynamic and shaped by cortical

Netzwerkneurowissenschaften

992

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

microarchitecture. eLife, 9, e61277. https://doi.org/10.7554/eLife
.61277, PubMed: 33226336

Garris, P. A., & Wightman, R. M. (1994). Different kinetics govern
dopaminergic transmission in the amygdala, prefrontal cortex,
and striatum: An in vivo voltammetric study. Journal of Neurosci-
enz, 14(1), 442–450. https://doi.org/10.1523/ JNEUROSCI.14
-01-00442.1994, PubMed: 8283249

Gerfen, C., & Bolam, J. (2010). The neuroanatomical organization
of the basal ganglia. Handbook of Behavioral Neuroscience, 20,
3–28. https://doi.org/10.1016/B978-0-12-374767-9.00001-9
Guo, W., Clause, A. R., Barth-Maron, A., & Polley, D. B. (2017). A
corticothalamic circuit for dynamic switching between feature
detection and discrimination. Neuron, 95(1), 180–194. https://
doi.org/10.1016/j.neuron.2017.05.019, PubMed: 28625486
Guo, Z. V., Inagaki, H. K., Daie, K., Druckmann, S., Gerfen, C. R., &
Svoboda, K. (2017). Maintenance of persistent activity in a fron-
tal thalamocortical loop. Natur, 545(7653), 181–186. https://doi
.org/10.1038/nature22324, PubMed: 28467817

Harris, J. A., Mihalas, S., Hirokawa, K. E., Whitesell, J. D., Choi, H.,
Bernard, A., … Zeng, H. (2019). Hierarchical organization of cor-
tical and thalamic connectivity. Natur, 575(7781), 195–202.
https://doi.org/10.1038/s41586-019-1716-z, PubMed: 31666704
Hayashi-Takagi, A., Yagishita, S., Nakamura, M., Shirai, F., Wu,
Y. ICH., Loshbaugh, A. L., … Kasai, H. (2015). Labelling and optical
erasure of synaptic memory traces in the motor cortex. Natur,
525(7569), 333–338. https://doi.org/10.1038/nature15257,
PubMed: 26352471

Er, K., Zhang, X., Ren, S., & Sun, J. (2016). Deep residual learning
for image recognition. In 2016 IEEE conference on computer
vision and pattern recognition (CVPR) (S. 770–778). https://doi
.org/10.1109/CVPR.2016.90

Hebb, D. (2002). The organization of behavior: A neuropsycholog-
ical theory. Taylor & Francis. Retrieved from https://books.google
.com/books?id=gUtwMochAI8C

Hikosaka, O., Kim, H. F., Yasuda, M., & Yamamoto, S. (2014). Basal
ganglia circuits for reward value-guided behavior. Jährlich
Review of Neuroscience, 37, 289–306. https://doi.org/10.1146
/annurev-neuro-071013-013924, PubMed: 25032497

Hoffmann, K. P., Stein, J., & Sherman, S. M. (1972). Relay of
receptive-field properties in dorsal lateral geniculate nucleus of
the cat. Journal of Neurophysiology, 35(4), 518–531. https://doi
.org/10.1152/jn.1972.35.4.518, PubMed: 4338566

Houk, J. C., Davis, J. L., & Beiser, D. G. (1994). Adaptive critics and
the basal ganglia. In Models of information processing in the
basal ganglia (S. 215–232). MIT Press. https://doi.org/10.7551
/mitpress/4708.003.0018

Hubel, D. H., & Wiesel, T. N. (1961). Integrative action in the cat’s
lateral geniculate body. Journal of Physiology, 155, 385–398.
https://doi.org/10.1113/jphysiol.1961.sp006635, PubMed:
13716436

Hubel, D. H., & Wiesel, T. N. (1962). Receptive fields, binocular
interaction and functional architecture in the cat’s visual cortex.
Journal of Physiology, 160, 106–154. https://doi.org/10.1113
/jphysiol.1962.sp006837, PubMed: 14449617

Hunnicutt, B. J., Jongbloets, B. C., Birdsong, W. T., Gertz, K. J.,
Zhong, H., & Mao, T. (2016). A comprehensive excitatory input
map of the striatum reveals novel functional organization. eLife, 5,
e19103. https://doi.org/10.7554/eLife.19103, PubMed: 27892854

Hélie, S., Ell, S. W., & Ashby, F. G. (2015). Learning robust
cortico-cortical associations with the basal ganglia: An integra-
tive review. Kortex, 64, 123–135. https://doi.org/10.1016/j
.cortex.2014.10.011, PubMed: 25461713

Ikemoto, S., & Panksepp, J. (1999). The role of nucleus accumbens
dopamine in motivated behavior: A unifying interpretation with
special reference to reward-seeking. Brain Research Reviews, 31(1),
6–41. https://doi.org/10.1016/S0165-0173(99)00023-5, PubMed:
10611493

Jacobs, D. S., & Moghaddam, B. (2020). Prefrontal cortex represen-
tation of learning of punishment probability during reward-
motivated actions. Zeitschrift für Neurowissenschaften, 40(26), 5063–5077.
https://doi.org/10.1523/ JNEUROSCI.0310-20.2020, PubMed:
32409619

Jiang, H., & Kim, H. F. (2018). Anatomical inputs from the sensory
and value structures to the tail of the rat striatum. Grenzen in
Neuroanatomy, 12, 30. https://doi.org/10.3389/fnana.2018
.00030, PubMed: 29773980

Jones, E. G. (Ed.). (1985). The thalamus. Springer US. https://doi.org

/10.1007/978-1-4615-1749-8

Jung, H., Ju, J., Jung, M., & Kim, J. (2018). Less-forgetful learning for
domain expansion in deep neural networks. In AAAI conference
on artificial intelligence. Retrieved from https://www.aaai.org/ocs
/index.php/AAAI/AAAI18/paper/view/17073

Kemker, R., & Kanan, C. (2018). FearNet: Brain-inspired model for
incremental learning. In International conference on learning rep-
resentations. Retrieved from https://openreview.net/forum?id
=SJ1Xmf-Rb

Kemker, R., McClure, M., Abitino, A., Hayes, T., & Kanan, C.
(2018). Measuring catastrophic forgetting in neural networks. In
AAAI conference on artificial intelligence. Retrieved from https://
aaai.org/ocs/index.php/AAAI/AAAI18/paper/view/16410

Ketz, N., Morkonda, S. G., & O’Reilly, R. C. (2013). Theta coordi-
nated error-driven learning in the hippocampus. PLoS Computa-
tional Biology, 9(6), 1–9. https://doi.org/10.1371/journal.pcbi
.1003067, PubMed: 23762019

Kim, C., Johnson, N. F., Cilles, S. E., & Gold, B. T. (2011). Common
and distinct mechanisms of cognitive flexibility in prefrontal cor-
tex. Zeitschrift für Neurowissenschaften, 31(13), 4771–4779. https://doi.org
/10.1523/JNEUROSCI.5923-10.2011, PubMed: 21451015

Kirkpatrick, J., Pascanu, R., Rabinowitz, N., Veness, J., Gardens,
G., Rusu, A. A., … Hadsell, R. (2017). Overcoming catastrophic
forgetting in neural networks. Proceedings of the National Acad-
emy of Sciences, 114(13), 3521–3526. https://doi.org/10.1073
/pnas.1611835114, PubMed: 28292907

Kirkwood, A., Rioult, M. C., & Bear, M. F. (1996). Experience-
dependent modification of synaptic plasticity in visual cortex.
Natur, 381(6582), 526–528. https://doi.org/10.1038/381526a0,
PubMed: 8632826

Kornfeld, J., Januszewski, M., Schubert, P., Jain, V., Denk, W., &
Fee, M. (2020). An anatomical substrate of credit assignment in
reinforcement learning. bioRxiv. https://doi.org/10.1101/2020.02
.18.954354

Krizhevsky, A., Sutskever, ICH., & Hinton, G. E. (2012). ImageNet classi-
fication with deep convolutional neural networks. In Advances in
neural information processing systems (Bd. 25). Curran Associates,
Inc. Retrieved from https://proceedings.neurips.cc/paper/2012
/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf

Netzwerkneurowissenschaften

993

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Kumaran, D., Hassabis, D., & McClelland, J. L. (2016). Was
learning systems do intelligent agents need? Complementary
learning systems theory updated. Trends in den Kognitionswissenschaften,
20(7), 512–534. https://doi.org/10.1016/j.tics.2016.05.004,
PubMed: 27315762

Kusmierz, L., Isomura, T., & Toyoizumi, T. (2017). Learning with
three factors: modulating Hebbian plasticity with errors. Current
Opinion in Neurobiology, 46, 170–177. https://doi.org/10.1016/j
.conb.2017.08.020, PubMed: 28918313

Lanciego, J. L., Luquin, N., & Obeso, J. A. (2012). Functional neu-
roanatomy of the basal ganglia. Cold Spring Harbor Perspectives
in Medicine, 2(12), a009621. https://doi.org/10.1101
/cshperspect.a009621, PubMed: 23071379

Lapish, C. C., Kroener, S., Durstewitz, D., Lavin, A., & Seamans,
J. K. (2007). The ability of the mesocortical dopamine system
to operate in distinct temporal modes. Psychopharmacology,
191(3), 609–625. https://doi.org/10.1007/s00213-006-0527-8,
PubMed: 17086392

Lewkowicz, D. J. (2014). Early experience and multisensory
perceptual narrowing. Developmental Psychobiology, 56(2),
292–315. https://doi.org/10.1002/dev.21197, PubMed:
24435505

Li, Z., & Hoiem, D. (2018). Learning without forgetting. IEEE Trans-
actions on Pattern Analysis and Machine Intelligence, 40(12),
2935–2947. https://doi.org/10.1109/ TPAMI.2017.2773081,
PubMed: 29990101

Lien, A. D., & Scanziani, M. (2018). Cortical direction selectivity
emerges at convergence of thalamic synapses. Natur, 558(7708),
80–86. https://doi.org/10.1038/s41586-018-0148-5, PubMed:
29795349

Lillicrap, T. P., Cownden, D., Tweed, D. B., & Akerman, C. J.
(2016). Random synaptic feedback weights support error back-
propagation for deep learning. Nature Communications, 7,
13276. https://doi.org/10.1038/ncomms13276, PubMed:
27824044

Lillicrap, T. P., Santoro, A., Marris, L., Akerman, C. J., & Hinton, G.
(2020). Backpropagation and the brain. Nature Reviews Neuro-
Wissenschaft, 21(6), 335–346. https://doi.org/10.1038/s41583-020
-0277-3, PubMed: 32303713

Liu, Y. H., Schmied, S., Mihalas, S., Shea-Brown, E., & Sümbül, U.
(2020). A solution to temporal credit assignment using
cell-type-specific modulatory signals. bioRxiv. https://doi.org/10
.1101/2020.11.22.393504

Makino, H., Hwang, E. J., Hedrick, N. G., & Komiyama, T. (2016).
Circuit mechanisms of sensorimotor learning. Neuron, 92(4),
705–721. https://doi.org/10.1016/j.neuron.2016.10.029,
PubMed: 27883902

Maltoni, D., & Lomonaco, V. (2019). Continuous learning in
single-incremental-task scenarios. Neural Networks, 116,
56–73. https://doi.org/10.1016/j.neunet.2019.03.010, PubMed:
31005851

Mante, V., Sussillo, D., Shenoy, K. V., & Newsome, W. T. (2013).
Context-dependent computation by recurrent dynamics in pre-
frontal cortex. Natur, 503(7474), 78–84. https://doi.org/10
.1038/nature12742, PubMed: 24201281

Marton, T. F., Seifikar, H., Luongo, F. J., Lee, A. T., & Sohal, V. S.
(2018). Roles of prefrontal cortex and mediodorsal thalamus
in task engagement and behavioral flexibility. Zeitschrift für

Neurowissenschaften, 38(10), 2569–2578. https://doi.org/10.1523
/JNEUROSCI.1728-17.2018, PubMed: 29437889

McClelland, J. L., McNaughton, B. L., & O’Reilly, R. C. (1995).
Why there are complementary learning systems in the hippo-
campus and neocortex: Insights from the successes and failures
of connectionist models of learning and memory. Psychological
Rezension, 102(3), 419–457. https://doi.org/10.1037/0033-295X
.102.3.419, PubMed: 7624455

McCloskey, M., & Cohen, N. J. (1989). Catastrophic interference
in connectionist networks: The sequential learning problem.
In G. H. Bower (Ed.), Psychology of learning and motivation
(Bd. 24, S. 109–165). Academic Press. https://doi.org/10.1016
/S0079-7421(08)60536-8

McNab, F., & Klingberg, T. (2008). Prefrontal cortex and basal
ganglia control access to working memory. Naturneurowissenschaften,
11(1), 103–107. https://doi.org/10.1038/nn2024, PubMed:
18066057

Mehaffey, W. H., & Doupe, A. J. (2015). Naturalistic stimulation
drives opposing heterosynaptic plasticity at two inputs to song-
bird cortex. Naturneurowissenschaften, 18(9), 1272–1280. https://doi
.org/10.1038/nn.4078, PubMed: 26237364

Müller, E. K. (2000). The prefontral cortex and cognitive control.
Nature Reviews Neurowissenschaften, 1(1), 59–65. https://doi.org/10
.1038/35036228, PubMed: 11252769

Müller, E. K., & Cohen, J. D. (2001). An integrative theory of pre-
frontal cortex function. Jahresrückblick auf die Neurowissenschaften, 24,
167–202. https://doi.org/10.1146/annurev.neuro.24.1.167,
PubMed: 11283309

Minsky, M. (1961). Steps toward artificial intelligence. Verfahren
of the IRE, 49(1), 8–30. https://doi.org/10.1109/ JRPROC.1961
.287775

Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A. A., Veness, J.,
Bellemare, M. G., … Hassabis, D. (2015). Human-level control
through deep reinforcement learning. Natur, 518(7540),
529–533. https://doi.org/10.1038/nature14236, PubMed:
25719670

Monchi, O., Petride, M., Strafella, A. P., Worsley, K. J., & Doyon, J.
(2006). Functional role of the basal ganglia in the planning and
execution of actions. Annals of Neurology, 59(2), 257–264.
https://doi.org/10.1002/ana.20742, PubMed: 16437582

Montague, P. R., Dayan, P., & Sejnowski, T. J. (1996). A framework
for mesencephalic dopamine systems based on predictive
Hebbian learning. Zeitschrift für Neurowissenschaften, 16(5), 1936–1947.
https://doi.org/10.1523/ JNEUROSCI.16-05-01936.1996,
PubMed: 8774460

Morris, G., Nevet, A., Arkadir, D., Vaadia, E., & Bergman, H.
(2006). Midbrain dopamine neurons encode decisions for future
Aktion. Naturneurowissenschaften, 9(8), 1057–1063. https://doi.org/10
.1038/nn1743, PubMed: 16862149

Mukherjee, A., Bajwa, N., Lam, N. H., Porrero, C., Clasca, F., &
Halassa, M. M. (2020). Variation of connectivity across exemplar
sensory and associative thalamocortical loops in the mouse.
eLife, 9, e62554. https://doi.org/10.7554/eLife.62554, PubMed:
33103997

Mukherjee, A., Lam, N. H., Wimmer, R. D., & Halassa, M. M.
(2021). Thalamic circuits for independent control of prefrontal
signal and noise. Natur, 600(7887), 100–104. https://doi.org
/10.1038/s41586-021-04056-3, PubMed: 34614503

Netzwerkneurowissenschaften

994

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Murray, J. D., Bernacchia, A., Freedman, D. J., Romo, R., Wallis,
J. D., Cai, X., … Wang, X. J. (2014). A hierarchy of intrinsic
timescales across primate cortex. Naturneurowissenschaften, 17(12),
1661–1663. https://doi.org/10.1038/nn.3862, PubMed:
25383900

Murray, M. M., Lewkowicz, D. J., Amedi, A., & Wallace, M. T.
(2016). Multisensory processes: A balancing act across the
lifespan. Trends in den Neurowissenschaften, 39(8), 567–579. https://doi
.org/10.1016/j.tins.2016.05.003, PubMed: 27282408

Nakajima, M., Schmitt, L. ICH., & Halassa, M. M. (2019). Prefrontal
cortex regulates sensory filtering through a basal ganglia-to-
thalamus pathway. Neuron, 103(3), 445–458. https://doi.org/10
.1016/j.neuron.2019.05.026, PubMed: 31202541

Nambu, A. (2011). Somatotopic organization of the primate basal
ganglia. Frontiers in Neuroanatomy, 5, 26. https://doi.org/10
.3389/fnana.2011.00026, PubMed: 21541304

Niv, Y. (2009). Reinforcement learning in the brain. Zeitschrift für
Mathematical Psychology, 53(3), 139–154. https://doi.org/10
.1016/j.jmp.2008.12.005

O’Reilly, R. C. (1996). Biologically plausible error-driven learning
using local activation differences: The generalized recirculation
Algorithmus. Neural Computation, 8(5), 895–938. https://doi.org/10
.1162/neco.1996.8.5.895

O’Reilly, R. C., Russin, J. L., Zolfaghar, M., & Rohrlich, J. (2021).
Deep predictive learning in neocortex and pulvinar. Zeitschrift für
Cognitive Neuroscience, 33(6), 1158–1196. https://doi.org/10
.1162/jocn_a_01708, PubMed: 34428793

Parisi, G. ICH., Kemker, R., Teil, J. L., Kanan, C., & Wermter, S. (2019).
Continual lifelong learning with neural networks: Eine Rezension.
Neural Networks, 113, 54–71. https://doi.org/10.1016/j.neunet
.2019.01.012, PubMed: 30780045

Perrin, E., & Venance, L. (2019). Bridging the gap between striatal
plasticity and learning. Aktuelle Meinung in der Neurobiologie, 54,
104–112. https://doi.org/10.1016/j.conb.2018.09.007, PubMed:
30321866

Peters, A. J., Fabre, J. M. J., Steinmetz, N. A., Harris, K. D., &
Carandini, M. (2021). Striatal activity topographically reflects
cortical activity. Natur, 591, 420–425. https://doi.org/10.1038
/s41586-020-03166-8, PubMed: 33473213

Petersen, C. C. H. (2019). Sensorimotor processing in the rodent
barrel cortex. Nature Reviews Neurowissenschaften, 20(9), 533–546.
https://doi.org/10.1038/s41583-019-0200-y, PubMed: 31367018
Phillips, J. M., Kambi, N. A., & Saalmann, Y. B. (2016). A subcorti-
cal pathway for rapid, goal-driven, attentional filtering. Trends in
Neurosciences, 39(2), 49–51. https://doi.org/10.1016/j.tins.2015
.12.003, PubMed: 26743499

Power, J. D., & Schlaggar, B. L. (2017). Neural plasticity across the
lifespan. Wiley Interdisciplinary Reviews: Developmental Biology,
6(1), e216. https://doi.org/10.1002/wdev.216, PubMed: 27911497
Rakic, P. (2009). Evolution of the neocortex: A perspective from
developmental biology. Nature Reviews Neurowissenschaften, 10(10),
724–735. https://doi.org/10.1038/nrn2719, PubMed: 19763105
Reinagel, P., Godwin, D., Sherman, S. M., & Koch, C. (1999).
Encoding of visual information by LGN bursts. Journal of Neuro-
physiology, 81(5), 2558–2569. https://doi.org/10.1152/jn.1999
.81.5.2558, PubMed: 10322089

Richards, B. A., & Lillicrap, T. P. (2019). Dendritic solutions to the
credit assignment problem. Aktuelle Meinung in der Neurobiologie,

54, 28–36. https://doi.org/10.1016/j.conb.2018.08.003,
PubMed: 30205266

Rikhye, R. V., Gilra, A., & Halassa, M. M. (2018). Thalamic
regulation of switching between cortical representations enables
cognitive flexibility. Naturneurowissenschaften, 21(12), 1753–1763.
https://doi.org/10.1038/s41593-018-0269-z, PubMed:
30455456

Roelfsema, P. R., & Holtmaat, A. (2018). Control of synaptic plas-
ticity in deep cortical networks. Nature Reviews Neurowissenschaften,
19(3), 166–180. https://doi.org/10.1038/nrn.2018.6, PubMed:
29449713

Roelfsema, P. R., & van Ooyen, A. (2005). Attention-gated rein-
forcement learning of internal representations for classification.
Neural Computation, 17(10), 2176–2214. https://doi.org/10
.1162/0899766054615699, PubMed: 16105222

Roesch, M. R., Calu, D. J., & Schoenbaum, G. (2007). Dopamine
neurons encode the better option in rats deciding between
differently delayed or sized rewards. Naturneurowissenschaften,
10(12), 1615–1624. https://doi.org/10.1038/nn2013, PubMed:
18026098

Rumelhart, D. E., Hinton, G. E., & Williams, R. J. (1986). Learning
representations by back-propagating errors. Natur, 323(6088),
533–536. https://doi.org/10.1038/323533a0

Rusu, A. A., Rabinowitz, N. C., Gardens, G., Soyer, H., Kirkpatrick,
J., Kavukcuoglu, K., … Hadsell, R. (2016). Progressive neural
Netzwerke. CoRR, abs/1606.04671. Retrieved from https://arxiv
.org/abs/1606.04671. https://doi.org/10.48550/arXiv.1606.04671
Saalmann, Y. B., & Kastner, S. (2015). The cognitive thalamus. Fron-
tiers in Systems Neuroscience, 9, 39. https://doi.org/10.3389
/fnsys.2015.00039, PubMed: 25852498

Sacramento, J., Ponte Costa, R., Bengio, Y., & Senn, W. (2018).
Dendritic cortical microcircuits approximate the backpropaga-
tion algorithm. In Advances in neural information processing sys-
Systeme (Bd. 31, S. 8735–8746). Curran Associates, Inc. Retrieved
f r o m h t t p s : / / p r o c e e d i n g s . n e u r i p s . c c / p a p e r / 2 0 1 8 / f i l e
/1dc3a89d0d440ba31729b0ba74b93a33-Paper.pdf

Scharff, C., & Nottebohm, F. (1991). A comparative study of the
behavioral deficits following lesions of various parts of the zebra
finch song system: Implications for vocal learning. Zeitschrift für
Neurowissenschaften, 11(9), 2896–2913. https://doi.org/10.1523
/JNEUROSCI.11-09-02896.1991, PubMed: 1880555

Schiess, M., Urbanczik, R., & Senn, W. (2016). Somato-dendritic
synaptic plasticity and error-backpropagation in active dendrites.
PLoS Computational Biology, 12(2), 1–18. https://doi.org/10
.1371/journal.pcbi.1004638, PubMed: 26841235

Schmitt, L. ICH., Wimmer, R. D., Nakajima, M., Happ, M., Mofakham,
S., & Halassa, M. M. (2017). Thalamic amplification of cortical
connectivity sustains attentional control. Natur, 545(7653),
219–223. https://doi.org/10.1038/nature22073, PubMed:
28467827

Schrittwieser, J., Antonoglou, ICH., Hubert, T., Simonyan, K., Sifre, L.,
Schmitt, S., … Silver, D. (2020). Mastering Atari, Go, chess and
shogi by planning with a learned model. Natur, 588(7839),
604–609. https://doi.org/10.1038/s41586-020-03051-4,
PubMed: 33361790

Schultz, W., Dayan, P., & Montague, P. R. (1997). A neural substrate
of prediction and reward. Wissenschaft, 275(5306), 1593–1599. https://
doi.org/10.1126/science.275.5306.1593, PubMed: 9054347

Netzwerkneurowissenschaften

995

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Seamans, J. K., & Robbins, T. W. (2010). Dopamine modulation of
the prefrontal cortex and cognitive function. In The dopamine
receptors (S. 373–398). Totowa, NJ: Humana Press. https://doi
.org/10.1007/978-1-60327-333-6_14

Seo, M., Lee, E., & Averbeck, B. B. (2012). Action selection and
action value in frontal-striatal circuits. Neuron, 74(5), 947–960.
https://doi.org/10.1016/j.neuron.2012.03.037, PubMed:
22681697

Sherman, S. M., & Guillery, R. W. (2005). Exploring the thalamus

and its role in cortical function (2nd ed.). MIT Press.

Sherman, S. M., & Spear, P. D. (1982). Organization of visual path-
ways in normal and visually deprived cats. Physiological
Rezensionen, 62(2), 738–855. https://doi.org/10.1152/physrev.1982
.62.2.738, PubMed: 6280221

Schienbein, H., Lee, J. K., Kim, J., & Kim, J. (2017). Continual learning with
deep generative replay. In Advances in neural information
processing systems (Bd. 30). Curran Associates, Inc. Retrieved
f r o m h t t p s : / / p r o c e e d i n g s . n e u r i p s . c c / p a p e r / 2 0 1 7 / f i l e
/0efbe98067c6c73dba1250d2beaa81f9-Paper.pdf

Shine, J. M. (2021). The thalamus integrates the macrosystems of
the brain to facilitate complex, adaptive brain network dynamics.
Fortschritte in der Neurobiologie, 199, 101951. https://doi.org/10.1016/j
.pneurobio.2020.101951, PubMed: 33189781

Silver, D., Huang, A., Maddison, C. J., Guez, A., Sifre, L., van den
Driessche, G., … Hassabis, D. (2016). Mastering the game of Go
with deep neural networks and tree search. Natur, 529(7587),
484–489. https://doi.org/10.1038/nature16961, PubMed:
26819042

Silver, D., Schrittwieser, J., Simonyan, K., Antonoglou, ICH., Huang,
A., Guez, A., … Hassabis, D. (2017). Mastering the game of Go
without human knowledge. Natur, 550(7676), 354–359. https://
doi.org/10.1038/nature24270, PubMed: 29052630

Singer, W., Sejnowski, T., & Rakic, P. (2019). The neocortex. MIT
Drücken Sie. Retrieved from https:// books.google.com/ books?id
=aL60DwAAQBAJ. https://doi.org/10.7551/mitpress/12593.001
.0001

Sohn, H., Meirhaeghe, N., Rajalingham, R., & Jazayeri, M. (2021).
A network perspective on sensorimotor learning. Trends in Neu-
rosciences, 44(3), 170–181. https://doi.org/10.1016/j.tins.2020
.11.007, PubMed: 33349476

Sohrabji, F., Nordeen, E. J., & Nordeen, K. W. (1990). Selective
impairment of song learning following lesions of a forebrain
nucleus in the juvenile zebra finch. Behavioral and Neural Biology,
53(1), 51–63. https://doi.org/10.1016/0163-1047(90)90797-A,
PubMed: 2302141

Soliveri, P., Braun, R. G., Jahanshahi, M., Caraceni, T., & Marsden,
C. D. (1997). Learning manual pursuit tracking skills in patients
with Parkinson’s disease. Gehirn, 120(Pt. 8), 1325–1337. https://
doi.org/10.1093/brain/120.8.1325, PubMed: 9278626

Suri, R. E., & Schultz, W. (1999). A neural network model with
dopamine-like reinforcement signal that learns a spatial delayed
response task. Neurowissenschaften, 91(3), 871–890. https://doi.org/10
.1016/S0306-4522(98)00697-6, PubMed: 10391468

Sutton, R., & Barto, A. (2018). Reinforcement learning: An introduc-
tion. MIT Press. Retrieved from https://books.google.com/books
?id=sWV0DwAAQBAJ

Sutton, R. S., & Barto, A. G. (1990). Time-derivative models of
Pavlovian reinforcement. In Learning and computational

neuroscience: Foundations of adaptive networks (S. 497–537).
MIT Press.

Tanaka, M. (2007). Cognitive signals in the primate motor thala-
mus predict saccade timing. Zeitschrift für Neurowissenschaften, 27(44),
12109–12118. https://doi.org/10.1523/ JNEUROSCI.1873-07
.2007, PubMed: 17978052

Tesileanu, T., Olveczky, B., & Balasubramanian, V. (2017). Rules
and mechanisms for efficient two-stage learning in neural
circuits. eLife, 6, e20944. https://doi.org/10.7554/eLife.20944,
PubMed: 28374674

Thomas-Ollivier, V., Reymann, J. M., Le Moal, S., Schück, S.,
Lieury, A., & Allain, H. (1999). Procedural memory in recent-
onset Parkinson’s disease. Dementia and Geriatric Cognitive
Disorders, 10(2), 172–180. https://doi.org/10.1159/000017100,
PubMed: 10026393

Thorndike, E. (2017). Animal intelligence: Experimental studies.
Taylor & Francis. Retrieved from https:// books.google.com
/ b o o k s ? i d = 1 _ h A D w A A Q B A J . h t t p s : / / d o i . o r g / 1 0 . 4 3 2 4
/9781351321044

Tsutsui, K., Hosokawa, T., Yamada, M., & Iijima, T. (2016). Repre-
sentation of functional category in the monkey prefrontal cortex
and its rule-dependent use for behavioral selection. Zeitschrift für
Neurowissenschaften, 36(10), 3038–3048. https://doi.org/10.1523
/JNEUROSCI.2063-15.2016, PubMed: 26961957

Usrey, W. M., Alonso, J. M., & Reid, R. C. (2000). Synaptic interac-
tions between thalamic inputs to simple cells in cat visual cortex.
Zeitschrift für Neurowissenschaften, 20(14), 5461–5467. https://doi.org/10
.1523/JNEUROSCI.20-14-05461.2000, PubMed: 10884329
Voytek, B., & Ritter, R. T. (2010). Prefrontal cortex and basal gan-
glia contributions to visual working memory. Verfahren der
Nationale Akademie der Wissenschaften, 107(42), 18167–18172. https://
doi.org/10.1073/pnas.1007277107, PubMed: 20921401

Wang, J. X., Kurth-Nelson, Z., Kumaran, D., Tirumala, D., Soyer,
H., Leibo, J. Z., … Botvinick, M. (2018). Prefrontal cortex as a
meta-reinforcement learning system. Naturneurowissenschaften,
21(6), 860–868. https://doi.org/10.1038/s41593-018-0147-8,
PubMed: 29760527

Warren, T. L., Tumer, E. C., Charlesworth, J. D., & Brainard, M. S.
(2011). Mechanisms and time course of vocal learning and
consolidation in the adult songbird. Journal of Neurophysiology,
106(4), 1806–1821. https://doi.org/10.1152/jn.00311.2011,
PubMed: 21734110

Whittington, J. C. R., & Bogacz, R. (2019). Theories of error
back-propagation in the brain. Trends in den Kognitionswissenschaften,
23(3), 235–250. https://doi.org/10.1016/j.tics.2018.12.005,
PubMed: 30704969

Wickens, J. R., & Kotter, R. (1994). Cellular models of reinforce-
ment. In Models of information processing in the basal ganglia.
MIT Press. https://doi.org/10.7551/mitpress/4708.003.0017

Wimmer, R. D., Schmitt, L. ICH., Davidson, T. J., Nakajima, M.,
Deisseroth, K., & Halassa, M. M. (2015). Thalamic control of
sensory selection in divided attention. Natur, 526(7575),
705–709. https://doi.org/10.1038/nature15398, PubMed:
26503050

Wolff, M., & Vann, S. D. (2019). The cognitive thalamus as a gate-
way to mental representations. Zeitschrift für Neurowissenschaften, 39(1),
3–14. https://doi.org/10.1523/ JNEUROSCI.0479-18.2018,
PubMed: 30389839

Netzwerkneurowissenschaften

996

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
N
e
N
A
R
T
ich
C
e
–
P
D

F
/

6
4
9
8
0
2
0
5
6
2
6
0
N
e
N
_
A
_
0
0
2
3
5
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Thalamocortical contribution to flexible learning in neural systems

Xiao, T., Zhang, J., Yang, K., Peng, Y., & Zhang, Z. (2014). Error-
driven incremental learning in deep convolutional neural net-
work for large-scale image classification. In ACM multimedia.
https://doi.org/10.1145/2647868.2654926

Yamins, D. L., Hong, H., Cadieu, C. F., Solomon, E. A., Seibert, D.,
& DiCarlo, J. J. (2014). Performance-optimized hierarchical
models predict neural responses in higher visual cortex. Proceed-
ings of the National Academy of Sciences, 111(23), 8619–8624.
https://doi.org/10.1073/pnas.1403112111, PubMed: 24812127
Yang, G., Pan, F., & Gan, W. B. (2009). Stably maintained dendritic
spines are associated with lifelong memories. Natur, 462(7275),
920–924. https://doi.org/10.1038/nature08577, PubMed: 19946265
Zenke, F., & Ganguli, S. (2018). SuperSpike: Supervised learning in
multilayer spiking neural networks. Neural Computation, 30(6),

1514–1541. https://doi.org/10.1162/neco_a_01086, PubMed:
29652587

Zenke, F., Gerstner, W., & Ganguli, S. (2017). The temporal para-
dox of Hebbian learning and homeostatic plasticity. Current
Opinion in Neurobiology, 43, 166–176. https://doi.org/10.1016
/j.conb.2017.03.015, PubMed: 28431369

Zenke, F., Poole, B., & Ganguli, S. (2017). Continual learning
through synaptic intelligence. In Proceedings of the 34th interna-
tional conference on machine learning (Bd. 70, S. 3987–3995).
Retrieved from https://proceedings.mlr.press/v70/zenke17a.html
Zhou, H., Schafer, R. J., & Desimone, R. (2016). Pulvinar-cortex
interactions in vision and attention. Neuron, 89(1), 209–220.
https://doi.org/10.1016/j.neuron.2015.11.034, PubMed:
26748092