Can the Brain Strategically Go on Automatic Pilot?

The Effect of If–Then Planning on
Behavioral Flexibility

Tim van Timmeren1,2, John P. O’Doherty3, Nadza Dzinalija4, and Sanne de Wit1

Abstract

People often have good intentions but fail to adhere to them.
Implementation intentions, a form of strategic planning, can help
people to close this intention–behavior gap. Their effectiveness has
been proposed to depend on the mental formation of a stimulus–response
association between a trigger and target behavior, thereby creating an
“instant habit.” If implementation intentions do indeed lead to
reliance on habitual control, then this may come at the cost of reduced
behavioral flexibility. Furthermore, we would expect a shift from
recruitment of corticostriatal brain regions implicated in
goal-directed control toward habit regions. To test these ideas, we
performed an fMRI study in which participants received instrumental
training supported by either implementation or goal intentions,
followed by an outcome revaluation to test reliance on habitual versus
goal-directed control. We found that implementation intentions led to
increased efficiency early in training, as reflected by higher
accuracy, faster RT, and decreased anterior caudate engagement.
However, implementation intentions did not reduce behavioral
flexibility when goals changed during the test phase, nor did they
affect the underlying corticostriatal pathways. In addition, this study
showed that “slips of action” toward devalued outcomes are associated
with reduced activity in brain regions implicated in goal-directed
control (ventromedial prefrontal cortex and lateral orbitofrontal
cortex) and increased activity of the fronto-parietal salience network
(including the insula, dorsal anterior cingulate cortex, and SMA).
Taken together, our behavioral and neuroimaging findings suggest that
strategic if–then planning does not lead to a shift from goal-directed
toward habitual control.

Introduction

At the start of the new year, many people reflect on their future plans
and form resolutions. However, they often fail to put their good
intentions into practice (Sheeran & Webb, 2016). Strategic “if–then”
plans, also known as implementation intentions, are an effective way to
support the translation of intentions into actions. For example,
instead of formulating an abstract plan such as “I want to lose
weight,” an implementation intention links the intended action to a
specific cue or situation, for example, “If I get home, I will eat an
apple,” thereby enhancing the probability of success. Indeed, many
studies have shown that implementation intentions support behavior
change better than goal intentions that merely specify the intended
action or outcome (Gollwitzer & Sheeran, 2006). In addition to
increasing attention to the relevant cues, the effectiveness of if–then
planning has been proposed to rely on creating a strong associative
link between the stimulus (S) in the if-part (“home”) and the response
(R) in the then-part (“eat an apple”), in a manner akin to habits
acquired through behavioral repetition (Dickinson, 1985; Thorndike,
1911). These mentally formed S–R associations may allow for automatic
action initiation (Gollwitzer, 2014), a process often referred to as
strategic automaticity or “instant habits” (Gollwitzer, 1993, 1999,
2014).

1University of Amsterdam, The Netherlands, 2Utrecht University, The
Netherlands, 3California Institute of Technology, Pasadena, 4Amsterdam
UMC, Location VUmc, The Netherlands

The notion that merely using a verbal action plan could be sufficient
to form a habit is fascinating, because a central assumption in
theories of habit formation is that this process critically depends on
behavioral repetition. Support for the idea that implementation
intentions accelerate habit formation comes from research showing that
they increase (self-reported) automaticity (Orbell & Verplanken, 2010;
Parks-Stamm, Gollwitzer, & Oettingen, 2007; Brandstätter, Lengfelder, &
Gollwitzer, 2001). Therefore, implementation intentions lead to
benefits in terms of efficient goal attainment (Gollwitzer, 2014;
Gollwitzer & Sheeran, 2006). However, habits developed through
behavioral repetition also come at a cost, namely, decreased behavioral
flexibility (Dickinson, 1985). The question arises, therefore, whether
the use of implementation intentions also leads to decreased
flexibility when goals change. This can be investigated using the
outcome-devaluation test, an experimental paradigm originally used in
rats (Adams & Dickinson, 1981) and later translated to humans (de Wit,
Corlett, Aitken, Dickinson, & Fletcher, 2009; de Wit, Niry, Wariyar,
Aitken, & Dickinson, 2007; Valentin, Dickinson, & O’Doherty, 2007). In
this task, participants first learn to make a response to obtain a
reward. Subsequently, the outcome associated with that response is
devalued, and the ability to flexibly adapt responding to this change
in outcome value is measured during an extinction test. Sensitivity to
outcome devaluation suggests that behavior is based on knowledge and
evaluation of its consequences, and is therefore under goal-directed
control. If implementation intentions lead to “instant habits,” then we
would predict reduced sensitivity to outcome devaluation, reflecting a
shift from goal-directed toward more rigid, habitual control (de Wit
et al., 2018; Balleine & O’Doherty, 2010).

© 2023 Massachusetts Institute of Technology. Published under a
Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Journal of Cognitive Neuroscience 35:6, pp. 957–975
https://doi.org/10.1162/jocn_a_01990

We have previously tested this hypothesis (van Timmeren & de Wit,
2022), using a computerized symmetrical outcome-revaluation task (SORT;
Watson, Gladwin, Verhoeven, & de Wit, 2022). Participants learn to make
a response (go) to certain ice cream vans to collect valuable ice
creams (and points) or to withhold a response (no-go) to other ice
cream vans delivering nonvaluable ice creams (and a reduction of
points). To investigate the effect of if–then planning, we instructed
them to use verbal implementation intentions for half of the stimuli
and goal intentions for the other half. In the subsequent test phase,
some outcome values changed (i.e., outcome revaluation). Whereas
participants should continue to respond according to the learned S–R
mappings on value-congruent trials (i.e., still-valuable and
still-not-valuable), they should flexibly adjust their behavior on
value-incongruent trials (i.e., devalued and upvalued). The results of
this previous study suggest that the use of implementation (compared
with goal) intentions facilitates instrumental learning, but also
impairs performance when some of the signaled outcome values change
during the test phase (van Timmeren & de Wit, 2022). This detrimental
effect of if–then planning was observed across value-congruent and
value-incongruent trials, suggesting that it was not mediated by
strengthened S–R associations (as this would have impacted the
value-incongruent trials specifically). Instead, this result may have
been driven by reduced goal-directed control. Investigating the neural
processes underlying implementation intentions may offer us a window on
the underlying (goal-directed vs. habitual) processes.

To this end, in the present study, we used fMRI to investigate the
neural correlates of if–then planning of instrumental responses on the
SORT. We capitalized on current insights regarding the neural basis of
goal-directed and habitual control to investigate the notion that
if–then planning gives rise to “instant habits.” Decades of animal
research have provided detailed insights into the neurobiology of
goal-directed and habitual actions, showing that they are causally
supported by anatomically distinct but interacting corticostriatal
systems (Balleine, 2019; Balleine & O’Doherty, 2010; Yin, Knowlton, &
Balleine, 2004). These findings are mirrored by (correlational)
neuroimaging evidence in humans, albeit less consistently.
Specifically, previous fMRI studies have found that goal-directed
control is supported by the ventromedial prefrontal cortex (vmPFC) and
caudate, whereas outcome-insensitive habitual actions depend on the
premotor cortex and posterior putamen/dorsal striatum (Watson, van
Wingen, & de Wit, 2018; Delorme et al., 2016; Morris, Quail, Griffiths,
Green, & Balleine, 2015; de Wit et al., 2012; Tricomi, Balleine, &
O’Doherty, 2009; Valentin et al., 2007).

The present study is the first fMRI investigation with the SORT, and we
will therefore start by specifying our predictions regarding the
general pattern of neural activity independent of intentions. First, we
expected that over the course of training (i.e., habit acquisition)
activity would increase in regions associated with habitual control,
whereas the involvement of regions implicated in goal-directed control
would decrease (Zwosta, Ruge, Goschke, & Wolfensteller, 2018;
Liljeholm, Dunn, & O’Doherty, 2015; Tricomi et al., 2009). Second, we
expected neural activity during training in these regions to be
predictive of revaluation insensitivity in the test phase (Watson
et al., 2018; Zwosta et al., 2018; Liljeholm et al., 2015; de Wit
et al., 2009). Third, in line with previous work (Watson et al., 2018;
Valentin et al., 2007), we hypothesized that, in the test phase, we
would find higher activity in areas implicated in goal-directed action,
cognitive control, and response conflict when participants flexibly
updated their responses, and equal (if anything, reduced) activity in
habit-related regions. Finally, we expected that “slips of action”
would be associated with higher activity in habit regions and reduced
activity in goal-directed regions (Watson et al., 2018).

Our central aim was to investigate the neural basis of implementation
intentions and their effect on behavioral flexibility. To this end, we
measured neural activity related to the effect of implementation
intentions on acquisition and flexible adjustment of instrumental
actions on the SORT. We hypothesized that the use of implementation
intentions (compared with goal intentions) during training would lead
to increased habit acquisition as reflected by higher accuracy,
increased automaticity (measured with the Self-Report Behavioral
Automaticity Index; Gardner, Abraham, Lally, & de Bruijn, 2012), and
increased brain activity in habit regions and equal (or, if anything,
reduced) activity in goal-directed regions. Moreover, we expected
if–then planning to lead to increased reliance on previously formed S–R
associations in the subsequent test phase, as indicated by inflexible,
habitual responding on value-incongruent compared with value-congruent
trials, and higher activity of habit regions during the test phase.
Finally, we expected that overcoming mentally rehearsed S–R
associations (as part of an if–then plan) would require more
goal-directed control and correspondingly engage related neural
regions.

Methods

All operationalizations, exclusion criteria, and main hypotheses and
analyses were preregistered on the Open Science Framework
(https://osf.io/yrpxa).

Participants

Participants were recruited through the participant portal of the
University of Amsterdam Web site, flyers, and word of mouth. We used
the following inclusion criteria: age 16–35 years, not having
participated in a previous study using this same task, and no
contraindications for MRI. Data collection took place between July and
November 2020. Note that this was during the first year of the COVID-19
outbreak; however, no strict lockdowns were implemented during this
period in The Netherlands. The study was approved by the Psychology
ethics committee of the University of Amsterdam and performed in
accordance with the committee’s guidelines. All participants gave
informed consent and received either course credit or financial
compensation (€15/hr) for their time (total ∼2 hr). An additional €20
voucher was given to the participant with the highest score to motivate
participants to perform well on the task.

Forty-seven participants were enrolled, conforming to our preregistered
sampling plan. Our sample size was based on a previous pilot study,
which found a significant effect of implementation intentions in 35
participants using the same task and manipulation. Moreover, a power
analysis with G*Power (Version 3.1.9.3) showed that our target sample
size of n = 40 should be sufficient to detect a small behavioral effect
(f = 0.12) with an α level of .05 and power of .8. Six participants
were excluded from all analyses. One participant quit halfway through
participation, and five participants were excluded based on performance
exclusion criteria (see Results for details). The remaining 41
participants (22 women, 19 men) had a mean age of 23.2 (SD = 4.1)
years. All participants had normal or corrected-to-normal vision, and
all were right-handed except one who was ambidextrous. All participants
were free of neurological or psychiatric disorders and had completed or
were enrolled in higher professional education at the time of
participation, the vast majority being university students. Two
participants were native German speakers who spoke Dutch fluently; all
others were native Dutch speakers.

Stimuli and Materials

Procedure

Participants performed a computerized instrumental learning task called
the SORT (Figure 1; Watson, Gladwin, et al., 2022), programmed in
Presentation (Version 18.1). Participants played a hungry skateboarder
with the objective to collect ice creams to earn points and satisfy
their hunger by pressing a response button. They were informed that the
best performing participant at the end of the study would receive a €20
voucher. Four pictures of ice creams were used: a Cornetto, a Magnum, a
Rocket ice lolly, and a soft serve ice cream. The task consisted of
three phases. First, participants completed an instrumental training
phase without strategic planning outside the scanner, after which they
were moved to the MRI scanner and performed an instrumental training
phase with strategic planning followed by a test phase (see Figure 1).
The symmetrical nature of the task stems from the inclusion of both
valuable and nonvaluable outcomes, which allows comparisons in the test
phase (when outcome values change) between the value-congruent and
value-incongruent conditions to be made with the same response type
(see Watson, Gladwin, et al., 2022, for a more elaborate discussion on
the advantages of this task). The total experiment took ∼2 hr, of which
1 hr was spent in the scanner.

The task used here is almost identical to that of a previous study in
which we tested the same hypothesis behaviorally (van Timmeren & de
Wit, 2022), apart from the following changes. To minimize head
movements, we used a static version of the task here instead of having
ice cream trucks moving across the screen. We added one block of
practice with strategic planning before participants were moved to the
scanner, so that they could read the intentions out loud once and ask
questions. Moreover, we adapted the task to promote stimulus–outcome
(S-O) learning across intention conditions, to rule out that any effect
of implementation intentions on behavioral flexibility would be
mediated by reduced contingency knowledge, as was the case in the
original behavioral study (van Timmeren & de Wit, 2022). To this end,
we changed the way in which the blocks were composed in the first part
of training (i.e., without intentions): Instead of alternating between
two sets of four ice cream vans, each block now contains four (out of
eight) pseudorandomly selected stimuli (see Instrumental Training
section for details). More so than with the alternating van sets,
participants are now forced to pay attention to all outcomes in the
value-screen and evaluate for which stimulus they should (not) make a
response.

Instrumental Training

At the start of the task, participants were instructed that their goal
was to collect valuable ice creams (which earn points and alleviate
hunger) and avoid collecting nonvaluable ice creams (which lose points
and cause stomach ache) by (not) responding to ice cream vans. There
were four different ice creams, and before each block of instrumental
training, participants were shown which two ice creams were valuable
(green) and which two ice creams were not valuable (red; Figure 1A).
The position of the valuable and nonvaluable ice creams (left/right)
was counterbalanced across participants. Each ice cream was associated
with two out of eight vans (Figure 1B): one van always predicting this
ice cream as being valuable and the other as being nonvaluable. Each
block contained only half of the vans: two associated with a valuable
ice cream and two with a nonvaluable ice cream. Participants were told
to find out by trial and error which ice cream truck delivered which
ice cream, and that the S-O contingencies would remain the same
throughout the whole task. Participants first practiced with different
discriminative stimuli (scooters) and outcomes (pizzas) for two blocks
to familiarize them with this procedure. As mentioned previously, the
composition of the blocks (i.e., which four out of eight vans were
presented during a block) was now pseudorandomized. The conditions
described above allow for six unique combinations of four vans, which
were presented twice each (order randomized) during this first part of
training, for a total of 12 blocks. The contingencies between ice
creams and vans and which of the ice creams was valuable/nonvaluable
were randomized across participants.

Figure 1. Overview of the study and experimental design. Participants
were told they were playing a hungry skateboarder and their goal was to
collect some ice creams and not others to earn points. (A) Participants
first received instrumental training. Each block started with a
value-screen (represented by the black rectangle), followed by a block
of 16 training trials (see B). Each block contained four vans
(pseudorandomly selected). Training then continued with participants
additionally using implementation intentions (trained with Van-Set A)
or goal intentions (trained with Van-Set B; see C), with intention
instructions (see B) being presented before each instrumental learning
block. Finally, participants completed six test blocks in which all
eight vans (Van-Sets A and B) would appear intermixed and consequently
the associated outcome-values of some vans changed compared with
training (see C, comparing the “Train” vs. “Test” columns). (B) Train
trial: When a van was presented, participants had to decide whether to
make a response within 500 msec, after which the ice cream appeared
(irrespective of a response) on top of the van for 500 msec. Test
trial: identical to train trials, but now (i) a banner appeared on top
of the van instead of the ice cream to prevent feedback about the
outcome (i.e., nominal extinction) and (ii) response time was reduced
to 450 msec. Value screen: The outcome-value screen indicates which ice
creams should (green) and should not (red) be collected. Intention
instructions: Vans were trained with either implementation intentions,
indicating for which ice cream van they should or should not make a
response, or goal intentions, indicating for which ice cream they
should (not) make a response. (C) An overview of stimulus-outcome
contingencies (example set) and associated values across different
phases of the task. The contingencies between each ice cream and van
remained consistent throughout the whole task, but the value of each
ice cream (and hence the associated response) was stable only during
training. During the critical test phase, the associated outcome values
changed (were incongruent) relative to the training value for half of
the stimuli (indicated by arrows). This results in four conditions:
still-valuable trials (valuable, congruent), upvalued trials (valuable,
incongruent), still-not-valuable trials (nonvaluable, congruent), and
devalued trials (nonvaluable, incongruent). For example, the first van
always delivered a Rocket, which was valuable throughout training but
no longer valuable during test (i.e., devalued). Shown here is an
example of the contingencies in one of six test blocks; across the test
phase, the correct response for each stimulus was equally often
congruent and incongruent. Deval = devalued; still val =
still-valuable; upval = upvalued; still not = still-not-valuable
trials.

Each stimulus was shown 4 times per block, constituting 16 trials.
Trial order was randomized per eight trials, with each van being
presented twice in the first and twice in the second half of a block.
Each trial started with a jittered 1- to 5-sec intertrial interval.
Participants were instructed that they should respond as quickly as
possible and before the deliverer disappeared (after 500 msec).
Irrespective of the response, the associated outcome was then presented
for 500 msec. Thus, participants did not receive direct feedback about
the accuracy of their response, to balance the feedback provided for
valuable and nonvaluable outcomes and to promote goal-directed (R-O)
learning and S-O knowledge. Each block ended with a 3-sec feedback
screen that displayed accuracy and late responses in that block and the
total number of points collected (Figure 1D).
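The block composition and trial ordering described in this section can be sketched in a few lines of Python. This is a minimal illustration, not the authors' Presentation script. It assumes that the six unique combinations arise from choosing which two of the four ice creams are valuable in a block (4 choose 2 = 6), and the van labels such as "Rocket/go" are hypothetical names introduced only for this example.

```python
import itertools
import random

ICE_CREAMS = ["Cornetto", "Magnum", "Rocket", "Soft serve"]


def block_compositions():
    """Enumerate blocks with two valuable and two nonvaluable ice creams.

    Assumption: each ice cream has one van that signals it as valuable ("go")
    and one that signals it as nonvaluable ("no-go"); labels are hypothetical.
    """
    blocks = []
    for valuable in itertools.combinations(ICE_CREAMS, 2):
        vans = []
        for ice in ICE_CREAMS:
            role = "go" if ice in valuable else "no-go"
            vans.append(f"{ice}/{role}")
        blocks.append(vans)
    return blocks


def trial_order(vans, rng):
    """Build 16 trials: each van twice in the first and twice in the second half."""
    order = []
    for _ in range(2):           # two halves of eight trials
        half = list(vans) * 2    # every van appears twice per half
        rng.shuffle(half)        # randomize within the half only
        order.extend(half)
    return order


rng = random.Random(0)
blocks = block_compositions()
first_part = blocks * 2          # each composition presented twice: 12 blocks
rng.shuffle(first_part)          # block order randomized
trials = trial_order(first_part[0], rng)
```

Shuffling within each half, rather than across all 16 trials, is what guarantees the stated constraint that every van appears twice before and twice after the midpoint of a block.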

Instrumental Training with Intentions

The next phase of training took place in the MRI scanner. Participants
were told that instead of seeing which ice creams were valuable or
nonvaluable, each block would now start with sentences that would help
them perform well. These sentences came in two different forms
(Figure 1D). Goal intentions indicated for each ice cream whether they
should make a response (R-O), formulated as “If I see [picture of an
ice cream], then I WILL press.” Implementation intentions indicated for
each ice cream van whether they should make a response or not (S–R),
formulated as “If I see [picture of an ice cream van] then I WILL (NOT)
press.” Each intention was presented for 2500 msec and twice per
intention block (randomized order). Half of the stimuli were trained
using goal intentions and the other half using implementation
intentions. Each block of verbal intentions was directly followed by a
block of instrumental training (identical to the previous phase) with
the corresponding stimuli. Blocks now alternated between two sets of
vans, one van-set being trained with implementation intentions (S1–S4,
“Van-Set A”) and one with goal intentions (S5–S8, “Van-Set B”). Whether
training started with an implementation or goal intention block was
counterbalanced across participants. At the end of regular instrumental
training and before being moved to the scanner, participants practiced
each verbal intention without instrumental training for one block,
followed by two blocks (one for each intention type) with instrumental
training. During these first few practice blocks outside the scanner,
participants were asked to read the intentions out loud. During the
subsequent 24 blocks of training with intentions in the scanner,
participants were instructed to subvocalize the intentions instead of
reading them out loud to minimize head motion. Participants entered the
scanner in a head-first supine position and were able to view the
screen, on which the task stimuli were presented, using a mirror
attached to the head coil. A button box allowed them to collect ice
creams by responding using their right index finger. At the end of
training with intentions, participants completed a questionnaire on
subjective automaticity (Self-Report Behavioral Automaticity Index
[SRBAI]) and were tested on their S-O knowledge (details below;
Figure 1E). We had planned to additionally obtain a (pre-intention)
baseline measure of these questionnaires, but because of a programming
error, they were presented after the practice blocks with intentions,
making them unusable as a baseline measure.

Test Phase

Participants completed six test blocks. The test phase was similar to
the first training phase (without intentions), but with some important
differences. First, as intention blocks were no longer presented,
value-screens were again shown at the start of each block, for a
duration of 4 sec. Second, participants were told that the ice cream
deliverers placed a banner on top of their van, blocking the view of
the ice cream they delivered (i.e., nominal extinction). Because each
van still kept on delivering the same ice cream as during training,
they should base their choice on what they learned before. Third, the
feedback screens presented at the end of each block no longer included
information on the accuracy of their responses, but only the percentage
of responses, nonresponses, and late responses. We did this to prevent
outcome-based learning during the test phase. We explicitly instructed
participants that each block contained an equal number of valuable and
nonvaluable outcomes so they knew they should aim for a 50%/50%
distribution. Fourth, we shortened the response window to 450 msec to
force rapid responding, which has been shown to boost the expression of
habitual slips (Hardwick, Forrence, Krakauer, & Haith, 2019). However,
because many participants responded just after the 450-msec time limit,
we decided to include responses up to 600 msec for both the behavioral
and fMRI analyses to increase the number of included trials in the fMRI
analyses. This change did not significantly impact the pattern of
behavioral results, which was unsurprising as the test phase was
conducted in extinction, meaning that no performance feedback was
provided during this period. Finally and crucially, participants were
informed that the final phase would be more challenging because all
eight ice cream vans would appear intermixed during each block. The
crucial consequence of each block containing all eight stimuli is that
half of the vans would now deliver an ice cream with a value
incongruent with the value during training. Some ice cream vans for
which they had been trained to always make a go response during
training now delivered a (devalued) ice cream that should not be
collected. Vice versa, other ice cream vans had carried nonvaluable
outcomes during training, but their signaled outcome was upvalued and
therefore required a go response. On other (value-congruent) test
trials, the signaled outcome remained the same (i.e., still-valuable
and still-not-valuable trials).

Consider, for example, the Rocket ice cream in Figure 1C. In this
example, during training, this ice cream is always delivered by the van
with a purple star and the van with the pink circle. During training
blocks with the van with the purple star, the Rocket is valuable and
therefore requires a go response. In contrast, during training blocks
with the van with the pink circle, the Rocket is not valuable, and
participants should refrain from pressing the space bar (i.e., no-go
response). Subsequently, during the test block, all stimuli (ice cream
vans) are presented, and in the example illustrated in Figure 1C, the
Rocket is instructed to be currently not valuable. This means that the
van with the purple star signals a devalued outcome (i.e., this is
value-incongruent with training and requires a different response), and
the van with the pink circle signals a still-not-valuable outcome
(i.e., value-congruent; the learned response remains correct).
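The mapping from training value and test value to the four test conditions can be written down compactly. The sketch below is only an illustration of the design logic described above; the function names are hypothetical and do not come from the original task code.

```python
def classify_test_trial(trained_valuable: bool, test_valuable: bool) -> str:
    """Return the test condition for a van, given its signaled outcome value
    during training and the value of that outcome in the current test block."""
    if trained_valuable and test_valuable:
        return "still-valuable"       # congruent, go remains correct
    if trained_valuable and not test_valuable:
        return "devalued"             # incongruent, go must now be withheld
    if not trained_valuable and test_valuable:
        return "upvalued"             # incongruent, no-go must become go
    return "still-not-valuable"       # congruent, no-go remains correct


def correct_response(test_valuable: bool) -> str:
    """The correct response depends only on the current (test) value."""
    return "go" if test_valuable else "no-go"


def is_congruent(trained_valuable: bool, test_valuable: bool) -> bool:
    """A trial is value-congruent when the trained and test values match."""
    return trained_valuable == test_valuable


# Van with the purple star: Rocket valuable in training, not valuable at test.
purple_star = classify_test_trial(trained_valuable=True, test_valuable=False)
print(purple_star, correct_response(False))  # devalued no-go
```

Because the correct response is determined by the test value alone, a "slip of action" corresponds to responding according to the trained value on an incongruent trial.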

SRBAI

The SRBAI (Gardner et al., 2012) is a 4-item scale that captures
self-reported habitual behavior patterns, which we adapted to assess
automaticity of (not) responding to the ice cream vans. Participants
were presented with each ice cream van and asked to indicate the
associated response (press or not press) and the degree to which (not)
making a response was something they did: “automatically,” “without
having to consciously remember,” “without thinking,” and “before I
realize I am doing it.” Each item was scored on a scale ranging from 1
(strongly disagree) to 100 (strongly agree). The SRBAI scale was
previously shown to have good reliability and validity (Gardner et al.,
2012). Before the four SRBAI items appeared, participants were asked to
indicate which response was associated with that stimulus (“making a
response” / “not making a response”) to test S–R knowledge. Cronbach’s
alpha was calculated separately for each of the four conditions
(2 intentions × 2 values), using the eight test items (four SRBAI
questions for the two stimuli per condition). The results indicate high
internal reliability, with alpha ranging from .91 to .95. The final
score was calculated separately for each intention by taking the mean
across the four items (range: 1–100), with higher scores reflecting
more automatic behavior.
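For readers who want to reproduce the reliability estimate, Cronbach's alpha can be computed from a participants-by-items matrix with the standard formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of the summed score). The sketch below uses only the Python standard library and made-up ratings; it is not the authors' analysis code, and the data layout is an assumption.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of rows (one row per participant),
    each row holding that participant's ratings on the k items."""
    k = len(scores[0])
    # Sample variance of each item across participants.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed score across participants.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical ratings (1-100) from three participants on two items.
ratings = [[10, 20], [20, 10], [30, 30]]
alpha = cronbach_alpha(ratings)
```

With the eight items per condition described above, each row would hold eight ratings (four SRBAI questions for each of the two stimuli).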

Test of Stimulus–Outcome Knowledge

Participants were asked about their knowledge of the S-O contingencies
by asking them, for each ice cream van, which ice cream it delivered.
After selecting one of the four ice creams, participants were asked to
indicate how confident they were about their decision (0–100).
Composite scores, reflecting S-O knowledge, were calculated for each
intention and separately for go- and no-go-trained stimuli by
multiplying the percentage of correct S-O contingencies (0%/50%/100%)
by the mean confidence percentage.
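As a worked example of the composite score, the sketch below multiplies percentage correct by mean confidence for one cell of the design (one intention type, go- or no-go-trained, two vans). Expressing the product on a 0–100 scale is an assumption made here for readability; the text does not state the final scaling, and the function name is hypothetical.

```python
def so_composite(correct_flags, confidences):
    """Composite S-O knowledge score for one design cell.

    correct_flags: one boolean per van (two vans per cell gives 0/50/100%).
    confidences:   one confidence rating (0-100) per van.
    Returns percentage correct times mean confidence, rescaled to 0-100
    (the rescaling is an assumption, not stated in the text).
    """
    pct_correct = 100 * sum(correct_flags) / len(correct_flags)
    mean_conf = sum(confidences) / len(confidences)
    return pct_correct * mean_conf / 100


# One van identified correctly, the other not, with confidence 80 and 60.
score = so_composite([True, False], [80, 60])
print(score)  # 35.0
```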

Preregistered Behavioral Data Analysis

Behavioral data analyses were performed using IBM SPSS
统计数据 25 for Mac for frequentist statistics and JASP
Version 0.16.3 ( JASP Team, 2018) for Bayesian statistics.
For data analysis purposes, the training data were col-
lapsed across blocks of three, referred to as block-sets.
Accuracy is reflected by the percentage of trials on which
a correct response was made, calculated by the number of
correct responses divided by the total number of trials. 在
line with the fMRI analyses, trials on which a late response
was made were not included in the analyses (of both accu-
racy and RTs). To assess that learning took place over the
first part of the training without intentions, accuracy was
analyzed using a 2 × 4 repeated-measures ANOVA with
within-subject factors Value (valuable or nonvaluable)
and Block-set (1–4). The second part of training was ana-
lyzed using a 2 × 2 × 4 repeated-measures ANOVA, 和
Intention Type (implementation or goal intention) 作为
additional factor. RTs for correct responses (and thus only
for valuable go trials) were analyzed with similar ANOVAs.
For the test phase, data were analyzed using a 2 × 2 × 2
repeated-measures ANOVA with three factors: Intention
Type (implementation or goal intention), Test Value (valu-
able or nonvaluable during test), and Congruency (congru-
ent or incongruent with value during training). 因此, 为了
each intention type there are four conditions: still-valuable
试验 (valuable, congruent), upvalued trials (valuable, incon-
gruent), still-not-valuable trials (nonvaluable, congruent),
and devalued trials (nonvaluable, incongruent). 再次,
RT (including all responses up to 600 毫秒) were analyzed
using similar ANOVAs but now also analyzing responses on
no-go trials (IE。, responses on still-not-valuable and deva-
lued trials). Note that eight participants were excluded from
the no-go analyses because they performed perfectly on
still-not-valuable trials and thus did not make any response.
Subjective automaticity (SRBAI scores) for responding
to stimuli trained with implementation and goal intentions
at the end of training was compared using a paired t test.
Finally, the relationship between automaticity and the
“revaluation insensitivity” index was tested for both
intention types separately using correlational analyses. A
revaluation insensitivity index was calculated for each
intention type by taking the difference between accuracy
for congruent and incongruent test trials, separately for
go-trained (still-valuable minus devalued) and no-go-trained
stimuli (still-not-valuable minus upvalued), with higher
revaluation insensitivity scores indicating more habitual
performance. Kendall’s tau was used because the four
revaluation indices and the SRBAI scores were not normally
distributed. In the case of violations of sphericity, we
report Greenhouse–Geisser corrected degrees of freedom and
p values. In addition to 95% confidence intervals, partial
eta squared (ηp²) for the ANOVAs and Cohen’s d for paired
t tests are reported as estimates of effect sizes.
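
As an illustration of the index described above, the following minimal sketch computes accuracy and the two revaluation insensitivity scores for one intention type. The function names, the dictionary keys, and the trial counts are hypothetical and are not taken from the study's analysis scripts.

```python
def accuracy(n_correct, n_trials):
    """Percentage of (non-late) trials with a correct response."""
    if n_trials == 0:
        raise ValueError("no trials to score")
    return 100.0 * n_correct / n_trials


def revaluation_insensitivity(acc):
    """Return the go-trained and no-go-trained indices for one intention type.

    `acc` maps a condition name to accuracy (%). Each index is congruent
    minus incongruent accuracy, so higher values mean a larger accuracy
    drop after revaluation (more habitual performance).
    """
    return {
        "go_trained": acc["still_valuable"] - acc["devalued"],
        "nogo_trained": acc["still_not_valuable"] - acc["upvalued"],
    }


# Hypothetical trial counts for one participant and one intention type.
example = {
    "still_valuable": accuracy(18, 20),
    "devalued": accuracy(12, 20),
    "still_not_valuable": accuracy(19, 20),
    "upvalued": accuracy(16, 20),
}
indices = revaluation_insensitivity(example)
print(indices)  # {'go_trained': 30.0, 'nogo_trained': 15.0}
```

Computing the index per intention type yields the four indices (two per intention type) that enter the correlational analyses.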

We additionally conducted corresponding Bayesian
analyses. For null results (p > .05), as preregistered, we
report the Bayes factor BF01, which quantifies the relative evidence in favor of the null hypothesis (H0) over the alternative hypothesis (H1). For ANOVAs, we report the BFexcl, which quantifies the extent to which the data support inclusion of the factor of interest in the model (i.e., the change from prior to posterior inclusion odds, across matched models). Finally, although we interpret significant findings on the basis of p < .05, we also report BFs for comprehensiveness and transparency (i.e., BF10, or BFincl for ANOVAs, which quantify evidence in favor of the alternative hypothesis over H0 and are identical to 1/BF01 and 1/BFexcl, respectively). BFs were interpreted according to Table 1 in Wetzels and colleagues (2011), with BFs between 1 and 3 reflecting anecdotal support, BFs larger than 3 reflecting substantial support, and BFs larger than 10 reflecting strong support. In all Bayesian analyses, JASP’s default priors (Cauchy scale = 0.707 for t tests, and r = 0.5 for fixed and r = 1 for random effects for ANOVAs) were used.

MRI Data Acquisition

All MRIs were performed on a 3-Tesla, full-body Achieva dStream MRI-scanner (Philips Medical Systems) equipped with a 32-channel head coil. After entering the scanner, a low-resolution survey scan was made to determine the location of the field of view. fMRI scans were acquired at a ∼30° angle from the anterior–posterior commissure line to maximize signal sensitivity in orbital regions (Deichmann, Gottfried, Hutton, & Turner, 2003) using a T2*-weighted single-shot gradient echo imaging sequence with the following parameters: repetition time = 2000 msec; echo time = 28 msec; flip angle = 76.1°; voxel size = 3 mm³ with 0.3-mm slice gap; matrix size = 80 × 78; number of slices = 36; field of view = 240 × 118.5 × 240 mm. The training with intentions was split in two runs of 598 scans each, whereas 415 scans were acquired for the test phase. The first six volumes of each run were discarded to allow T1 saturation to reach equilibrium.

A high-resolution T1-weighted structural image was acquired before the final run (while participants completed the post-training SRBAI and SO-test) using an MPRAGE sequence with the following parameters: voxel size = 1 mm³; field of view = 240 × 220 × 188 mm; repetition time = 8.2 msec; echo time = 3.7 msec; number of slices = 220; flip angle = 8°.

fMRI Data Analysis

Image Preprocessing

MRI data were first converted to Brain Imaging Data Structure format using in-house scripts. An initial check of data quality was done by visually inspecting the image-quality metrics derived from MRIQC v0.15.0 (Esteban et al., 2017). Data were preprocessed using fMRIPrep v20.1.1 (Esteban et al., 2019; RRID:SCR_016216), which is based on Nipype 1.5.0 (Gorgolewski et al., 2011; RRID:SCR_002502), with the default processing steps. These included brain extraction, segmentation, and surface reconstruction of the structural T1 image; spatial normalization of both the structural and functional data to MNI space; and head motion estimation, coregistration, susceptibility distortion correction, and resampling to 2 mm³ of the functional data. No slice-timing correction was performed. A comprehensive description of the preprocessing pipeline is available here: https://osf.io/72bsh.

Table 1. Imaging Results of the Training Phase (Exploratory)

| Contrast | Region | MNI Coordinates (x, y, z) | Cluster Size (Voxels) | z Score at Peak Level | Correction |
|---|---|---|---|---|---|
| Increase over training blocks (go) | Caudate nucleus head | 22, 6, 30 | 443 | 4.37 | Cluster |
| | Amygdalo-hippocampal junction | −10, −4, −14 | 348 | 5.17 | Peak |
| | Angular gyrus | 20, −52, 38 | 214 | 4.92 | Peak |
| | Posterior putamen | 26, −20, 4 | 34 | 3.96 | SVC Tricomi |
| Decrease over training blocks (go) | Anterior caudate L | −24, 10, 2 | 912 | 6.53 | Cluster |
| | Anterior caudate R | 24, 10, −4 | 537 | 6.39 | Cluster |
| | Primary motor/SMA | 8, −24, 60 | 860 | 5.44 | Cluster |
| | Hippocampus/putamen | 43, 14, −8 | 657 | 4.83 | Cluster |
| | Temporal cortex L | −46, −46, −4 | 591 | 5.69 | Cluster |
| Goal > implementation intentions block-set 1 (go) | Anterior caudate | 13, 18, −4 | 40 | 3.69 | SVC striatum |

SVC = small volume correction; L = left; R = right.

fMRI Statistical Analyses

The preprocessed functional data were further analyzed using Statistical Parametric Mapping software (SPM12, Wellcome Trust Centre for Neuroimaging). The data were spatially smoothed using a Gaussian kernel with a FWHM of 8 mm, and all functional data were high-pass filtered (with a 128-sec cutoff) to remove slow signal drifts.
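
For readers less familiar with the FWHM convention, the sketch below converts the reported 8-mm FWHM into the standard deviation of the Gaussian and builds a normalized one-dimensional kernel. It is an illustration only, not SPM's implementation; the 2-mm voxel size (the resampling resolution mentioned above), the three-sigma truncation, and the function names are assumptions made for the example.

```python
import math


def fwhm_to_sigma(fwhm_mm):
    """Convert full width at half maximum to the Gaussian standard deviation."""
    return fwhm_mm / (2.0 * math.sqrt(2.0 * math.log(2.0)))


def gaussian_kernel_1d(fwhm_mm, voxel_mm, radius_sigmas=3.0):
    """Discrete 1-D Gaussian kernel (weights sum to 1), truncated at a few sigmas."""
    sigma_vox = fwhm_to_sigma(fwhm_mm) / voxel_mm
    radius = int(math.ceil(radius_sigmas * sigma_vox))
    weights = [math.exp(-0.5 * (i / sigma_vox) ** 2)
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


sigma_mm = fwhm_to_sigma(8.0)            # roughly 3.4 mm
kernel = gaussian_kernel_1d(8.0, 2.0)    # assumed 2-mm voxels
print(round(sigma_mm, 3), len(kernel))
```

A three-dimensional isotropic Gaussian is separable, so applying such a kernel along each axis in turn is equivalent to smoothing with the full 3-D kernel.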

First-Level Analysis

For the first-level analysis of the fMRI data, a general linear
model was constructed for each participant, concatenated
over all three runs from the training and test phase. For
data on training with intentions, trial onsets of valuable
stimuli and nonvaluable stimuli for implementation and
goal intentions were modeled using stick functions, making
four conditions. To look at the effect of time on training,
these were modeled as separate regressors per three
blocks, making four training block-sets. Only correct trials
(i.e., where an accurate (non)response was made) were
included. Blocks of verbal rehearsal of implementation
and goal intentions were additionally modeled as blocks
of 28 sec (total duration of eight 3.5-sec trials). For the test
phase, stick functions modeled the trial onsets of
still-valuable and still-not-valuable (“value-congruent”; the
outcome value is congruent with training phase) and
devalued and upvalued (“value-incongruent”; the outcome
value is not congruent with training phase) stimuli that
were trained with implementation or goal intentions
separately, making eight regressors. To investigate BOLD
activity during habitual (c)omission errors (habitual “slips”
in case of incongruent trials), separate regressors were
included for incorrect trials for all conditions. The following
regressors of no interest were included separately for
each run: one regressor for errors (only for training, as
test-errors/“slips” were modeled as regressors of interest)
and late trials, keypresses, feedback-displays,
value-screens (only for test phase), and six realignment
parameters capturing rotation and translation to correct for
residual participant motion. Three session constants were
included in the model. All onsets were then convolved
with the canonical hemodynamic response function, and
an autoregressive AR(1) model was used to correct for
serial correlations. The general linear model was regressed
against the fMRI data to generate parameter estimates for
each participant.
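
To make the regressor construction above concrete, here is a minimal sketch of a stick function convolved with a double-gamma response. It is not SPM's code: the shape parameters (6 and 16, ratio 1/6) are assumed values resembling a commonly used canonical form, the response is sampled once per scan rather than at a finer time grid, and the onset and function names are hypothetical.

```python
import math


def gamma_pdf(t, shape):
    """Gamma density with unit scale; zero for t <= 0."""
    if t <= 0:
        return 0.0
    return t ** (shape - 1) * math.exp(-t) / math.gamma(shape)


def double_gamma_hrf(tr, duration=32.0):
    """Double-gamma response sampled every `tr` seconds, scaled to sum to 1.

    Assumed shape parameters: 6 (response) and 16 (undershoot), ratio 1/6.
    """
    n_samples = int(duration / tr) + 1
    hrf = [gamma_pdf(i * tr, 6.0) - gamma_pdf(i * tr, 16.0) / 6.0
           for i in range(n_samples)]
    total = sum(hrf)
    return [h / total for h in hrf]


def stick_function(onsets_sec, n_scans, tr):
    """Place a unit impulse at the scan nearest to each trial onset."""
    sticks = [0.0] * n_scans
    for onset in onsets_sec:
        scan = int(round(onset / tr))
        if 0 <= scan < n_scans:
            sticks[scan] += 1.0
    return sticks


def convolve(signal, kernel):
    """Causal discrete convolution, truncated to the length of `signal`."""
    out = [0.0] * len(signal)
    for i, s in enumerate(signal):
        if s == 0.0:
            continue
        for j, k in enumerate(kernel):
            if i + j < len(out):
                out[i + j] += s * k
    return out


TR = 2.0          # repetition time in seconds, as reported for the fMRI scans
onsets = [10.0]   # hypothetical onset of a single trial, in seconds
regressor = convolve(stick_function(onsets, n_scans=20, tr=TR),
                     double_gamma_hrf(TR))
peak_scan = regressor.index(max(regressor))
print(peak_scan)
```

In this simplified version the predicted response peaks a few scans after the trial onset, which is the delay the convolution step is meant to capture.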

Regressor-specific first-level contrast images were created
for the training- and test-regressors modeling the different
conditions of interest to construct the planned
second-level full factorial models. These contrasts of
parameter estimates were then entered into
between-subjects ANOVAs to generate group-level random-effects
statistics. To test for a difference in learning between
intention types, contrasts of parameter estimates of the
instrumental training phase were entered into a 2 × 4 ×
2 (Value × Block-set × Intention) factorial ANOVA.
Following estimation of the second-level model, t tests were
specified by adding linear weights to each instrumental
training block-set, modeling increases over training as
[−1.5 −0.5 0.5 1.5] and decreases as [1.5 0.5 −0.5 −1.5].
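
The linear weights above are simply mean-centered, equally spaced values, one per block-set. The short sketch below generates them and applies them to a set of parameter estimates; the estimates and the function names are hypothetical and serve only to show the arithmetic.

```python
def linear_weights(n_levels):
    """Mean-centered, equally spaced weights for a linear trend contrast."""
    center = (n_levels - 1) / 2.0
    return [i - center for i in range(n_levels)]


def contrast_value(weights, estimates):
    """Weighted sum of parameter estimates (one estimate per block-set)."""
    return sum(w * b for w, b in zip(weights, estimates))


increase = linear_weights(4)         # [-1.5, -0.5, 0.5, 1.5]
decrease = [-w for w in increase]    # [1.5, 0.5, -0.5, -1.5]

# Hypothetical parameter estimates for block-sets 1 to 4.
betas = [0.2, 0.4, 0.5, 0.9]
print(round(contrast_value(increase, betas), 3))
print(round(contrast_value(decrease, betas), 3))
```

Because the weights sum to zero, a region with constant activity across block-sets yields a contrast value of zero, and the increase and decrease contrasts are each other's negation.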

In addition, first-level contrast images were created. To
assess the effect of planning during training, contrasts were
created comparing training with implementation versus
goal intentions (across all blocks, separately for go and
no-go trials). To examine markers of goal-directed control
during test, we compared correct congruent trials with
correct incongruent trials (i.e., [still-valuable go > upvalued
go] and [still-not-valuable no-go > devalued no-go]). We
also investigated situations where participants fail to adapt
to the new outcome value and continue to respond according
to the learned S–R association by comparing incorrect
incongruent trials (i.e., “slips of action”) with correct
incongruent trials. Again, separate contrasts were created
for test-go and test-no-go trials (i.e., [devalued go >
upvalued go] and [upvalued no-go > devalued no-go]). Finally,
we also created a similar contrast comparing incorrect
incongruent trials (slips) with correct congruent trials
(i.e., [devalued go > still-valuable go] and [upvalued
no-go > still-not-valuable no-go]). More information about
the rationale behind these contrasts is provided in the
Results section. To assess the effect of planning strategy
on test performance, the same test-phase contrasts were
constructed but looking for an interaction with intention
type (e.g., [still-valuable go > upvalued go × implementation
> goal intention]). Parameter estimates generated
from these first-level analyses were entered into a
random-effects group analysis, and linear contrasts were
used to identify significant effects at the group level.
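
An interaction contrast such as [still-valuable go > upvalued go × implementation > goal intention] can be written as the Kronecker product of the two simple contrasts. The sketch below shows this for four regressors; the ordering of the regressors is an assumption made for illustration and need not match the ordering in the actual design matrix.

```python
def kron(a, b):
    """Kronecker product of two 1-D weight vectors."""
    return [x * y for x in a for y in b]


congruency = [1, -1]   # still-valuable go > upvalued go
intention = [1, -1]    # implementation > goal intention

# Assumed regressor order: condition varies slowest, intention fastest.
labels = [f"{cond} / {intent}"
          for cond in ("still-valuable go", "upvalued go")
          for intent in ("implementation", "goal")]
weights = kron(congruency, intention)

for label, weight in zip(labels, weights):
    print(f"{weight:+d}  {label}")
```

The resulting weights test whether the congruency difference is larger for stimuli trained with implementation intentions than for stimuli trained with goal intentions.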

Higher level whole-brain statistical maps were corrected for FWE at the cluster level (pFWE-cluster < .05) with a voxel cluster-defining threshold of p = .001 uncorrected. When activations did not reach statistical significance at the cluster level, we also checked the peak-voxel level with a threshold of p < .05 corrected (pFWE-peak < .05). In such cases, we clearly indicate this in the text, and we report the peak-voxel level results so as to be as comprehensive as possible in our reporting.

Finally, in an exploratory analysis, we further aimed to test for effects in specific regions of the striatum given prior published findings on the role of these structures in goal-directed and habitual responding (Watson et al., 2018; de Wit et al., 2012; Tricomi et al., 2009; Tanaka, Balleine, & O’Doherty, 2008; Valentin et al., 2007). In particular, we defined an anatomical ROI to examine effects in the caudate nucleus, a region previously implicated in goal-directed processes, as well as a functional ROI based on the results from Tricomi et al. (2009) that implicated the posterior putamen in habit-related processing.

In addition, we identified several ROIs in our preregistration: for habitual control, goal-directed control, response conflict, and implementation intentions. Three separate masks were created based on these ROIs to apply small volume correction (SVC). Apart from a striatal ROI (encompassing the bilateral caudate, putamen, and NAcc from the AAL atlas; Tzourio-Mazoyer et al., 2002), applying SVC with the three preregistered ROIs did not alter the pattern of results. This may be because of the large number of voxels included in the ROIs (especially the goal-directed mask), which reduces the sensitivity of the SVC. Therefore, we have opted to stick to reporting the whole-brain results for the confirmatory analyses. Whole-brain t-maps (without thresholding) of the main fMRI contrasts are available at https://neurovault.org/collections/13191/.

RESULTS

All analyses reported in this section were preregistered at the start of this study, unless indicated otherwise in the text. We generally followed the preregistered analysis plan, but in some cases, the results prompted us to further explore the data. We should also point out that we preregistered these hypotheses before finishing data analysis of our related behavioral study (van Timmeren & de Wit, 2022). Hence, we preregistered the same behavioral hypotheses for this study, although the original behavioral study only partially supported our initial predictions, a point we will come back to in the discussion. We therefore occasionally deviate from the preregistration to keep our analyses in line with analyses and findings from the behavioral study, which is always clearly indicated.

The total final sample used for the analyses consisted of 41 participants, after excluding the following participants. On the basis of the preregistered exclusion criteria, no participants were excluded on the training criterion (< 80% accuracy in the last block-set of training), whereas three were excluded because they made < 25% responses on upvalued trials trained with goal intentions in the test phase. The goal of this criterion was to ensure that participants understood the test-phase instructions and updated their performance accordingly, while not excluding participants based on the manipulation of interest (i.e., implementation intentions). We additionally excluded two participants (post hoc) based on a very low overall response rate during the test phase. Although these participants made (just) > 25% upvalued responses, we deviated from the preregistration because they were outliers on the overall response rate and responded on less than one out of three trials during the test, despite receiving explicit instruction to aim for a response rate of ∼50% and receiving feedback about that at the end of each block. Thus, they did not follow the test-phase instructions and their performance is not reliable. Note that this criterion is independent of actual task performance (accuracy) and that the in-/exclusion of these two participants does not change the general pattern of behavioral or fMRI results.

Behavioral Results

Training Phase without Intentions

As expected, participants learned to make correct responses over the first part of training (Figure 2A), as revealed by a significant main effect of Block-set on accuracy, F(2.46, 98.20) = 16.74, p < .001, ηp² = .30, BFincl = 2.81 × 10⁵, and a marginally significant effect of Block-set on RT, F(2.45, 98.07) = 2.75, p = .058, ηp² = .06, BFincl = 0.81. There was no significant difference in learning to make go versus no-go responses (main effect of Value: F(1, 40) = 2.00, p = .17, ηp² = .05, BFexcl = 1.60; Block × Value interaction: F(1.70, 68.16) = .25, p = .57, ηp² = .01, BFexcl = 22.15).

Instrumental Training with Goal versus Implementation Intentions

Following the first 12 blocks of instrumental training without planning, intentions were introduced during a practice block (still outside the scanner). Although we did not preregister to analyze those data, for completeness and in line with our previous behavioral study with this paradigm investigating the same question (van Timmeren & de Wit, 2022), we conducted a paired t test comparing the final block of training without intentions to the practice block. This analysis revealed that participants benefitted from if–then planning on the valuable go trials, as reflected by higher accuracy (M = 96.1, SD = 12.4) relative to the preceding (pre)training block-set (baseline: M = 91.8, SD = 9.1, Z(40) = 2.57, p = .01, d = 0.59, 95% CI [.81, .22], BF10 = 1.34), whereas RTs were not affected, t(40) = −.01, p = .99, d = −0.001. In contrast, the use of goal intentions negatively impacted both accuracy (M = 87.6, SD = 14.7, Z(40) = −1.86, p = .065, d = −0.40, 95% CI [−.01, −.69], BF10 = 1.36) and RTs, t(40) = −2.03, p = .049, d = −0.32, BF10 = 1.08, of go trials compared with (pre)training. For no-go trials, no significant effects of implementation, Z(40) = 1.03, p = .31, BF01 = 5.12, or goal intention, Z(40) = .10, p = .93, BF01 = 5.68, were seen.

Subsequently, when instrumental training was resumed during the scanning session, the 2 × 2 × 4 repeated-measures ANOVA indicated that the advantage of if–then planning was initially still apparent on valuable go trials (Figure 2A).
In addition to a strong main effect of Value, driven by participants performing better overall on valuable compared with nonvaluable trials, F(1, 84.47) = 10.93, p = .002, ηp² = .22, BFincl = 18.08, we found the expected preregistered three-way interaction between Intention, Value, and Block-set, F(3, 103.14) = 6.45, p < .001, ηp² = .14, BFincl = 857.7. Separate analyses of valuable and nonvaluable trials revealed a significant Intention × Block interaction for valuable, F(3, 81.78) = 6.21, p = .003, ηp² = .13, BFincl = 74.01, but not for nonvaluable trials, F(3, 120) = 1.88, p = .14, ηp² = .05, BFexcl = 2.63. The significant effect on the valuable go trials was driven by higher accuracy with implementation compared with goal intentions during the first block-set, Z(40) = 3.34, p < .001, d = 0.85, 95% CI [.64, .94], BF10 = 22.76. At the end of training (Block-Set 4), there was no longer a significant effect of Intention Type on accuracy, Z(1, 40) = −.34, p = .80, ηp² = −1.43, BF01 = 5.87.

The analysis of RTs (Figure 2A) revealed a main effect of Intention Type, F(1, 40) = 12.08, p = .001, ηp² = .23, BFincl = 11.12, with faster responses during blocks trained with implementation intentions (median = 365 msec, SD = 17) compared with goal intentions (median = 374 msec, SD = 20), but no significant effect of Block-set, F(2.4, 98.6) = 2.31, p = .09, ηp² = .05, BFexcl = 3.41, nor an interaction (p = .20, ηp² = .04, BFexcl = 3.67).

Figure 2. Behavioral results. (A) Over the course of training, participants learned to successfully respond for stimuli associated with valuable outcomes (go) and to withhold making a response for stimuli associated with nonvaluable outcomes (no-go), as reflected by increasing accuracy rates. After six blocks of regular training, some stimuli continued to be trained using implementation intentions (blue) whereas others were trained with goal intentions (blue). Following one block of practice (black dotted line), participants were moved to the scanner and resumed training with intentions. Accuracy was significantly higher initially when using implementation intentions, but toward the end of training performance was almost perfect for both implementation and goal intentions. Across training with intentions, participants were faster during blocks trained with implementation versus goal intentions. (B) During the test phase, for some stimuli, the associated outcome changed in value (and thus response) compared with training (upvalued and devalued; see Figure 1C) and participants had to flexibly update their responses accordingly. For other stimuli, the associated value and response remained congruent with training (still-valuable and still-not-valuable). Participants responded less accurately for incongruent compared with congruent trials, reflecting inflexibility as a consequence of learned S–R contingencies during training. However, training with implementation intentions did not lead to reduced flexibility. Similarly, there was no significant effect of training with implementation intentions on RT. (Shaded) error bars represent standard error of the mean. II = implementation intentions; GI = goal intentions.

Symmetrical Outcome-Revaluation Test

As expected, learned S–R associations had a clear impact on performance during the test phase (Figure 2B), as revealed by a main effect of Congruence, F(1, 40) = 65.08, p < .001, ηp² = .62, BFincl = 1.39 × 10⁷. Because test value showed significant interactions with both Congruence, F(1, 40) = 10.73, p = .002, ηp² = .21, BFincl = 8.91, and Intention Type, F(1, 40) = 5.94, p = .02, ηp² = .13, BFincl = 1.27, separate follow-up comparisons were conducted for go (associated with still-valuable and upvalued outcomes) and no-go (associated with still-not-valuable and devalued outcomes) trials. Main effects of Congruence were seen for both the go, F(1, 40) = 16.82, p < .001, ηp² = .30, BFincl = 76.40, and no-go, F(1, 40) = 56.46, p < .001, ηp² = .59, BFincl = 2.31 × 10⁶, stimuli. As can be seen in Figure 2B, the congruency effect was larger for no-go trials mainly because of participants struggling more on devalued trials, where they had to suppress responding to discriminative stimuli that previously signaled a valuable outcome.

Importantly, we were interested in the effect of implementation intentions on test performance. First, an analysis of the go test trials suggested that overall performance was worse when trained with implementation compared with goal intentions, F(1, 40) = 5.48, p = .02, ηp² = .12, although Bayesian statistics showed that this evidence was inconclusive (BFincl = 1.46). Importantly, in contrast to our preregistered hypothesis, there was no evidence for reduced flexibility as a consequence of if–then planning: The expected interaction of congruence with intention type failed to reach significance, F(1, 40) = 1.52, p = .23, ηp² = .04, BFexcl = 1.86. Given the direct relevance of the comparison between intentions for our research question, we followed these analyses up with separate (exploratory) paired t tests for still-valuable and upvalued trials to also report Bayesian evidence against a difference. Findings indicate that intentions only had a significant negative effect on (congruent) still-valuable, Z(40) = −2.55, p = .01, d = −0.56, BFincl = 3.68, but not on (incongruent) upvalued trials, t(40) = −.75, p = .46, BFexcl = 4.54. Finally, for the no-go stimuli (still-not-valuable and devalued), no main, F(1, 40) = .42, p = .52, BFexcl = 4.37, nor interaction, F(1, 40) = .06, p = .81, BFexcl = 4.25, effects of intention type were observed.

We also analyzed RTs during the test phase. A Value × Congruence interaction, F(1, 32) = 49.47, p < .001, ηp² = .61, BFincl = 2.91 × 10⁵, prompted separate analyses for trials trained with go responses (still-valuable and devalued) and for trials trained with no-go responses (still-not-valuable and upvalued). Interestingly, there was a main effect of congruence for go-trained stimuli, suggesting significantly faster RTs on devalued trials (M = 418 msec, SE = 8.8) relative to still-valuable trials (M = 443 msec, SE = 6.8; F(1, 40) = 12.56, p = .001, ηp² = .24, BFincl = 23.40), in line with the idea that habitual slips of action are triggered fast and efficiently before one has the chance to suppress them. As late responses were excluded from this analysis (following the accuracy analysis), we ran an additional analysis including RTs for late responses to make sure that this effect was not driven by a higher number of (excluded) late responses on devalued trials. This analysis showed an even stronger main effect of congruence than the original analysis without late responses, F(1, 40) = 14.84, p < .001, ηp² = .27, BFincl = 36.88. No other significant effects of RTs were found (all p > .22, BFexcl > 1.74).

Self-reported Automaticity and S-O Knowledge

Self-reported automaticity was at a high level overall
(median = 80.4%, SD = 16.7), but did not differ between
intentions, t(40) = −.98, p = .34, BF01 = 3.80, nor did
subjective automaticity correlate with revaluation insensitivity
for implementation (rτ = −.09, p = .57, BF01 = 4.39) or
goal intentions (rτ = .22, p = .17, BF01 = 2.03).
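
The Bayes factors in this section can be read against the interpretation thresholds stated in the Methods (1 to 3 anecdotal, larger than 3 substantial, larger than 10 strong). The helper below is a small sketch of that mapping; the function names and the wording of the labels for values at or below 1 are assumptions for illustration.

```python
def evidence_label(bf):
    """Verbal label for a Bayes factor, using the thresholds from the Methods."""
    if bf <= 0:
        raise ValueError("a Bayes factor must be positive")
    if bf < 1:
        return "favors the other hypothesis (interpret 1/BF instead)"
    if bf == 1:
        return "no evidence either way"
    if bf > 10:
        return "strong"
    if bf > 3:
        return "substantial"
    return "anecdotal"


def invert(bf):
    """Convert BF01 to BF10 (or BFexcl to BFincl), and vice versa."""
    return 1.0 / bf


# BF01 values reported in the paragraph above.
reported = {
    "automaticity difference between intentions": 3.80,
    "automaticity vs. insensitivity (implementation)": 4.39,
    "automaticity vs. insensitivity (goal)": 2.03,
}
for name, bf01 in reported.items():
    print(f"{name}: BF01 = {bf01} -> {evidence_label(bf01)} support for H0")
```

Under these thresholds, the first two null results count as substantial support for the null hypothesis, whereas the third is only anecdotal.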

Following van Timmeren and de Wit (2022), we also explored differences in S-O knowledge between intention types and their relationship with overall test accuracy. S-O knowledge was high (median = 89.8%, SD = 22.1) and, contrary to our previous study, no longer differed significantly between intention types, F(1, 40) = 2.07, p = .16, ηp² = .05, BF01 = 2.6; values, F(1, 40) = 3.42, p = .07, ηp² = .08, BF01 = 2.4; or their interaction, F(1, 40) = .91, p = .35, ηp² = .02, BF01 = 5.88, suggesting that the adaptation we made to the task (i.e., using a pseudorandom selection of stimuli instead of alternating between two block-sets in the first part of training; see Methods section) had the desired effect. S-O knowledge did correlate positively with test accuracy (across all four conditions) for both implementation intentions (rτ = .30, p = .008, 95% CI [.08, .52], BF10 = 7.91) and goal intentions (rτ = .39, p < .001, 95% CI [.21, .57], BF10 = 99.22).

Conclusions: Behavioral Results

We provide evidence for habit learning, as indicated by the general effect of previously learned S–R mappings on the ability to flexibly adapt responding when the cue signals a revalued outcome (i.e., incongruent). Importantly, although if–then planning seemed to increase efficiency relative to goal intentions, as reflected in superior acquisition, this was not at the expense of flexibility when outcome values changed in the test phase.

Neuroimaging Results

Instrumental Training: Across Intentions (Exploratory)

First, we were interested to explore general learning effects across intention types because this was the first time the SORT was used in the MRI scanner. These analyses showed that over the course of go training (i.e., on valuable trials), activity increased linearly in the head of the caudate nucleus extending into ACC (at p < .05 FWE rate corrected; pFWE-cluster < .05). Activation in the left amygdalo-hippocampal junction and the angular gyrus did not reach our cluster-level correction threshold, but did survive voxel-level correction at p < .05 (pFWE-voxel < .05; Table 1). In this same contrast, we also observed a cluster in the posterior putamen, which survived a small-volume correction for the posterior putamen ROI (i.e., pFWE < .05 with SVC, defined as a 10-mm sphere at peak value of the cluster that showed a significant increase over training in the study of Tricomi et al. [2009]; x = 33, y = −24, z = 0). On the other hand, activity decreased over training in the bilateral anterior caudate (a more ventral part of the striatum), primary motor cortex (extending to mid-posterior cingulate), hippocampus extending into the putamen, and the left temporal cortex (all pFWE-cluster < .05 corrected). In contrast, on no-go trials, there were no voxels that showed a significant linear change over training blocks.

Instrumental Training: Comparing Goal and Implementation Intentions

We then examined whether strategic planning affected instrumental training. The contrast comparing the average BOLD signal of trials trained with implementation intentions and goal intentions did not reveal any significant activations, on either go or no-go trials. We also tested for differences in learning between intentions over the course of training by adding linear weights to block-sets to compare increased activity over block-sets during implementation intentions with decreased activity during goal intentions, and vice versa. However, both tests of this interaction failed to show significant differences.

The finding that implementation intentions showed the most pronounced effect behaviorally early in training prompted us to conduct an exploratory analysis of only the first training block-set. This analysis revealed significantly decreased activation in the anterior caudate (pFWE < .05 with SVC, z = 3.69) on trials trained with implementation intentions compared with goal intentions (Figure 3A and Table 1). For visual purposes, the extracted average BOLD signal from the anterior caudate cluster is shown separately for each block-set and intention in Figure 3B. As can be seen here, activity was indeed lower on implementation intention trials during the first block-set only and subsequently decreased for both intentions.
A whole-brain analysis also showed decreased activity for implementation relative to goal intentions at an uncorrected threshold (p < .001) in the right lateral orbitofrontal cortex (OFC; pFWE-cluster = .061, z = 4.25; x = 26, y = 50, z = 14) and the left insula (pFWE-cluster = .28, z = 3.76; x = −42, y = 20, z = 2). However, because these results did not survive FWE-correction, we refrain from interpreting them further. To rule out that these findings were driven by RTs, which were significantly shorter for implementation compared with goal intentions, we performed an additional analysis controlling for trial-by-trial RT by including a parametric regressor (one for each of the two training runs) with RTs for each trial. This had no significant impact on the results, and we could qualitatively replicate all reported findings.

Neural Predictors of Test Performance

To determine whether brain activity during instrumental training with implementation intentions was predictive of test performance, we tested whether the average BOLD signal during training covaried with the revaluation insensitivity score. This preregistered test did not reveal significant neural predictors of test performance. For completeness, we also ran this analysis in an exploratory fashion separately for goal intentions and across intentions, but this similarly did not reveal any significant results.

Figure 3. Lower activity in the right anterior caudate early in training for implementation compared with goal intentions. (A) Voxels that showed significantly lower activation during the first block-set of training with implementation compared with goal intentions on go-trials (at pFWE < .05, small volume-corrected). The activity patterns shown are thresholded at p < .001 uncorrected. (B) Parameter estimates extracted from this anterior caudate cluster (peak at x = 13, y = 18, z = −4) over block-sets. Error bars represent 95% confidence intervals. a.u. = arbitrary units.

Symmetrical Outcome-Revaluation Test: Markers of Goal-directed versus Habitual Performance

In the test phase, changes in outcome value create conflict between goal-directed control and learned S–R associations. Specifically, to perform the correct response on incongruent trials (i.e., upvalued go and devalued no-go), participants have to exert goal-directed control and override the learned S–R mapping. Conversely, on congruent trials (still-valuable go and still-not-valuable no-go), participants can rely on the learned S–R associations. The advantage of the symmetrical outcome-revaluation test (compared with the original slips of action test) is that we can compare congruent and incongruent trials with each other unconfounded by test outcome value (and therefore required response: i.e., go or no-go). Therefore, to examine markers of goal-directed control, we first compared upvalued go with still-valuable go responses and found that this was associated with increased right insula activity (pFWE-cluster < .05, z = 4.16; Table 2). No significant activations were seen in the contrast between devalued no-go and still-not-valuable no-go trials.

To identify regions where participants fail to adapt and continue to respond according to the learned S–R

Table 2. Imaging Results of the Test Phase

Contrast; Region; MNI Coordinates (x, y, z); Max Cluster Size (Voxels); z Score (Peak); Correction

Upvalued go > still-valuable go

Insula R

Devalued slips > still-valuable go
Anterior insula L
Anterior insula R

Still-valuable go > devalued slips
vMPFC
Caudate
NAcc
Primary motor cortex
Paracentral lobule
Angular gyrus L
IPL L
38, −40, 42, 22, 8, 4, −26, −10, −30, 24, 26, 26, 42, 28, 20, 12, −30, −52
Angular gyrus R
38, −50
IPL R
Occipital cortex

Devalued slips > upvalued go
Anterior insula L
SMA
dACC
Inferior parietal lobule L
Inferior parietal lobule R
Supramarginal gyrus

Upvalued go > devalued slips
ACC, caudate nucleus
Premotor/PMC
Lateral OFC
Superior parietal lobe L
Superior parietal lobe R
Occipital/visual cortex
−36, −36, 8, 8, −56, 56, −36, −20, −26, −32, −28, 30, −30, −74, 26, 8, 18, −42, −4, 26, 22, 0, 62, −76, −62, −96, −2, 2, −10, −4, 2, 4, 60, 66, 52, 58, 8, −8, 64, 34, 34, 44, −8, 18, 42, 0, 36, 38, 16
468, 611, 621, 388, 252, 336, 1653
4.16, 5.46, 4.49, 4.64, 5.10, 4.56, 5.51
Cluster, Cluster, Cluster, Cluster, Peak, Cluster, Cluster
2510
5.33
Cluster
767, 707, 378, 431, 269, 331, 707, 327, 529, 317, 1482, 4099, 1307
5.25, 4.29, 5.47, 4.18, 4.60, 4.44, 4.29, 4.17, 4.80, 4.19, 4.12, 5.04, 6.43
Cluster, Cluster, Cluster, Cluster, Peak, Cluster, Cluster, Cluster, Cluster, Cluster, Cluster, Cluster, Cluster

L = left; R = right; NAcc = nucleus accumbens; IPL = inferior parietal lobule; (d)ACC = (dorsal) anterior cingulate cortex; PMC = primary motor cortex; OFC = orbitofrontal cortex.

association, we contrasted incorrect incongruent trials (devalued go and upvalued no-go) to correct incongruent trials (upvalued go and devalued no-go, respectively), as the latter arguably require most goal-directed control to override the learned S–R mapping. The contrast comparing devalued go responses (i.e., slips of action) and upvalued go responses is shown in Figure 4A, and revealed increased activity in a fronto-parietal network, including the left anterior insula extending to the inferior lateral prefrontal cortex, SMA, dorsal anterior cingulate cortex, bilateral inferior parietal lobule, and supramarginal gyrus (all pFWE-cluster < .05; Table 2). Conversely, lower activity during slips of action compared with upvalued go responses was seen in the left anterior cingulate cortex extending into caudate nucleus, left lateral OFC, bilateral superior parietal lobe, and several occipital/primary visual areas (all pFWE-cluster < .05, Table 2). In addition, activation in the premotor/primary motor cortex did not survive cluster-level correction but did reach peak-voxel level significance (pFWE-voxel < .05).

Although the previous contrast between devalued slips and correct upvalued go responses maximizes the difference between habitual versus goal-directed control, the conditions differ in terms of the original training outcome value (as well as test value). To mitigate this, we proceeded to compare devalued slips to still-valuable go responses, which only differ in their test outcome value. Thus, this contrast compares trials on which participants correctly continued responding according to the learned S–R association with trials on which they failed to override this association. Although we have used the same approach previously (in the study of Watson et al., 2018, the “slips versus respond valuable” contrast), this contrast was not preregistered and should thus be considered exploratory. Similar to the comparison with upvalued go responses, this comparison of slips with still-valuable go responses revealed increased anterior insula activity (bilaterally) during slips, but decreased activity in vMPFC (extending to NAcc), primary motor cortex, paracentral lobule, a large occipital cluster, and large parietal clusters (bilateral) including the angular gyrus and inferior parietal lobule (all pFWE-cluster < .05; Figure 4A).

As preregistered, we also compared upvalued no-go responses (“inhibition slips”) to correct devalued (no-go) trials, but this did not reveal any significant activation patterns. Moreover, we were not able to conduct the contrast between upvalued and still-valuable no-go trials, because of the low number of omission errors on still-valuable trials.

Our results thus identify the anterior insula as a common region associated with slips toward devalued outcomes, as activity in this region was higher during slips than during go responses toward upvalued and still-valuable outcomes. However, both contrasts are confounded by expected value (the outcome value during the test phase) as they both compare stimuli signaling a nonvaluable outcome (devalued) with stimuli signaling a valuable outcome (upvalued or still-valuable). To control for this, we ran some additional exploratory analyses, comparing activity during devalued slips with correct no-go responses on devalued and still-not-valuable trials. Although these contrasts are difficult to interpret by themselves (they are themselves confounded by pressing a button or not), looking at the overlap between all four contrasts overcomes the value-related confounds and hence could find a common process in the expression of habits. To this end, we used ImCalc to create binary images of all four contrasts thresholded at t(41) = 3.1 (equivalent to p < .001 uncorrected) and multiply them. The result of this inclusive masking analysis, which is akin to a conjunction analysis, shows that the bilateral anterior insula was commonly activated across all four contrasts (Figure 4B).

Figure 4. (A) Neural correlates of slips of action in the test phase, as revealed by increased (red–yellow) and decreased (dark–light blue) activity during devalued slips compared with upvalued responses. Clusters that survived whole-brain FWE correction include increased activity in a fronto-parietal network, including the left anterior insula extending to the inferior lateral pFC, SMA, dorsal anterior cingulate cortex, bilateral inferior parietal lobule, and supramarginal gyrus. Conversely, lower activity was seen in the left anterior cingulate cortex extending into caudate nucleus, premotor/primary motor cortex, left lateral OFC, bilateral superior parietal lobe, and several occipital/primary visual areas. Results are shown here at p < .001 (uncorrected) for visual purposes, overlaid on the mean T1 image of all participants. (B) The bilateral anterior insula was found to be commonly activated during devalued slips (x = ±40, y = 26, z = 2). Shown here in yellow are the voxels that overlap between all four contrasts comparing devalued slips relative to correct (non-)responses during still-valuable, still-not-valuable, devalued and upvalued trials (thresholded at p < .001 uncorrected).

Symmetrical Outcome-Revaluation Test: Comparing Goal and Implementation Intentions

None of the planned contrasts comparing test-phase trials trained with implementation with goal intentions revealed significant activation patterns.

DISCUSSION

The aim of the present study was to investigate whether the brain can strategically go on automatic pilot. We investigated this by measuring the impact of strategic planning (i.e., implementation intentions vs. goal intentions) on the acquisition of instrumental actions as well as subsequent flexible, behavioral adjustment. When strategic planning was first introduced during the instrumental learning phase of our paradigm, implementation intentions improved performance relative to goal intentions. Furthermore, in line with the idea that their beneficial effect was mediated by accelerated S–R learning, an exploratory analysis revealed that implementation intentions were associated with reduced activity in the anterior caudate, a brain area previously implicated in goal-directed control (Watson et al., 2018; Liljeholm, Tricomi, O’Doherty, & Balleine, 2011). These effects of strategic planning on performance and neural activity were only apparent early in training, with participants reaching high levels of accuracy (and reduced activity in the anterior caudate) by the end of the learning phase independent of intention type. Our central question, however, was whether implementation intentions would actually impede performance when flexible, behavioral adjustment was required during the subsequent outcome-revaluation test. Importantly, we found no evidence for a detrimental effect of strategic planning on the ability to adapt behavior to changing outcome values, nor any effect on underlying neural activity patterns. We conclude that strategic planning of S–R mappings may allow people to go on automatic pilot to increase behavioral efficiency, but that this does not have to come at the expense of behavioral flexibility. Therefore, mental rehearsal of S–R links does not appear to suffice for the formation of a rigid habit, refuting the notion of “instant habits” and suggesting that behavioral repetition may be crucial for the development of rigid habits. To shed light on the implications of these findings, we will first discuss them in some more detail, including the basic results (i.e., across strategies) on the relatively novel SORT.
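The inclusive masking step reported in the Results (binarizing each of the four contrast maps at t = 3.1 and multiplying the binary images) can be sketched as follows. This is a minimal pure-Python illustration, not the SPM ImCalc procedure itself: the function name, the flat-list representation of a t-map, the toy values, and the use of a strict "greater than" comparison are all assumptions.

```python
def inclusive_mask(t_maps, threshold=3.1):
    """Voxelwise product of binarized t-maps.

    t_maps: list of maps, each a flat list of voxel t-values (hypothetical layout).
    Returns 1 for voxels exceeding the threshold in every map, else 0.
    """
    n_voxels = len(t_maps[0])
    if any(len(t_map) != n_voxels for t_map in t_maps):
        raise ValueError("all t-maps must have the same number of voxels")
    overlap = [1] * n_voxels
    for t_map in t_maps:
        for i, t in enumerate(t_map):
            # Binarize this map at the threshold, then multiply into the running product
            overlap[i] *= 1 if t > threshold else 0
    return overlap


# Four toy contrast maps with four voxels each (made-up values)
toy_maps = [
    [4.0, 3.5, 1.0, 5.2],
    [3.9, 2.0, 0.5, 4.4],
    [3.2, 3.3, 3.4, 3.6],
    [4.1, 3.8, 0.2, 3.15],
]
overlap = inclusive_mask(toy_maps)  # -> [1, 0, 0, 1]
```

Because the product is 1 only where every binarized map is 1, a voxel survives only if it passes the threshold in all four contrasts, which is why the analysis behaves like a conjunction.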
First, during the instrumental learning phase, we observed increasing accuracy and decreasing RTs over the course of training, suggesting that participants acquired the S–R mappings. This interpretation was further supported by high levels of subjective automaticity of responding at the end of the instrumental learning phase and increasing involvement of two distinct parts of the dorsal striatum: the posterior putamen, replicating findings from Tricomi et al. (2009), and the caudate nucleus head. Several previous fMRI studies have indirectly implicated the dorsal striatum in habit learning, either showing that with longer instrumental training this region becomes more active (Wunderlich, Dayan, & Dolan, 2012; Tricomi et al., 2009) or that functional connectivity with the (pre-)motor cortex increases (Zwosta et al., 2018; Horga et al., 2015). Although increased activity of the posterior putamen was only significant with small-volume correction (so not very robust), activity of the caudate nucleus as well as the hippocampus survived whole-brain correction. Both regions have previously been implicated in the encoding of S–R representations (McNamee, Liljeholm, Zika, & O’Doherty, 2015). Moreover, we found decreasing activity of the primary motor cortex (extending to mid-posterior cingulate), the hippocampus (extending into the putamen), bilateral temporal cortex, and the right anterior caudate, previously implicated in goal-directed control (Liljeholm et al., 2011; Balleine & O’Doherty, 2010). In line with previous findings, these results suggest that dissociable neural regions support instrumental learning. Notably, these findings were specific to learning to make go responses for valuable outcomes. In contrast, we did not see any changes in neural activity over the course of no-go training, despite high accuracy and reported automaticity at the end of training.
Thus, our neuroimaging analyses do not provide evidence for the development of “inhibition habits” (Jahanshahi, Obeso, Rothwell, & Obeso, 2015).

Importantly, when strategic planning was introduced after the first 12 training blocks, implementation intentions initially improved go performance (reflected by higher accuracy during the first block-set than in the preceding (pre-)training block-set), whereas goal intentions impaired it (reflected by lower accuracy and slower RTs). In contrast, no-go learning (i.e., withholding a response for nonvaluable stimuli) was not affected by planning strategy. These findings replicate our previous results (van Timmeren & de Wit, 2022). Furthermore, in line with the notion of “instant habits,” this behavioral effect of implementation intentions early in training was accompanied by reduced activity in the anterior caudate relative to goal intentions. This early effect of implementation intentions quickly disappeared, however, and no differences with goal intentions were observed on accuracy and RTs in later training blocks, nor on subjective automaticity after training. Therefore, in support of the notion of strategic automaticity, it appears that instrumental acquisition initially benefitted from if–then planning, while dependency on goal-directed control (as suggested by anterior caudate activity) was reduced.

So far, there have been very few neuroimaging studies that have compared the use of implementation and goal intentions to support behavioral performance. One study (Gilbert, Gollwitzer, Cohen, Oettingen, & Burgess, 2009) showed that implementation intentions engaged the medial BA10 more (and lateral BA10 less) than a control condition, which was argued to reflect increased cue monitoring (and reduced internal information processing). This contrasts with our finding that implementation intentions lead to reduced engagement of the anterior caudate during the instrumental learning phase.
However, their control condition was very different to ours. Whereas they specified the cue and the outcome that it signaled to be available (i.e., the S-O contingency; “if the cue appears, then I can score 5 points”), we used a goal intention control condition that specified the R-O contingency, which is arguably more akin to a typical goal intention (e.g., “I will exercise to lose weight”).

In the next phase of the SORT, signaled outcome values changed, requiring flexible adaptation of responding to the discriminative stimuli. This allowed us to determine whether strategic planning (during training) would induce the rigidity that is commonly regarded as a hallmark of learned habits that are stamped in through behavioral repetition. However, we failed to find convincing evidence that if–then planning impaired the ability to flexibly adjust responding when signaled outcome values changed. This was despite the fact that participants struggled to adjust learned S–R mappings overall, as reflected in a strong main effect of congruency. Furthermore, in line with the behavioral findings, we also found no evidence for an impact of planning on neural activation patterns during the extinction test phase. Therefore, this first neuroimaging investigation of the effect of implementation intentions on behavioral flexibility in an outcome-revaluation paradigm failed to provide evidence for a shift from goal-directed toward habitual control.

The evidence for intact behavioral flexibility despite if–then planning contrasts with results from an earlier study with this paradigm (van Timmeren & de Wit, 2022). In that study, we found that implementation intentions impaired test-phase performance overall, but this did not lead to inflexibility as would be reflected by lower accuracy on incongruent trials specifically. This general impairment was most likely because of the fact that implementation intentions, by focusing attention on the S–R mappings,
blocked learning about the S-O contingencies. To prevent this from happening in the present study, we altered our paradigm to promote active S-O learning during the first training phase, before intentions were introduced. As a result, participants had already acquired high levels of S-O knowledge when they started using strategic planning. Integrating findings from both studies, it appears that when the agent has full knowledge of the (S-O) contingencies, implementation intentions do not impair flexibility. This finding is encouraging, because in most applied situations in real life, agents are perfectly aware of the three-term instrumental contingencies. Therefore, our results are in line with the idea of implementation intentions being “flexibly tenacious” (Legrand, Bieleke, Gollwitzer, & Mignon, 2017; Gollwitzer, Parks-Stamm, Jaudas, & Sheeran, 2008): People benefit from if–then planning when the situation specified in their plan is encountered (here in terms of higher accuracy and lower RTs during training), but are goal-directed in the sense that they only act on these planned S–R mappings when the signaled outcome is currently a goal.

Across intentions, however, we found that action slips toward devalued outcomes were associated with increased bilateral insula activity (both when compared with still-valuable and upvalued responses), replicating findings from the only study to date looking at neural activity during slips of action (Watson et al., 2018). The insula is a functionally heterogeneous region (Uddin, Nomi, Hébert-Seropian, Ghaziri, & Boucher, 2017), but the anterior part has been critically implicated in error and salience processing and response inhibition (Uddin, 2015; Chang, Yarkoni, Khaw, & Sanfey, 2013).
Specifically, previous work shows that failure to inhibit a learned response (on the stop-signal paradigm) is associated with bilateral insular activity (Ramautar, Slagter, Kok, & Ridderinkhof, 2006). In addition, when compared with upvalued responses, slips were associated with increased activity in the dorsal anterior cingulate cortex, the SMA, and parietal cortex, all part of the salience network (Seeley et al., 2007). Conversely, lower activity during slips was seen in the vMPFC, or medial OFC, when compared with responses for still-valuable outcomes. Previous outcome-devaluation studies suggest that activity in this region mediates goal-directed instrumental learning (de Wit et al., 2012; Valentin et al., 2007). A similar contrast, comparing devalued action slips with responses toward upvalued outcomes, showed lower activity in the lateral OFC and ACC/caudate nucleus head, regions that have also been implicated in goal-directed control (e.g., Watson et al., 2018; McNamee et al., 2015; de Wit et al., 2012). Overall, our results suggest that habitual slips of action arise as a consequence of lapses in goal-directed control (as reflected by decreased activity in these regions) rather than by increased activation of S–R habit regions (i.e., the dorsal striatum). Finally, the informal conjunction analysis of devalued slips (Figure 4B), controlling for differences in expected value and motor response, showed that the anterior insula was commonly activated across all contrasts, implicating it as a key region mediating habitual action slips.

A lack of reliable, positive markers of habits is an important issue in human habit research (Watson, O’Callaghan, Perkes, Bradfield, & Turner, 2022; Kruglanski & Szumowska, 2020; De Houwer, 2019; De Houwer, Tanaka, Moors, & Tibboel, 2018; Watson & de Wit, 2018).
In the context of the present study, it raises the question of whether habit strength independently contributes to stimulus-dependent, outcome-insensitive responding (i.e., slips of action). A recently published study with the SORT adds weight to this concern, as we showed there that extensive instrumental training failed to impair test performance (Watson, Gladwin, et al., 2022). The lack of reliable evidence for overtraining effects (see also de Wit et al., 2018) could mean different things, but our current findings may offer an interesting explanation. Specifically, we observed that when the planning manipulation was first introduced during training, not only did implementation intentions improve performance, but goal intentions also significantly impaired performance. This may indicate that participants’ spontaneous strategy up to that point had not been to form goal intentions, but instead to switch as soon as they could to the more efficient strategy of focusing on the S–R mappings. In other words, they may have spontaneously formed implementation intentions (Bieleke & Keller, 2021). Therefore, rather than improving their performance with the explicit implementation intention manipulation, we impaired it in the goal intention condition. Such an early shift to reliance on S–R associations (i.e., within 12 blocks of training) may explain why previous experimental studies failed to find evidence for overtraining, as their short training conditions may already have been sufficiently long to induce this, and beyond that early shift additional training may not have significantly enhanced the strength of those associations. This idea accords well with results from a study by Pool and colleagues (2022) who found that, following outcome devaluation on a free-operant task, already after moderate training (12 blocks), outcome-insensitive habitual responding was seen in the majority of participants.
Our findings further reinforce this interpretation by showing significant changes in neural activity over the course of this relatively short training, with activity of the anterior caudate (implicated in goal-directed learning) decreasing and of the dorsal striatum (implicated in habitual control) increasing. From our study it is unclear, however, how activity in these regions developed in the earliest stages of instrumental training, as that took place outside the scanner. Future research should determine how many behavioral repetitions it takes to permit this shift to an S–R strategy, by assessing the effect of a goal intention manipulation at different time points during training. Our hypothesis is that, at the start of training, this would not yet have a negative impact (relative to implementation intentions) but that it will after a few blocks.

In conclusion, we provide evidence for increased efficiency but preserved flexibility following strategic if–then planning. These behavioral findings were mirrored in our analyses of the underlying brain activity: Implementation intentions did not reduce the engagement of goal-directed control when goals changed, nor increase activity in habit regions. Therefore, our findings suggest that this strategic planning technique supports the implementation of a new target behavior while still allowing for flexible adjustment when goals change.

Reprint requests should be sent to Tim van Timmeren, Department of Social, Health and Organizational Psychology, Utrecht University, Heidelberglaan 1, Utrecht, The Netherlands, 3508 TC, or via e-mail: t.vantimmeren@uu.nl.
Data Availability Statement

Data to recreate the main behavioral analyses (with analysis pipeline and output) are available at OSF: https://www.doi.org/10.17605/OSF.IO/642QU. Whole-brain t-maps (without thresholding) of the main fMRI contrasts are available at https://neurovault.org/collections/13191/.

Author Contributions

Tim van Timmeren: Conceptualization; Data curation; Formal Analysis; Investigation; Methodology; Project administration; Visualization; Writing—Original draft; Writing—Review & editing. John O’Doherty: Conceptualization; Writing—Review & editing. Nadza Dzinalija: Investigation; Project administration; Writing—Review & editing. Sanne de Wit: Conceptualization; Funding Acquisition; Resources; Supervision; Writing—Original draft; Writing—Review & editing.

Funding Information

Tim van Timmeren, VIDI grant from Dutch Research Council (Nederlandse Organisatie voor Wetenschappelijk Onderzoek) (https://dx.doi.org/10.13039/501100001722), grant number: Van der Gaag Fund. Sanne de Wit, VIDI grant from Dutch Research Council (Nederlandse Organisatie voor Wetenschappelijk Onderzoek) (https://dx.doi.org/10.13039/501100003246), grant number: 016.145.382.

Diversity in Citation Practices

Retrospective analysis of the citations in every article published in this journal from 2010 to 2021 reveals a persistent pattern of gender imbalance: Although the proportions of authorship teams (categorized by estimated gender identification of first author/last author) publishing in the Journal of Cognitive Neuroscience (JoCN) during this period were M(an)/M = .407, W(oman)/M = .32, M/W = .115, and W/W = .159, the comparable proportions for the articles that these authorship teams cited were M/M = .549, W/M = .257, M/W = .109, and W/W = .085 (Postle and Fulvio, JoCN, 34:1, pp. 1–3).
Consequently, JoCN encourages all authors to consider gender balance explicitly when selecting which articles to cite and gives them the opportunity to report their article’s gender citation balance. The authors of this article report its proportions of citations by gender category to be as follows: M/M = .512, W/M = .256, M/W = .070, and W/W = .163.

REFERENCES

Adams, C. D., & Dickinson, A. (1981). Instrumental responding following reinforcer devaluation. Quarterly Journal of Experimental Psychology, Section B, 33, 109–121. https://doi.org/10.1080/14640748108400816
Balleine, B. W. (2019). The meaning of behavior: Discriminating reflex and volition in the brain. Neuron, 104, 47–62. https://doi.org/10.1016/j.neuron.2019.09.024, PubMed: 31600515
Balleine, B. W., & O’Doherty, J. P. (2010). Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48–69. https://doi.org/10.1038/npp.2009.131, PubMed: 19776734
Bieleke, M., & Keller, L. (2021). Individual differences in if–then planning: Insights from the development and application of the if–then planning scale (ITPS). Personality and Individual Differences, 170, 110500. https://doi.org/10.1016/j.paid.2020.110500
Brandstätter, V., Lengfelder, A., & Gollwitzer, P. M. (2001). Implementation intentions and efficient action initiation. Journal of Personality and Social Psychology, 81, 946–960. https://doi.org/10.1037/0022-3514.81.5.946, PubMed: 11708569
Chang, L. J., Yarkoni, T., Khaw, M. W., & Sanfey, A. G. (2013). Decoding the role of the insula in human cognition: Functional parcellation and large-scale reverse inference. Cerebral Cortex, 23, 739–749. https://doi.org/10.1093/cercor/bhs065, PubMed: 22437053
De Houwer, J. (2019). On how definitions of habits can complicate habit research. Frontiers in Psychology, 10, 2642. https://doi.org/10.3389/fpsyg.2019.02642, PubMed: 31849762
De Houwer, J., Tanaka, A., Moors, A., & Tibboel, H.
(2018). Kicking the habit: Why evidence for habits in humans might be overestimated. Motivation Science, 4, 50–59. https://doi.org/10.1037/mot0000065
Deichmann, R., Gottfried, J. A., Hutton, C., & Turner, R. (2003). Optimized EPI for fMRI studies of the orbitofrontal cortex. Neuroimage, 19, 430–441. https://doi.org/10.1016/S1053-8119(03)00073-9, PubMed: 12814592
Delorme, C., Salvador, A., Valabrègue, R., Roze, E., Palminteri, S., Vidailhet, M., et al. (2016). Enhanced habit formation in Gilles de la Tourette syndrome. Brain, 139, 605–615. https://doi.org/10.1093/brain/awv307, PubMed: 26490329
de Wit, S., Corlett, P. R., Aitken, M. R., Dickinson, A., & Fletcher, P. C. (2009). Differential engagement of the ventromedial prefrontal cortex by goal-directed and habitual behavior toward food pictures in humans. Journal of Neuroscience, 29, 11330–11338. https://doi.org/10.1523/JNEUROSCI.1639-09.2009, PubMed: 19741139
de Wit, S., Kindt, M., Knot, S. L., Verhoeven, A. A. C., Robbins, T. W., Gasull-Camos, J., et al. (2018). Shifting the balance between goals and habits: Five failures in experimental habit induction. Journal of Experimental Psychology: General, 147, 1043–1065. https://doi.org/10.1037/xge0000402, PubMed: 29975092
de Wit, S., Niry, D., Wariyar, R., Aitken, M. R. F., & Dickinson, A. (2007). Stimulus-outcome interactions during instrumental discrimination learning by rats and humans. Journal of Experimental Psychology: Animal Behavior Processes, 33, 1–11. https://doi.org/10.1037/0097-7403.33.1.1, PubMed: 17227190
de Wit, S., Watson, P., Harsay, H. A., Cohen, M. X., van de Vijver, I., & Ridderinkhof, K. R. (2012).
Corticostriatal connectivity underlies individual differences in the balance between habitual and goal-directed action control. Journal of Neuroscience, 32, 12066–12075. https://doi.org/10.1523/JNEUROSCI.1088-12.2012, PubMed: 22933790
Dickinson, A. (1985). Actions and habits: The development of behavioural autonomy. Philosophical Transactions of the Royal Society of London, Series B: Biological Sciences, 308, 67–78. https://doi.org/10.1098/rstb.1985.0010
Esteban, O., Birman, D., Schaer, M., Koyejo, O. O., Poldrack, R. A., & Gorgolewski, K. J. (2017). MRIQC: Advancing the automatic prediction of image quality in MRI from unseen sites. PLoS One, 12, e0184661. https://doi.org/10.1371/journal.pone.0184661, PubMed: 28945803
Esteban, O., Markiewicz, C. J., Blair, R. W., Moodie, C. A., Isik, A. I., Erramuzpe, A., et al. (2019). fMRIPrep: A robust preprocessing pipeline for functional MRI. Nature Methods, 16, 111–116. https://doi.org/10.1038/s41592-018-0235-4, PubMed: 30532080
Gardner, B., Abraham, C., Lally, P., & de Bruijn, G.-J. (2012). Towards parsimony in habit measurement: Testing the convergent and predictive validity of an automaticity subscale of the self-report habit index. International Journal of Behavioral Nutrition and Physical Activity, 9, 102. https://doi.org/10.1186/1479-5868-9-102, PubMed: 22935297
Gilbert, S. J., Gollwitzer, P. M., Cohen, A. L., Oettingen, G., & Burgess, P. W. (2009). Separable brain systems supporting cued versus self-initiated realization of delayed intentions. Journal of Experimental Psychology: Learning, Memory, and Cognition, 35, 905–915. https://doi.org/10.1037/a0015535, PubMed: 19586260
Gollwitzer, P. M. (1993). Goal achievement: The role of intentions. European Review of Social Psychology, 4, 141–185. https://doi.org/10.1080/14792779343000059
Gollwitzer, P. M. (1999). Implementation intentions: Strong effects of simple plans. American Psychologist, 54, 493–503. https://doi.org/10.1037/0003-066X.54.7.493
Gollwitzer, P. M.
(2014). Weakness of the will: Is a quick fix possible? Motivation and Emotion, 38, 305–322. https://doi.org/10.1007/s11031-014-9416-3
Gollwitzer, P. M., Parks-Stamm, E. J., Jaudas, A., & Sheeran, P. (2008). Flexible tenacity in goal pursuit. In J. Y. Shah & W. L. Gardner (Eds.), Handbook of motivation science (pp. 325–341). Guilford Press.
Gollwitzer, P. M., & Sheeran, P. (2006). Implementation intentions and goal achievement: A meta-analysis of effects and processes. Advances in Experimental Social Psychology, 38, 69–119. https://doi.org/10.1016/S0065-2601(06)38002-1
Gorgolewski, K., Burns, C. D., Madison, C., Clark, D., Halchenko, Y. O., Waskom, M. L., et al. (2011). Nipype: A flexible, lightweight and extensible neuroimaging data processing framework in Python. Frontiers in Neuroinformatics, 5, 13. https://doi.org/10.3389/fninf.2011.00013, PubMed: 21897815
Hardwick, R. M., Forrence, A. D., Krakauer, J. W., & Haith, A. M. (2019). Time-dependent competition between goal-directed and habitual response preparation. Nature Human Behaviour, 3, 1252–1262. https://doi.org/10.1038/s41562-019-0725-0, PubMed: 31570762
Horga, G., Maia, T. V., Marsh, R., Hao, X., Xu, D., Duan, Y., et al. (2015). Changes in corticostriatal connectivity during reinforcement learning in humans. Human Brain Mapping, 36, 793–803. https://doi.org/10.1002/hbm.22665, PubMed: 25393839
Jahanshahi, M., Obeso, I., Rothwell, J. C., & Obeso, J. A. (2015). A fronto-striato-subthalamic-pallidal network for goal-directed and habitual inhibition. Nature Reviews Neuroscience, 16, 719–732. https://doi.org/10.1038/nrn4038, PubMed: 26530468
JASP Team. (2018). JASP (version 0.8.6) [Computer Software]. Retrieved from https://jasp-stats.org/2018/02/28/now-jasp-0-8-6/.
Kruglanski, A. W., & Szumowska, E. (2020). Habitual behavior is goal-driven. Perspectives on Psychological Science, 15, 1256–1271. https://doi.org/10.1177/1745691620917676, PubMed: 32569529
Legrand, E., Bieleke, M., Gollwitzer, P. M., & Mignon, A.
(2017). Nothing will stop me? Flexibly tenacious goal striving with implementation intentions. Motivation Science, 3, 101–118. https://doi.org/10.1037/mot0000050 Liljeholm, M., Dunne, S., & O’Doherty, J. P. (2015). Differentiating neural systems mediating the acquisition vs. expression of goal-directed and habitual behavioral control. European Journal of Neuroscience, 41, 1358–1371. https:// doi.org/10.1111/ejn.12897, PubMed: 25892332 Liljeholm, M., Tricomi, E., O’Doherty, J. P., & Balleine, B. W. (2011). Neural correlates of instrumental contingency learning: Differential effects of action-reward conjunction and disjunction. Journal of Neuroscience, 31, 2474–2480. https://doi.org/10.1523/JNEUROSCI.3354-10.2011, PubMed: 21325514 McNamee, D., Liljeholm, M., Zika, O., & O’Doherty, J. P. (2015). Characterizing the associative content of brain structures involved in habitual and goal-directed actions in humans: A multivariate fMRI study. Journal of Neuroscience, 35, 3764–3771. https://doi.org/10.1523/JNEUROSCI.4677-14 .2015, PubMed: 25740507 Morris, R. W., Quail, S. L., Griffiths, K. R., Green, M. J., & Balleine, B. W. (2015). Corticostriatal control of goal-directed action is impaired in schizophrenia. Biological Psychiatry, 77, 187–195. https://doi.org/10.1016/j.biopsych.2014.06.005, PubMed: 25062683 Orbell, S., & Verplanken, B. (2010). The automatic component of habit in health behavior: Habit as cue-contingent automaticity. Health Psychology, 29, 374–383. https://doi.org /10.1037/a0019596, PubMed: 20658824 Parks-Stamm, E. J., Gollwitzer, P. M., & Oettingen, G. (2007). Action control by implementation intentions: Effective cue detection and efficient response initiation. Social Cognition, 25, 248–266. https://doi.org/10.1521/soco.2007 .25.2.248 Pool, E. R., Gera, R., Fransen, A., Perez, O. D., Cremer, A., Aleksic, M., et al. (2022). 
Determining the effects of training duration on the behavioral expression of habitual control in humans: A multilaboratory investigation. Learning & Memory, 29, 16–28. https://doi.org/10.1101/lm.053413.121, PubMed: 34911800 Ramautar, J. R., Slagter, H. A., Kok, A., & Ridderinkhof, K. R. (2006). Probability effects in the stop-signal paradigm: The insula and the significance of failed inhibition. Brain Research, 1105, 143–154. https://doi.org/10.1016/j.brainres .2006.02.091, PubMed: 16616048 Seeley, W. W., Menon, V., Schatzberg, A. F., Keller, J., Glover, G. H., Kenna, H., et al. (2007). Dissociable intrinsic connectivity networks for salience processing and executive control. Journal of Neuroscience, 27, 2349–2356. https://doi.org/10 .1523/JNEUROSCI.5587-06.2007, PubMed: 17329432 Sheeran, P., & Webb, T. L. (2016). The intention–behavior gap. Social and Personality Psychology Compass, 10, 503–518. https://doi.org/10.1111/spc3.12265 Tanaka, S. C., Balleine, B. W., & O’Doherty, J. P. (2008). Calculating consequences: Brain systems that encode the causal effects of actions. Journal of Neuroscience, 28, 974 Journal of Cognitive Neuroscience Volume 35, Number 6 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . e d u / j / o c n a r t i c e - p d l f / / / / 3 5 6 9 5 7 2 0 8 2 8 8 1 / j o c n _ a _ 0 1 9 9 0 p d . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 6750–6755. https://doi.org/10.1523/JNEUROSCI.1808-08 .2008, PubMed: 18579749 Thorndike, E. L. (1911). Animal intelligence: Experimental studies. The Macmillan Company. https://doi.org/10.5962/bhl .title.55072 Tricomi, E., Balleine, B. W., & O’Doherty, J. P. (2009). A specific role for posterior dorsolateral striatum in human habit learning. European Journal of Neuroscience, 29, 2225–2232. https://doi.org/10.1111/j.1460-9568.2009.06796.x, PubMed: 19490086 Tzourio-Mazoyer, N., Landeau, B., Papathanassiou, D., Crivello, F., Etard, O., Delcroix, N., et al. (2002). 
Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage, 15, 273–289. https://doi.org/10.1006/nimg.2001 .0978, PubMed: 11771995 Uddin, L. Q. (2015). Salience processing and insular cortical function and dysfunction. Nature Reviews Neuroscience, 16, 55–61. https://doi.org/10.1038/nrn3857, PubMed: 25406711 Uddin, L. Q., Nomi, J. S., Hébert-Seropian, B., Ghaziri, J., & Boucher, O. (2017). Structure and function of the human insula. Journal of Clinical Neurophysiology, 34, 300–306. https://doi.org/10.1097/ WNP.0000000000000377, PubMed: 28644199 Valentin, V. V., Dickinson, A., & O’Doherty, J. P. (2007). Determining the neural substrates of goal-directed learning in the human brain. Journal of Neuroscience, 27, 4019–4026. https://doi.org/10.1523/JNEUROSCI.0564-07.2007, PubMed: 17428979 van Timmeren, T., & de Wit, S. (2022). Instant habits versus flexible tenacity: Do implementation intentions accelerate habit formation? Quarterly Journal of Experimental Psychology. https://doi.org/10.1177/17470218221147024, PubMed: 36476147 Watson, P., & de Wit, S. (2018). Current limits of experimental research into habits and future directions. Current Opinion in Behavioral Sciences, 20, 33–39. https://doi.org/10.1016/j .cobeha.2017.09.012 Watson, P., Gladwin, T. E., Verhoeven, A. A. C., & de Wit, S. (2022). Investigating habits in humans with a symmetrical outcome-revaluation task. Behavior Research Methods. https://doi.org/10.3758/s13428-022-01922-4, PubMed: 35867208 Watson, P., O’Callaghan, C., Perkes, I., Bradfield, L., & Turner, K. (2022). Making habits measurable beyond what they are not: A focus on associative dual-process models. Neuroscience & Biobehavioral Reviews, 142, 104869. https://doi.org/10.1016/j .neubiorev.2022.104869, PubMed: 36108980 Watson, P., van Wingen, G., & de Wit, S. (2018). Conflicted between goal-directed and habitual control, an fMRI investigation. 
eNeuro, 5, ENEURO.0240-18.2018. https://doi .org/10.1523/ENEURO.0240-18.2018, PubMed: 30310863 Wetzels, R., Matzke, D., Lee, M. D., Rouder, J. N., Iverson, G. J., & Wagenmakers, E.-J. (2011). Statistical evidence in experimental psychology: An empirical comparison using 855 t tests. Perspectives on Psychological Science, 6, 291–298. https://doi.org/10.1177/1745691611406923, PubMed: 26168519 Wunderlich, K., Dayan, P., & Dolan, R. J. (2012). Mapping value based planning and extensively trained choice in the human brain. Nature Neuroscience, 15, 786–791. https://doi.org/10 .1038/nn.3068, PubMed: 22406551 Yin, H. H., Knowlton, B. J., & Balleine, B. W. (2004). Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. European Journal of Neuroscience, 19, 181–189. https://doi.org/10 .1111/j.1460-9568.2004.03095.x, PubMed: 14750976 Zwosta, K., Ruge, H., Goschke, T., & Wolfensteller, U. (2018). Habit strength is predicted by activity dynamics in goal-directed brain systems during training. Neuroimage, 165, 125–137. https://doi.org/10.1016/j.neuroimage.2017.09 .062, PubMed: 28970144 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . e d u / j / o c n a r t i c e - p d l f / / / / 3 5 6 9 5 7 2 0 8 2 8 8 1 / j o c n _ a _ 0 1 9 9 0 p d . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 van Timmeren et al. 975Can the Brain Strategically Go on Automatic Pilot? 图像