REPORT
The Development of Relational Reasoning:
An Eyetracking Analysis of Strategy Use
and Adaptation in Children and Adults
Performing Matrix Completion
Jesse Niebaum1 and Yuko Munakata2
un accès ouvert
journal
1Center for Mind and Brain, Université de Californie, Davis, Davis, Californie
2Département de psychologie, Université de Californie, Davis, Davis, Californie
Mots clés: relational reasoning, matrix completion, eyetracking, cognitive development
ABSTRAIT
Relational reasoning is a key component of fluid intelligence and an important predictor of
academic achievement. Relational reasoning is commonly assessed using matrix completion
tasks, in which participants see an incomplete matrix of items that vary on different dimensions
and select a response that best completes the matrix based on the relations among items.
Performance on such assessments increases dramatically across childhood into adulthood.
Cependant, despite widespread use, little is known about the strategies associated with good or
poor matrix completion performance in childhood. This study examined the strategies children
and adults use to solve matrix completion problems, how those strategies change with age,
and whether children and adults adapt strategies to difficulty. We used eyetracking to infer
matrix completion strategy use in 6- and 9-year-old children and adults. Across ages, scanning
across matrix rows and columns predicted good overall performance, and quicker and higher
rates of consulting potential answers predicted poor performance, indicating that optimal
matrix completion strategies are similar across development. Indices of good strategy use
increased across childhood. As problems increased in difficulty, children and adults increased
their scanning of matrix rows and columns, and adults and 9-year-olds also shifted strategies to
rely more on consulting potential answers. Adapting strategies to matrix difficulty, particularly
increased scanning of rows and columns, was associated with good overall performance in
both children and adults. These findings underscore the importance of both spontaneous and
adaptive strategy use in individual differences in relational reasoning and its development.
INTRODUCTION
Children’s ability to discover and utilize patterns between different objects and mental repre-
sentations, a key component of fluid intelligence known as relational or inductive reasoning,
improves dramatically across development (Crone et al., 2009; Ferrer et al., 2009; Handley
et coll., 2004; Richland et al., 2006; Siegler & Svetina, 2002) and is strongly associated with
academic success and other positive life outcomes (Green et al., 2017; Peng et al., 2019; Primi
et coll., 2010). Relational reasoning is commonly assessed with matrix completion tasks, dans
which a 3 × 3 matrix or other dimensional variant is presented with the bottom right entry
missing (Chiffre 1). Items within the matrix vary on different dimensions, such as increasing
Citation: Niebaum, J., & Munakata, Oui.
(2023). The Development of Relational
Reasoning: An Eyetracking Analysis
of Strategy Use and Adaptation in
Children and Adults Performing Matrix
Completion. Open Mind: Discoveries
in Cognitive Science, 7, 197–220.
https://doi.org/10.1162/opmi_a_00078
EST CE QUE JE:
https://doi.org/10.1162/opmi_a_00078
Supplemental Materials:
https://doi.org/10.1162/opmi_a_00078
Reçu: 24 Octobre 2022
Accepté: 24 Avril 2023
Intérêts concurrents: The authors
declare no conflict of interests.
Auteur correspondant:
Jesse Niebaum
jcniebaum@ucdavis.edu
droits d'auteur: © 2023
Massachusetts Institute of Technology
Publié sous Creative Commons
Attribution 4.0 International
(CC PAR 4.0) Licence
La presse du MIT
je
D
o
w
n
o
un
d
e
d
F
r
o
m
h
t
t
p
:
/
/
d
je
r
e
c
t
.
m
je
t
.
/
e
d
toi
o
p
m
je
/
je
un
r
t
je
c
e
–
p
d
F
/
d
o
je
/
je
/
/
.
1
0
1
1
6
2
o
p
m
_
un
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
un
_
0
0
0
7
8
p
d
/
.
je
F
b
oui
g
toi
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Chiffre 1. An example matrix problem superimposed with hypothetical fixation sequences to
demonstrate different strategic indices. The correct response is the top right option from the answer
array. Yellow: Examples of scanning across rows and columns (encoding and integration, lequel
supports constructive matching). Blue: Examples of consultations to the solution array (toggling,
which supports response elimination). The correct response is marked with a star, and responses
that could be eliminated with a novel feature (different shape) are marked with a diamond.
size or differing colors. Participants are instructed to select an item from an array of potential
solutions that best fulfills the relations within the matrix. Given the widespread use of matrix
completion tasks and their strong associations with other indices of intelligence, prior research
has sought to ascertain the strategies individuals use while performing matrix completion
tasks.
Eyetracking and self-report have been used to infer matrix completion strategies in adults
(Carpenter et al., 1990; Gonthier & Roulin, 2020; Hayes et al., 2011; Kucharský et al., 2020;
Rivollier et al., 2021; Vigneau et al., 2006). Two general strategies have been characterized
(Bethell-Fox et al., 1984; Snow, 1980): constructive matching, in which a participant generates
a predicted solution based on the relations encoded from the matrix and then searches the
solution array for an item matching that prediction, and response elimination, in which each
potential solution is evaluated in turn for its fit in the matrix. Constructive matching is charac-
terized by examining the rows and columns of a matrix to encode and integrate relations
before examining any potential solutions, whereas response elimination is characterized by
toggling between each potential solution and the matrix to decide whether a potential answer
is the correct missing item. Individuals systematically differ in their implementation of these
two strategies, and strategy use is a key determinant of matrix completion performance. Adults
who implement constructive matching perform better, whereas those who implement response
elimination perform poorly (Bethell-Fox et al., 1984; Gonthier & Roulin, 2020; Hayes et al.,
2011; Vigneau et al., 2006).
In matrix completion tasks, problems typically increase in difficulty as the task progresses,
often because the number of features and relations that must be encoded and integrated
increases, in addition to perceptual and relational complexity (Carpenter et al., 1990; Primi,
OPEN MIND: Discoveries in Cognitive Science
198
je
D
o
w
n
o
un
d
e
d
F
r
o
m
h
t
t
p
:
/
/
d
je
r
e
c
t
.
m
je
t
.
/
e
d
toi
o
p
m
je
/
je
un
r
t
je
c
e
–
p
d
F
/
d
o
je
/
je
/
.
/
1
0
1
1
6
2
o
p
m
_
un
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
un
_
0
0
0
7
8
p
d
.
/
je
F
b
oui
g
toi
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
2001; Vodegel Matzen et al., 1994). This variation in difficulty within the task suggests that
strategy implementation may shift within individuals across the task (Bethell-Fox et al.,
1984; Gonthier & Roulin, 2020; Jarosz et al., 2019; Perret & Dauvier, 2018). Because
constructive matching is comparatively more demanding on working memory than response
elimination (Snow, 1980), implementing constructive matching also becomes more demand-
ing as the task progresses. Adults with greater working memory capacity were more likely to
implement constructive matching (Gonthier & Thomassin, 2015), and analyses of strategy use
across the task found that adults were more likely to report shifting from constructive matching
to using less-demanding response elimination more often as problems became more difficult
(Gonthier & Roulin, 2020). Adults have also reported combining elements of both strategies,
an “isolate-and-eliminate” strategy, by encoding one relation and then eliminating potential
responses before returning to the matrix to encode more relations ( Jarosz et al., 2019;
Arendasy & Sommer, 2013). Par exemple, adults may turn to response elimination or combine
strategies on more difficult problems after failing to successfully generate a potential solution with
constructive matching. Ainsi, examining strategy adaptation is also crucial for understanding
performance on relational reasoning tasks and fluid intelligence more generally.
The first examination of children’s matrix completion strategies using eyetracking suggested
interesting commonalities with and divergences from adults. Like in adults, indices reflecting
constructive matching were associated with better performance: High-performing 5–6- and 7–
8-year-olds had more trials on which they scanned across a matrix row or column (Chen et al.,
2016). Older children also performed better and scanned rows and columns more than youn-
ger children. Cependant, high-performing 5–6-year-olds toggled their fixations between the
matrix and potential solutions more than low performers, indicative of response elimination,
and had similar numbers of toggles as the older children overall. In contrast, the number of
toggles did not distinguish high- from low-performing 7–8-year-olds. These results suggest that
with development, children may shift from relying on and benefitting from response elimina-
tion strategies to increasingly using constructive matching.
Although eyetracking studies have thus provided insights into developments in relational
reasoning, many questions remain. Par exemple, do younger children truly benefit from
response elimination strategies in a way that distinguishes them from older children and
adultes? Answering this question is important for evaluating whether response elimination is
uniquely adaptive in younger children, potentially due to limitations in cognitive processes
such as working memory and attentional control (Dauvier et al., 2014; Handley et al.,
2004; Kane & Engle, 2002). Cependant, whether younger children benefit from response elim-
ination is unclear because the number of toggles is biased by response time in children and
adultes, with longer response times predicting more toggles (Chen et al., 2016; Vigneau et al.,
2006). Autrement dit, the rate of toggling could decrease due to longer response times, alors que
the number of toggles still increases. Par conséquent, the apparent benefits from toggling for youn-
ger children may in fact reflect benefits from more time on task, benefits from implementing a
systematic strategy like response elimination over an erratic strategy, or better task comprehen-
sion. We address this issue in the current work by calculating a toggle rate, which adjusts for
the bias in response time and is a better predictor of matrix completion performance in adults
than the number of toggles (Hayes et al., 2011; Laurence et al., 2018; Vigneau et al., 2006), dans
addition to analyzing the number of toggles to replicate prior analyses (Chen et al., 2016).
Plus loin, low-performing 5–6-year-olds, who had low numbers of toggles, also performed at
chance levels, suggesting that the greater number of toggles observed in 7–8-year-olds and
high-performing 6-year-olds could reflect better task comprehension rather than more adaptive
strategy selection. We address this issue in the current work by replicating analyses with and
OPEN MIND: Discoveries in Cognitive Science
199
je
D
o
w
n
o
un
d
e
d
F
r
o
m
h
t
t
p
:
/
/
d
je
r
e
c
t
.
m
je
t
.
/
e
d
toi
o
p
m
je
/
je
un
r
t
je
c
e
–
p
d
F
/
d
o
je
/
je
.
/
/
1
0
1
1
6
2
o
p
m
_
un
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
un
_
0
0
0
7
8
p
d
.
/
je
F
b
oui
g
toi
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
without participants excluded based on performance relative to chance levels and by analyz-
ing the types of errors that children make.
Both strategy use and adaptation are crucial for understanding cognition across development. Dans
matrix completion tasks, young children commonly respond with duplicates of items in the matrix
problem, reflecting a bias towards perceptual similarity rather than relational encoding (Siegler &
Svetina, 2002). With age, children shift to extracting relational features across items, which leads to
improvements on matrix completion tasks and drives the overall development of relational reason-
ing (Gentner, 1988; Stevenson & Hickendorff, 2018). Young children, cependant, are capable of
relational reasoning: With extensive training and instruction, 4-year-old children can transition
from responding with duplicate items to responses that exhibit relational features, both on matrix
completion problems and other analogical reasoning tasks (Chen et al., 2016). Spontaneous
strategy implementation is directly linked with task performance and overall fluid intelligence
children and adults (Hayes et al., 2015; Nusbaum & Silvia, 2011; Steiner & Carr, 2003), and plays
a key role in learning across domains in childhood, including memory (Bjorklund et al., 1997;
Imbo & Vandierendonck, 2007), mathematics (Carr & Jessup, 1997; Jordan & Montani, 1997),
and reading (Guthrie et al., 2000; Paris & Oka, 1986). Children have also shown adaptability in
strategy use with increased knowledge and instruction (Chen et al., 2016; Siegler & Jenkins, 2014;
Stevenson & Hickendorff, 2018) and in response to difficulty across many cognitive domains
(Siegler, 1987). Fluid intelligence in children has positively correlated with benefits and perfor-
mance gains due to overt strategy interventions (Borkowski et al., 1987; Nusbaum & Silvia, 2011).
Whether children adapt matrix completion strategies to difficulty like adults and whether
strategy adaptation influences performance is less known. Prior work inferring children’s strat-
egies for matrix completion did not vary difficulty within the task, precluding analyses of strat-
egy adaption. Children aged 6 à 12 years have been shown to respond more slowly on more
difficult matrix completion problems, and this adaptive matrix study time predicted overall
performance across childhood (Perret & Dauvier, 2018). Cependant, allocating more time to
more difficult problems could arise from factors other than adaptations in strategy, tel que
better task understanding (par exemple., understanding that more difficult problems are unlikely to be
solved quickly) or higher motivation to perform well. Par exemple, adults who performed
poorly began to respond more quickly on trials beyond a certain level of difficulty, suggérant
that these adults gave up on solving more difficult problems (Gonthier & Roulin, 2020).
Because longer response times could also reflect greater motivation, more targeted analyses
are needed to test the role of strategy adaptations in relational reasoning.
The current study used eyetracking to assess developmental changes in the role of strategy
use and adaption in matrix completion performance across development (Eckstein et al., 2017).
To infer strategy use, we first examined the role of eyetracking indices of constructive matching
and response elimination strategies in explaining performance in children and adults. Nous
assessed toggle rate to adjust for differences in response time, allowing us to determine whether
response elimination is more beneficial for younger children than older children and adults. Dans
addition, we assessed the specificity of strategic indices for predicting accuracy at the trial level
in addition to general task performance. To assess adaptations in strategy use, we varied matrix
difficulty within the task. This novel procedure is a strength of our design because task experi-
ence is decoupled from matrix difficulty; thus, potential strategy learning across the task is not
aligned with problem difficulty, and the potential for decreased motivation with task progres-
sion due to anticipating increasingly difficult problems is attenuated. We examined whether
children and adults shifted strategies on more difficult matrix problems and assessed the role
of strategy adaptation in relational reasoning across development by testing whether strategy
adaption predicted matrix completion performance across children and adults.
OPEN MIND: Discoveries in Cognitive Science
200
je
D
o
w
n
o
un
d
e
d
F
r
o
m
h
t
t
p
:
/
/
d
je
r
e
c
t
.
m
je
t
.
/
e
d
toi
o
p
m
je
/
je
un
r
t
je
c
e
–
p
d
F
/
d
o
je
/
je
/
/
.
1
0
1
1
6
2
o
p
m
_
un
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
un
_
0
0
0
7
8
p
d
.
/
je
F
b
oui
g
toi
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
MÉTHODES
Participants
We assessed matrix completion performance in 6-year-olds (n = 38; M = 6.35 années (SD = 0.28),
range: 6.02–6.96, 23 female), 9-year-olds (n = 43; M = 9.74 années (SD = 0.25), range: 8.93–10.07
(2 exact age unknown), 25 female), and college-aged adults (n = 51; M = 19.68 années (SD = 2.05),
range: 17.90–30.72 (1 exact age unknown), 30 female). Eight additional 6-year-olds were
recruited but not included in the final sample: three quit during the matrix completion task, four
quit the study before the matrix completion task, and one had no valid eyetracking data. These
age groups were selected based on prior research showing dramatic improvements, high vari-
ability, and likely strategy changes in matrix completion performance at 6 years of age and from
6 à 9 years of age (Chen et al., 2016; Dauvier et al., 2014; Siegler & Svetina, 2002; Tunteler et al.,
2008; Tunteler & Reising, 2007); thus, we aimed to capture specific periods of performance
improvements across development. We recruited approximately 40 participants per group, lequel
is consistent with prior work in adults analyzing individual differences and exceeding analytic
group sizes in prior work in children (Chen et al., 2016; Hayes et al., 2011). Adults were recruited
to bridge indices of strategy use in exclusively child or adult samples and to examine whether
patterns of strategy adaptation were generally similar in children and adults.
Children were recruited from a database maintained at the University of Colorado Boulder.
Informed consent was obtained from a legal parent/guardian, and child verbal or written assent
was also obtained. Children received nominal monetary compensation for travel costs and a
moderate prize for participating. Adults were recruited from the Department of Psychology
and Neuroscience subject pool at the University of Colorado Boulder and received partial course
credit. Informed consent was obtained prior to participation. Most participants were Caucasian
and from middle to high socioeconomic backgrounds. Participants completed matrix comple-
tion within a battery of cognitive assessments, and all procedures were approved by the local
Comité d'examen institutionnel (Protocol 16-0543).
Matrix Completion
Adults completed 24 digitized matrix completion problems selected from the 36-problem
Raven’s Advanced Progressive Matrices (RAPM) assessment. These stimuli were obtained from
the RAPM set used in Hayes et al. (2011) (Bors & Stokes, 1998; Raven, 2000). Children com-
pleted 24 Raven’s-like problems derived from matrix generation software (Matzen et al.,
2010). The selected matrix problems included one, deux, or three relations, except for the final
problem, which included a logical relation (Supplementary Materials). These matrices were
designed to instantiate the row- and column-wise processing strategies that have predicted
adult performance in other matrix completion tasks and to systematically vary difficulty across
matrix problems. Typical relations within the matrix included increasing/decreasing size, tous
different/same shapes, increasing/decreasing number of items, etc.. All matrices used for child
participants except the final problem had been normed in prior work with 100% accuracy in
adultes (Matzen et al., 2010). We intentionally selected several items with only one relation and
with extremely high accuracy in adults due to prior work showing poor performance in young
children on 3 × 3 matrix problems (par exemple., Chen et al., 2016) and to systematically vary the num-
ber of relations across different matrix completion problems. Performance on a separate, plus
difficult subset of these generated matrices correlated highly with Raven’s Standard Progressive
Matrices in adults (Matzen et al., 2010). Child groups completed the same matrix task for
validity in assessing differences in strategy implementation across childhood. Adults com-
pleted different matrices than children to bridge comparisons between strategic indices used
OPEN MIND: Discoveries in Cognitive Science
201
je
D
o
w
n
o
un
d
e
d
F
r
o
m
h
t
t
p
:
/
/
d
je
r
e
c
t
.
m
je
t
.
/
e
d
toi
o
p
m
je
/
je
un
r
t
je
c
e
–
p
d
F
/
d
o
je
/
je
.
/
/
1
0
1
1
6
2
o
p
m
_
un
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
un
_
0
0
0
7
8
p
d
.
/
je
F
b
oui
g
toi
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
only in children in prior work but with a set of matrices commonly used in the eyetracking
literature in adults, potentially confirming similar patterns of strategy use and performance. Nous
do not make direct comparisons between the child and adult groups; instead, adults were used
to confirm benefits of similar qualitative strategy use, as assessed by indices previously used in
enfants, and to investigate strategy adaptation to difficulty in both child and adult popula-
tion. This procedure also ensured sufficient variation in performance for assessing strategy
adaptations to difficulty in adults.
All participants completed two practice items: one in which shapes were consistent within
columns but differed across rows, and one in which shape and color were consistent within
rows but differed across columns. Instructions and corrective feedback were given by the
experimenter, followed by a repeatable practice trial without instructions. The final practice
trial was repeated if participants selected the incorrect answer or needed additional practice
with spacebar presses or mouse navigation. Trials were initiated by successfully fixating on a
centralized cross for 500 ms or by an experimenter via keypress upon failing to detect fixation.
All participants were instructed to press the spacebar when they knew the correct answer.
Alors, the matrix disappeared, and only the solution array remained, mirroring prior testing
procedures in adults (Hayes et al., 2011, 2015). A cursor appeared in the center of the screen
for participants to click the correct answer. No feedback was provided after the initial instruc-
tion. Performance was assessed as the percentage of correct trials from the trials remaining
after data preprocessing. To increase variance in matrix completion performance, an addi-
tional index of performance, a matrix relation score, was created by inferring the number of
correct relations participants encoded from their responses. Par exemple, a participant could
select a response that contains 2 of the 3 necessary relations for the correct response; such a
response was given a higher score than a response containing 0 of the necessary relations. Ce
procedure has been used previously to increase the range of performance, thereby increasing
statistical power (Hayes et al., 2015). Details and analyses with the matrix relation score are
included in Supplementary Materials.
Matrices were presented in sets of eight with increasing anticipated matrix difficulty, en utilisant
either performance in prior samples for adults or the number of relations as a proxy for difficulty
in children (Carpenter et al., 1990). Ainsi, participants completed three sets of increasingly dif-
ficult matrices over the 24 matrix problems. The number of relations significantly correlated
with matrix accuracy in 9-year-olds (r = .58, p < .003) and marginally correlated in 6-year-olds
(r = .39, p < .068), indicating successful variation in matrix difficulty. For children, each set of
eight problems contained three matrices with one relation, three matrices with two relations,
and then two matrices with three relations, except for the final problem.
Participants were seated approximately 60 cm from the computer screen and underwent a
5-point calibration procedure prior to the session. Recalibration was performed as needed.
E-Prime 1.2 was used for task presentation (Psychology Software Tools Inc., Pittsburgh,
USA). Eyetracking data were captured with a Tobii X50 Eyetracker with 50 Hz sampling rate
using Clearview software (Tobii Technologies, Stockholm, Sweden). AOIs were drawn around
each item in the matrix (1–9) and the entire solution array (10). Response time was considered
total detected fixation time on the defined AOIs.
Data Preprocessing
Eyetracking data were pre-processed using the ‘gazepath’ package in R (van Renswoude et al.,
2018). This software parses raw eyetracking data into fixations and saccades using an adaptive
classification algorithm to calculate velocity thresholds within participants. This procedure is
OPEN MIND: Discoveries in Cognitive Science
202
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
.
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
designed to correct for individual differences in data quality. Thus, this processing method is
well suited for analyzing developmental samples, in which data quality could systematically
differ between age groups. Fixations were set to a minimum duration of 100 ms, and saccades
were removed prior to analyses. Full descriptions of eyetracking preprocessing and details on
missing fixation data are included in Supplementary Materials. In total, 4 trials from adults,
8 trials from 9-year-olds, and 43 trials from 6-year-olds were excluded due to poor data quality.
Most excluded trials in 6-year-olds were clustered within 5 participants, and all significant
correlations between strategy use and overall performance remained significant when exclud-
ing only these participants.
Only fixations detected while the matrix completion problem was presented were
analyzed—fixations while navigating the mouse to the solution array, i.e., after spacebar press,
were not assessed, as in prior work (Hayes et al., 2011, 2015). Detected fixations were plotted
on a generic matrix to correct for potential drift in calibration across trials. Trial-level correc-
tions to fixation data were made blind to participant performance, matrix difficulty, fixation
duration, and fixation sequence.
We also calculated the percentage of detected fixation time on a trial by dividing the
summed fixation time on AOIs by the full trial time. Thus, this metric includes saccades,
missing data, and fixation outside of the matrix problem as non-valid data. Expectedly, adults
had a lower percentage of missing fixation data (M = 24%) than 9-year-olds (M = 31%), who in
turn had a lower percentage of missing fixation data than 6-year-olds (M = 43%, all adjusted
p’s < .002). This metric was included as a covariate to determine whether age differences in
strategic indices were driven by systematic differences in available fixation data.
Strategic Indices from Eyetracking
We computed several different strategic indices derived to specifically capture constructive
matching and response elimination strategies because any given index of strategy use derived
from eyetracking often has poor to adequate reliability ( Vigneau et al., 2006). By including
several indices, we are able to make stronger overall inferences about strategy use, strategy
adaptation, and relationships with performance. Eyetracking indices draw upon prior work
in adults (Hayes et al., 2011; Vigneau et al., 2006) and children (Chen et al., 2016) to bridge
comparisons across the existing literature (Figure 1):
Encoding: A consecutive series of three fixations across each item in a matrix row or column at
any point during a trial was coded as a trial with encoding (Figure 1, yellow). This index
reflects constructive matching (Chen et al., 2016).
Integration: A consecutive series of three fixations across a matrix row and across a matrix
column at any point during a trial was coded as a trial with integration (Figure 1, yellow);
i.e., an instance of horizontal encoding and an instance of vertical encoding. This index
reflects constructive matching (Chen et al., 2016).
Number of Toggles: Total number of gaze transitions from the matrix to the response array or
vice-versa (Figure 1, blue). Although biased by response time, the number of toggles may
reflect response elimination (Chen et al., 2016; Vigneau et al., 2006).
Toggle Rate: Number of Toggles on a trial divided by the total time detected looking at the
matrix problem. This index reduces bias in toggle number due to longer individual
response times (correlation between response time and number of toggles: r = .85,
t (117) = 17.57, p < .001). Reported values are the number of detected toggles per second.
Higher values on this index reflect response elimination ( Vigneau et al., 2006).
OPEN MIND: Discoveries in Cognitive Science
203
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Time to First Toggle: The time prior to the first fixation on the response array. Longer times
reflect more constructive matching, whereas shorter times reflect response elimination
( Vigneau et al., 2006).
Proportion of Time on Matrix: The amount of time fixated on the matrix divided by the total
amount of time fixated on the matrix and the solution array. Higher proportions reflect
constructive matching, whereas lower proportions reflect response elimination ( Vigneau
et al., 2006).
Matrix Time Distribution Index: The proportion of time fixated on matrix items 1, 2, 4, and 5 rel-
ative to the time fixated on the matrix, minus the proportion of time fixated on matrix items
3, 6, 7, 8 and 9 relative to the time fixated on the matrix. Values near 0 indicate more even
looking time across the whole matrix, which could reflect more complete encoding of
matrix relations and thus better constructive matching. Lower values indicate more looking
time on the last row and column of the matrix, which could indicate less complete encod-
ing of relations and thus worse constructive matching (Vigneau et al., 2006).
Matrix Difficulty
Matrix difficulty was calculated by subtracting the mean percentage correct for each matrix prob-
lem within each age group from 100 (e.g., Perret & Dauvier, 2018). Thus, higher numbers indi-
cate more difficult problems. The matrix difficulty parameter strongly correlated with response
time in all age groups (adults: r = .86, t = 7.97, p < .001; 9yo: r = .82, t = 6.66, p < .001; 6yo:
r = .48, t = 2.59, p = .017), replicating prior work showing that children and adults take longer to
respond on more difficult problems (Gonthier & Roulin, 2020; Perret & Dauvier, 2018).
Statistical Analysis
All statistical analyses were conducted with R software (version 1.2.5042, R Core Team, 2020).
Multilevel models were conducted with the “lme4” package (Bates et al., 2007). Figures were
created with the “ggpubr” (Kassambara, 2020), “ggExtra” (Attali & Baker, 2019), “cowplot”
(Wilke, 2019), and “ggplot2” packages (Wickham, 2009), using color schemes detailed in Wong
(2011). Data, code, and materials are available on the project’s Open Science Framework page
(For peer review: https://osf.io/428fh/).
RESULTS
Descriptive statistics for performance, response time, and strategic eyetracking indices across
all matrix problems for the full sample are provided in Table 1. Additional descriptive statistics
for each variable and correlations between strategic indices are included in Supplementary
Materials. Although several 6-year-olds (n = 11) scored below chance (<12.5%), participants
with poor performance were retained in initial analyses to capture potential changes in strat-
egy use, as in Chen et al. (2016). Poor performance in a subset of 6-year-olds was expected,
given prior working showing that some 5–6-year-olds often systematically respond with
answers that duplicate features in the matrix (e.g., Siegler & Svetina, 2002) and that many
5–6-year-olds perform below chance (Chen et al., 2016; Stevenson & Hickendorff, 2018).
As expected, 9-year-olds scored significantly better than 6-year-olds (t(52.55) = 8.83, p <
.001). Child groups exhibited unequal variance in accuracy according to Levene’s test
(F(1,81) = 16.75, p < .001), indicating that 9-year-olds had significantly less variance in accu-
racy than 6-year-olds. Notably, this restricted range could attenuate correlations between
strategy use and performance in 9-year-olds, while the very low performance for some
6-year-olds could exaggerate correlations between strategy use and performance. Statistical
OPEN MIND: Discoveries in Cognitive Science
204
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Table 1. Overall Performance and Strategic Indices Across Age Groups
6-year-olds
Percent Correct*
Relational Score*
Mean
33.93% (25.62)
22.68 (8.67)
Response Time per Trial (in seconds)
7.63 (3.91)
Percentage of Trials with Encoding*
29.38% (27.36)
Percentage of Trials with Integration*
6.84% (10.38)
Number of Toggles per Trial
Toggle Rate (per second)*
Time to First Toggle*
2.68 (1.06)
0.47 (0.14)
2.20 (1.56)
Range
4%–90%
9.5–40
1.82–18.59
0%–88%
0%–46%
1.05–5.79
0.23–0.77
0.49–7.41
Proportion of Time on Matrix*
63.43% (12.51)
0.27%–0.86%
Matrix Time Distribution*
−0.43 (0.30)
−0.94–0.07
9-year-olds
Percent Correct*
Relational Score*
Response Time per Trial (in seconds)
Mean
74.70% (12.66)
36.64 (3.47)
8.89 (3.10)
Percentage of Trials with Encoding*
57.89% (18.56)
Percentage of Trials with Integration*
16.00% (10.09)
Number of Toggles per Trial
Toggle Rate (per second)*
Time to First Toggle*
Proportion of Time on Matrix*
Matrix Time Distribution*
Adults
Percent Correct
Relational Score
2.59 (0.69)
0.38 (0.11)
3.92 (1.75)
76.62% (5.63)
−0.17 (0.19)
Mean
51.16% (15.68)
36.7 (5.97)
Response Time per Trial (in seconds)
21.66 (7.90)
Percentage of Trials with Encoding
77.65% (20.34)
Percentage of Trials with Integration
35.71% (22.90)
Number of Toggles per Trial
Toggle Rate (per second)
Time to First Toggle
Proportion of Time on Matrix
Matrix Time Distribution
4.92 (1.63)
0.27 (0.08)
7.97 (3.70)
79.38% (5.29)
0.03 (0.22)
Range
42%–92%
27.5–41.00
4.05–16.32
26%–96%
0%–46%
1.52–5.33
0.22–0.73
1.33–9.78
64%–90%
−0.54–0.12
Range
17%–88%
20–49
6.36–45.56
21%–100%
0%–83%
1.71–10.29
0.14–0.48
2.07–17.61
63%–92%
−0.41–0.95
Skew
.34
.29
.68
0.76
2.06
.94
.2
1.47
−0.48
0.08
Skew
−0.55
−0.88
0.37
0.17
.58
1.43
1.11
1.09
0.02
−0.33
Skew
−0.24
−0.62
0.46
−1.00
0.03
0.74
0.79
0.50
−0.78
1.26
Kurtosis
−1.32
Reliability
.76
−1.35
−0.08
−0.96
4.5
.75
−1.03
1.94
0.33
−1.16
Kurtosis
−0.5
0.08
−0.28
−0.8
0.04
3.58
1.75
1.16
−0.14
−1.19
Kurtosis
−0.32
0.51
0.36
0.18
−1.09
0.83
−0.08
−0.15
1.86
3.98
.79
.81
.61
.80
.80
.27
.71
Reliability
.40
.77
.50
.39
.85
.95
.73
.57
Reliability
.57
.56
.52
.44
.69
.89
.41
.44
Data are presented as the mean (SD) or percentage of trial (SD). Reliability is the raw Cronbach’s alpha coefficients for all strategic indices and
task performance.
* Indicates significant differences between child groups ( p < .001). Differences between children and adults were not assessed.
OPEN MIND: Discoveries in Cognitive Science
205
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
.
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
differences between children and adults were not assessed because adults completed a differ-
ent set of matrix problems.
To preview the series of analyses: First, we tested whether the implementation of specific
strategies increases across childhood via eyetracking indices. Second, we tested the relation-
ship between strategic indices and overall performance, including the specificity of these indi-
ces in predicting trial accuracy. Third, we investigated whether age groups adapted strategy to
increasing difficulty and whether strategy adaptation (or persistence) predicted better overall
performance across age groups. This analytic strategy tests whether the strategies linked with
good overall performance are also better at the trial level and on more difficult problems.
Analyses of relationships between matrix completion strategy use and performance on
Analysis-Synthesis, a separate fluid intelligence task, are included in Supplementary Materials.
Differences in Strategic Indices Between Child Groups
We performed a univariate outlier analysis (>2.5 SDs from group mean) for each index and
removed these participants from each age group for the following analysis (5 adultes, 4 9-année-
olds, et 4 6-year-olds). Analyses with the full sample are included in Supplementary
Materials and qualitatively mirror the results reported below.
Strategies associated with constructive matching increased from 6- to 9-year-olds. Nine-
year-olds had significantly more trials with encoding (t = 6.42, p < .001) and integration (t =
6.05, p < .001) than 6-year-olds. Nine-year-olds had significantly longer times to first toggle to
the response array (t = 6.77, p < .001), spent more time fixating on the matrix relative to the
response array (t = 7.15, p < .001), and spent more time fixating on the initial rows and columns
of the matrix relative to the latter rows and columns (t = 5.10, p < .001) compared with 6-year-
olds. In contrast, the number of toggles, a metric of response elimination, was not different
between child groups (t = −1.02, p = .314); however, toggle rate, a measure of response elim-
ination that corrects for differences in response time, was significantly lower in 9-year-olds
than 6-year-olds (t = −4.27, p < .001). We reproduced these results including a covariate
indexing the percentage of available eyetracking data, and differences between child groups
remained significant for all strategic indices (Supplementary Materials), suggesting that these
results were not solely due to differences in data availability.
Indices of Constructive Matching Predict Good Performance Across Age
We next tested whether strategic indices were associated with performance. If the optimal
strategies change across development, number of toggles and toggle rate should positively cor-
relate with performance in 6-year-olds but negatively correlate with performance in adults.
Performance significantly positively correlated with the proportion of trials with encoding
and integration in 6-year-olds and adults (all p’s < .05), while weaker positive correlations
were observed in 9-year-olds ( p < .11). In contrast to prior work in children, the mean number
of toggles per problem was not associated with performance. However, toggle rate correlated
negatively with performance in adults and 6-year-olds ( p’s < .001), with a smaller negative
correlation in 9-year-olds ( p = .052). Time to first toggle and matrix time distribution positively
correlated with performance across age groups (all p’s < .05). Proportion matrix time positively
correlated with performance in adults and 6-year-olds, while weaker positive correlations
were observed in 9-year-olds. All correlations are reported in Table 2 and visualized in
Figure 2. Analyses using the matrix relation score, which increases the range of task perfor-
mance, generally strengthened correlations across age groups (Supplementary Materials).
These results are inconsistent with the hypothesis that response elimination is especially
OPEN MIND: Discoveries in Cognitive Science
206
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
O
P
E
N
M
N
D
I
:
i
D
s
c
o
v
e
r
i
e
s
i
n
C
o
g
n
i
t
i
v
e
S
c
e
n
c
e
i
Table 2.
Correlations Between Matrix Completion Performance and Eyetracking Indices of Strategy
6-year-olds
9-year-olds
Adults
Encoding
Integration
Toggle Number
Toggle Rate
r
.63
.63
.22
95% CI
[.38, .80]
t
4.64
p
<.001
[.38, .80]
4.62
<.001
[−.13, .52]
1.25
.220
−.67
[−.82, −.43]
−5.09
<.001
Time to First Toggle
Proportion Matrix Time
Matrix Time Distribution
.76
.43
.44
[.56, .87]
6.54
<.001
[.10, .67]
[.12, .68]
2.68
2.77
.012
.009
r
.26
.29
−.02
−.19
.35
.19
.35
95% CI
[−.06, .53]
[−.03, .56]
[−.33, .30]
[−.57, .002]
t
1.66
p
.105
1.85
.072
−0.10
−2.01
.921
.052
r
.41
.27
.18
95% CI
[.13, .62]
[−.02, .52]
[−.11, .45]
t
2.97
1.85
1.23
p
.005
.071
.226
−.56
[−.73, −.32]
−4.44
<.001
[.04, .60]
2.26
.030
[−.14, .47]
1.15
.257
[.04, .60]
2.27
.029
.49
.48
.35
[.24, .69]
3.76
<.001
[.22, .68]
3.63
<.001
[.06, .58]
2.44
.019
M
a
t
r
i
x
C
o
m
p
l
e
t
i
o
n
S
t
r
a
t
e
g
i
e
s
A
c
r
o
s
s
D
e
v
e
l
o
p
m
e
n
t
N
i
e
b
a
u
m
a
n
d
M
u
n
a
k
a
t
a
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
2
0
7
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Figure 2. Relationships between overall task performance (percentage of correct responses) and eyetracking indices of (A) encoding, (B) inte-
gration, (C) toggle rate (in toggles/second), (D) time to first toggle (in seconds), (E) proportion of time fixated on the matrix, and (F) matrix
distribution time. In general, constructive matching (indexed via encoding, integration, time to first toggle, proportional time on matrix, and matrix
distribution time) positively predicted performance, whereas response elimination (indexed via toggle rate) negatively predicted performance.
beneficial for younger children. Strategic indices of good performance were qualitatively
similar from childhood into adulthood: indices reflecting constructive matching were
associated with better performance, and indices reflecting response elimination were asso-
ciated with poor performance.
Given the high number of 6-year-olds with accuracy below chance (n = 11), we replicated
our analyses with these participants excluded. Our aim in this follow-up analysis was to deter-
mine whether the large correlations observed in 6-year-olds reflected divergences between
children who understood the matrix completion task and those who did not, rather than gen-
uine correlations between strategy use and task performance. We observed highly convergent
results with low-performing 6-year-olds excluded. Performance positively correlated with
the proportion of trials with encoding (r = .44, t = 2.25, p = .036), integration (r =. 43, t = 2.17,
p = .042), time to first toggle (r = .66, t = 4.00, p < .001), and matrix distribution time (r = .52,
t = 2.81, p = .010) and negatively correlated with toggle rate (r = −.65, t = −3.95, p < .001).
Proportion matrix time was not significantly correlated with performance (r = .19, t = 0.87,
p = .393). Further analysis of errors for 6-year-olds scoring below chance suggested that
these participants were not responding randomly; instead, these participants were more
likely to respond with a duplicate item and less likely to select a response that contained
a novel feature than expected by chance. Increased use of response elimination predicted
a greater likelihood of selecting a duplicate answer (Supplementary Materials).
Specificity of Strategic Indices for Predicting Trial Accuracy
Next, we tested the specificity of these strategic indices for predicting correct responses at the
trial level. We assessed relationships between strategic indices and trial accuracy by con-
ducting separate multilevel logistic regression models for each strategic index correlated with
aggregate task performance, with random intercepts for participants. Number of toggles was
excluded because the index was not related to overall performance. All models included the
matrix difficulty parameter as a covariate. Predictors of trial accuracy varied across age groups
(Table 3): Trial accuracy was predicted by encoding, lower toggle rate, longer time to first
OPEN MIND: Discoveries in Cognitive Science
208
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
.
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
O
P
E
N
M
N
D
I
:
i
D
s
c
o
v
e
r
i
e
s
i
n
C
o
g
n
i
t
i
v
e
S
c
e
n
c
e
i
Table 3.
Specificity of Strategic Indices for Predicting Trial Accuracy
Encoding
Integration
Toggle Rate
6-year-olds
9-year-olds
Adults
B
0.52
0.28
95% CI
[0.03, 1.00]
z
2.09
p
.037
[−0.48, 1.05]
0.73
.466
B
0.31
0.06
95% CI
[−0.09, 0.71]
z
1.50
p
.134
[−0.45, 0.57]
0.23
.818
B
0.28
0.13
95% CI
[−0.10, 0.66]
[−0.19, 0.45]
z
1.43
0.80
p
.153
.423
−1.07
[−1.91, −0.24]
−2.52
.012
−0.57
[−1.40, 0.26]
−1.35
.178
−2.09
[−3.24, −0.94]
−3.56
<.001
Time to First Toggle
Proportion Matrix Time
Matrix Time Distribution
0.14
1.12
0.30
[0.06, 0.22]
3.29
.001
[−0.03, 2.27]
1.91
.057
[−0.19, 0.80]
1.20
.229
0.04
0.70
0.43
[−0.02, 0.09]
1.38
.167
[−0.82, 2.23]
0.90
.369
[−0.08, 0.93]
1.66
.098
0.03
1.64
0.40
[0.01, 0.05]
[0.28, 3.01]
[−0.03, 0.83]
2.54
2.36
1.81
.011
.018
.070
M
a
t
r
i
x
C
o
m
p
l
e
t
i
o
n
S
t
r
a
t
e
g
i
e
s
A
c
r
o
s
s
D
e
v
e
l
o
p
m
e
n
t
N
i
e
b
a
u
m
a
n
d
M
u
n
a
k
a
t
a
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
2
0
9
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
toggle, and greater proportion of fixation time on the matrix in 6-year-olds and by lower toggle
rate and greater proportion of fixation time on the matrix in adults, with no significant predic-
tors in 9-year-olds. We conducted follow-up models with the full child sample, including
interactions between age group and eyetracking index, and found that all indices except
integration significantly predicted trial accuracy (Supplementary Materials). These findings
generally mirror the aggregate task results, in which increased use of constructive matching
was linked with increased probability of responding correctly across age groups. These results
indicate some potential for specific strategic indices, particularly encoding, toggle rate, and
greater proportion of fixation time on the matrix, for predicting correct responses at the trial level.
However, the lack of consistent correlations suggests that predicting trial-level accuracy remains
difficult with these somewhat coarse strategy indices. Some problems may not require systematic
strategies and instead rely only on pattern completion to derive the correct answer, which may
explain the lack of significant correlations in the 9-year-old group, who performed very well
overall. Adults completed problems from Advanced Progressive Matrices, which involved a
broader and more complex range of rules than the child matrices; some of these matrices
may require different and more complex strategies than those derived from eyetracking.
Strategy Adaptations with Increased Matrix Difficulty
To determine whether children and adults adapted strategy to matrix difficulty, we conducted
an item-level analysis in which each strategic index was averaged within trial across each age
group. Then, the mean of each strategic index on that trial was regressed onto matrix difficulty.
We include analysis of the number of toggles because this index is also informative for poten-
tial strategy changes in response to difficulty; utilization of pure constructive matching alone
would not lead to an increased number of toggles with increased difficulty, as only one toggle
to the response array would be necessary to locate the correct response after using construc-
tive matching. Increased response elimination could be reflected in an increased number of
toggles with increased difficulty. Thus, toggle rate could decrease due to longer response times
on more difficult trials, reflecting more constructive matching, while the number of toggles
may also increase, reflecting more response elimination.
All age groups exhibited evidence of shifts in strategy in accordance with matrix difficulty
(Table 4). In 6-year-olds, encoding and time to first toggle significantly increased with matrix
difficulty; toggle rate decreased with difficulty. In 9-year-olds, encoding, integration, and time
to first toggle, as well as the number of toggles, increased with matrix difficulty. In adults, inte-
gration, number of toggles, and time to first toggle increased with matrix difficulty, and toggle
rate decreased with difficulty. Thus, all age groups adapted their strategy to trial difficulty, gen-
erally showing increases in indices of constructive matching on more difficult trials. However,
adults and 9-year-olds also showed evidence of increased reliance on a hybrid strategy incor-
porating elements of response elimination with increased matrix difficulty, as the number of
toggles increased with matrix difficulty.
Adaptive Strategy Use Predicts Matrix Completion Performance
To test whether adaptive strategy use predicted matrix completion performance, we conducted
a series of multilevel models in which each strategic index on a trial was predicted by matrix
difficulty within each age group, with random slopes for participants. We then extracted the
random participant slopes as indices of adaptive strategy use. Values different from 0 indicate
greater adaptation to difficulty. For example, higher values in adaptive encoding indicate a
greater probability of encoding as matrix difficulty increases.
OPEN MIND: Discoveries in Cognitive Science
210
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
O
P
E
N
M
N
D
I
:
i
D
s
c
o
v
e
r
i
e
s
i
n
C
o
g
n
i
t
i
v
e
S
c
e
n
c
e
i
Encoding
Integration
Toggle Number
Toggle Rate
Table 4.
Correlations Between Matrix Difficulty and Eyetracking Indices of Strategy
6-year-olds
r
.56
.30
.21
95% CI
[.20, .79]
[−.11, .63]
[−.21, .57]
t
3.17
p
.004
1.50
.149
1.02
.319
r
.43
.72
.73
9-year-olds
95% CI
[.04, .71]
t
2.26
p
.034
[.45, .87]
4.89
<.001
[.47, .88]
5.04
<.001
Adults
r
.28
.57
.70
95% CI
[−.14, .61]
[.22, .79]
t
1.35
3.28
p
.192
.003
[.41, .86]
4.54
<.001
.138
.004
.084
.229
−.48
[−.74, −.09]
−2.55
.62
.24
.26
[.29, .82]
[−.18, .59]
[−.16, .60]
3.69
1.18
1.25
.018
.001
.250
.223
−.52
[−.77, −.15]
−2.88
.009
−.31
[−.64, .10]
−1.54
Time to First Toggle
Proportion Matrix Time
Matrix Distribution
.48
.21
.22
[.09, .74]
2.55
.018
[−.21, .56]
[−.21, .57]
1.00
.328
1.03
.313
.57
.36
.26
[.21, .79]
[−.05, .67]
[−.17, .60]
3.21
1.81
1.24
2
1
1
M
a
t
r
i
x
C
o
m
p
l
e
t
i
o
n
S
t
r
a
t
e
g
i
e
s
A
c
r
o
s
s
D
e
v
e
l
o
p
m
e
n
t
N
i
e
b
a
u
m
a
n
d
M
u
n
a
k
a
t
a
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
/
.
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
O
P
E
N
M
N
D
I
:
i
D
s
c
o
v
e
r
i
e
s
i
n
C
o
g
n
i
t
i
v
e
S
c
e
n
c
e
i
Table 5.
Correlations Between Performance and Adaptive Strategy Use
Encoding
Integration
Toggle Number
Toggle Rate
6-year-olds
9-year-olds
Adults
r
.68
.66
.23
95% CI
[.44, .83]
t
5.22
p
<.001
[.42, .82]
5.00
<.001
[−.12, .53]
1.35
.188
r
.31
.32
.20
95% CI
[−.00, .57]
t
2.02
p
.051
[.01, .58]
2.07
.045
[−.13, .48]
1.23
.228
r
.43
.27
.24
95% CI
[.16, .64]
[−.02, .52]
[−.05, .50]
t
3.15
1.85
1.70
p
.003
.071
.102
−.68
[−.83, −.44]
−5.21
<.001
−.33
[−.58, −.01]
−2.10
.042
−.57
[−.74, −.34]
−4.63
<.001
Time to First Toggle
Proportion Matrix Time
Matrix Distribution
.76
.45
.46
[.57, .88]
6.69
<.001
[.13, .68]
[.15, .69]
2.83
2.97
.008
.006
.40
.30
.50
[.10, .64]
2.66
.012
[−.02, .56]
1.89
.066
[.21, .70]
3.48
.001
.51
.49
.33
[.26, .70]
3.93
<.001
[.24, .69]
3.77
<.001
[.04, .56]
2.29
.027
M
a
t
r
i
x
C
o
m
p
l
e
t
i
o
n
S
t
r
a
t
e
g
i
e
s
A
c
r
o
s
s
D
e
v
e
l
o
p
m
e
n
t
N
i
e
b
a
u
m
a
n
d
M
u
n
a
k
a
t
a
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
/
.
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
2
1
2
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Figure 3. Relationships between overall task performance and strategy adaptations to difficulty for (A) encoding, (B) integration, and (C)
toggle rate (in seconds). Adaptive constructive matching (indexed via increases in encoding and integration with matrix difficulty) generally
positively predicted performance, whereas adaptive response elimination (indexed via increases in toggle rate with matrix difficulty) negatively
predicted performance.
Across age groups, accuracy generally positively correlated with a greater probability of
encoding ( p’s < .051) and integration ( p’s < .066) with increasing difficulty. Increases in toggle
rate correlated negatively with accuracy across all groups ( p’s < .05). Increases in the time to
first toggle ( p’s < .02), proportion of relative matrix time ( p’s < .066), and matrix distribution
time ( p’s < .05) generally positively correlated with performance across groups. The mean
number of toggles on matrix problems was not significantly associated with accuracy. All
correlations are reported in Table 5 and visualized in Figure 3. These results indicate that
individuals at all ages who were more likely to adapt strategy use to matrix difficulty were also
more likely to perform better overall. Increases in constructive matching on more difficult
problems generally predicted better performance.
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
/
.
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
DISCUSSION
Matrix completion is one of the most commonly used assessments of fluid intelligence across
the lifespan, and performance on matrix completion tasks predicts success in school and other
positive life outcomes. We sought to investigate the strategies that children and adults imple-
ment while performing matrix completion tasks using eyetracking and assessing how strategies
related to task performance. Implementation of constructive matching increased across child-
hood, and response elimination decreased across childhood. Constructive matching was asso-
ciated with better performance in both children and adults, whereas response elimination was
associated with poor performance. We did not find evidence that response elimination was
more beneficial for younger children than older children or adults. All age groups adapted
their strategies to matrix difficulty: Children and adults implemented constructive matching
more on more difficult problems, and older children and adults also were more likely to
use aspects of response elimination as problems become more difficult, suggesting that adults
and older children also implement different strategies if constructive matching fails to yield a
correct response. Strategy adaptation also predicted overall performance in both children and
adults: Increased use of constructive matching on more difficult problems predicted better
overall performance. We discuss these findings in turn, as well as their implications for
OPEN MIND: Discoveries in Cognitive Science
213
Matrix Completion Strategies Across Development Niebaum and Munakata
understanding individual differences and developmental changes in relational reasoning and
fluid intelligence more broadly.
What Drives Observed Changes in Strategy Use Across Childhood?
Children’s increased use of constructive matching with age is likely supported by correspond-
ing increases in working memory capacity (Gathercole et al., 2003). Increases in working
memory correlates with improvements in relational reasoning across childhood (Hornung
et al., 2011; Kail, 2007). In adults, higher working memory capacity correlates with better
spontaneous strategy use on matrix completion tasks, particularly greater use of constructive
matching (Gonthier & Roulin, 2020; Gonthier & Thomassin, 2015; Jarosz & Wiley, 2012;
Jastrzębski et al., 2018). Because constructive matching is more demanding on working
memory (Bethell-Fox et al., 1984), increases in capacity could decrease the demands associ-
ated with constructive matching, thereby making constructive matching less demanding for
children as they age.
Improvements in cognitive control with age likely also support children in their ability to
inhibit primary task goals (e.g., find the correct solution) to first complete subgoals (e.g.,
encode relations) (Engel de Abreu et al., 2010), which could also drive increased use of con-
structive matching. This explanation is consistent with analyses of the types of errors children
commit on matrix completion and other relational reasoning tasks. Young children often select
answers that are duplicates of items in the matrix or the relational items in analogical reason-
ing tasks (as also observed here; Supplementary Materials), whereas older children are more
likely to select partial relational matches or the correct answer (Chen et al., 2016; Glady et al.,
2017; Siegler & Svetina, 2002; Stevenson & Hickendorff, 2018). Thus, young children perform
poorly in systematic ways, selecting answers based on perceptual similarity instead of relation-
ships across items. Developmental transitions to systematically selecting partial relational
matches indicate that children progress but still fail to completely encode and integrate all
relationships, instead favoring a solution that may only satisfy the first identified relation
between items.
Improvements in children’s inhibitory control may help children avoid salient distractors
and focus on encoding all necessary relations to obtain a correct response (Richland &
Burchinal, 2013; Richland et al., 2006). Young children are less likely than adults to focus
on task subgoals in other types of relational reasoning tasks compared. For example, in a
typical A:B::C:? analogy task with eyetracking, 5- and 8-year-olds first focused on the C item
instead of encoding the A:B analogy (Thibaut & French, 2016). This fixation pattern was
associated with poor performance (Glady et al., 2017; Starr et al., 2018). In contrast, adults
were more likely to focus first on the A:B analogy prior to gazing at potential answers and
performed better than children (Starr et al., 2018; Vendetti et al., 2017).
Such changes in strategy could also be driven by developmental transitions in the temporal
dynamics of cognitive control. For example, at around 5 to 6 years, children transition from
implementing primarily reactive forms of control, in which control is recruited as needed in
the moment, to proactive control, in which control is recruited in anticipation of need and
sustained (Chatham et al., 2009; Gonthier et al., 2019; Lucenet & Blaye, 2014). Transitioning
from a more reactive strategy that focuses on salient stimuli for primary task goals, like poten-
tial answers, to more proactive strategies that prioritize encoding the initial relational informa-
tion, could support constructive matching. Here, 6-year-olds looked quickly to potential
answers, with most 6-year-olds fixating on the answer array only one or two seconds after
the matrix problem was shown. In prior work, showing the A:B relation first and encouraging
OPEN MIND: Discoveries in Cognitive Science
214
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
/
.
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
children to verbalize relationships, which may support proactive control (Doebel et al., 2018)
and encourage constructive matching, improved task performance (Glady et al., 2017). Five-
year-olds were also shown to be more likely than older children and adults to make errors in
an analogical reasoning task if more distracting potential answers were available, even if they
were able to successfully encode the A:B relationship, suggesting less ability to inhibit distrac-
tors (Glady et al., 2017). Low working memory capacity additionally makes inhibiting distrac-
tors among the response options more difficult ( Jarosz & Wiley, 2012), which may further
cause children to select duplicate response options. As children’s working memory capacity
increases with age, children are likely more capable of inhibiting more distracting responses to
focus on encoding relations. Our exploratory finding that indices of response elimination
predicted a greater likelihood of selecting a duplicate response further indicates that younger
children may be distracted by feature matches while frequently consulting potential answers.
What Drives Observed Associations Between Strategies and Performance?
While constructive matching increased and response elimination decreased across childhood,
all age groups showed an association between constructive matching and better performance.
This link is unlikely to reflect better task comprehension, given that consistent results were
observed when excluding 6-year-olds who performed below chance, and given that the same
pattern is observed across older age groups who likely understand the task. Using the rate of
toggling to index response elimination revealed a consistent link between poor performance
and response elimination and indicated that response elimination is not adaptive for younger
children, contrary to prior claims. Instead, the process of constructive matching likely causes
better performance. In prior work, young children performing above chance but receiving
feedback explicitly designed to encourage scanning rows and columns continued to show
improvements in task performance across the task (Chen et al., 2016; Parker et al., 1972).
In adults, manipulating matrix presentation by showing only single rows or columns to encour-
age constructive matching improved performance (Hayes, 2014). Training constructive match-
ing via strategy recommendations and by initially omitting the solution array also improved
adult performance (Gonthier & Thomassin, 2015; cf. Mitchum & Kelley, 2010).
Adaptations in Strategy Use and Links with Performance
Children and adults shifted strategy across the task. All age groups were more likely to use
constructive matching on more difficult problems, likely due to the increased number of rela-
tions that needed to be encoded and integrated, requiring more scans across rows and col-
umns. This finding may seem to contrast with earlier reports in adults finding decreased use
of constructive matching with difficulty (Gonthier & Roulin, 2020). We believe this divergence
could reflect the use of self-report to assess strategy use in prior work. For example, adults may
use constructive matching on more difficult matrix problems but report using it less than
response elimination on difficult trials if constructive matching fails because they devote rel-
atively less time to constructive matching than other strategies. Self-report could also be more
likely to capture the strategy that participants used to derive their answer. Thus, participants
could still remain more likely to implement constructive matching on more difficult trials while
reporting a broader array of strategies across the longer duration of difficult trials. We found
evidence that adults and 9-year-olds also increasingly implemented response elimination on
more difficult problems; these groups increased the number of toggles to the solution array as
difficulty increased. More difficult problems involved keeping in mind more relational fea-
tures, resulting in a need to more frequently consult the response array. This pattern could
reflect a hybrid strategy between response elimination and constructive matching, in which
OPEN MIND: Discoveries in Cognitive Science
215
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
.
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
specific relations were isolated via constructive matching and then used to eliminate specific
response options, consistent with and extending recent evidence in adults (Gonthier & Roulin,
2020; Jarosz et al., 2019). Such qualitative changes in strategy could also be driven by older
children and adults being less willing or less able to meet the increased demands of constructive
matching on difficult problems or failing to generate an appropriate answer with constructive
matching. Older children and adults likely increasingly use a hybrid strategy or response elimi-
nation if constructive matching fails (Arendasy & Sommer, 2013). Six-year-olds did not exhibit
strong evidence of shifts to response elimination strategies, likely because these children were
already implementing response elimination more often than older children. Young children
implementing constructive matching may also be less likely to switch to response elimination
as an alternative strategy after constructive matching fails, as young children may be less likely
to adopt different strategies than older children (Siegler & Svetina, 2006).
Adaptations in strategy use predicted matrix completion performance in both children and
adults. Increased use of constructive matching, specifically in indices of encoding and integra-
tion, with increasing difficulty predicted better overall performance across age groups,
whereas increases in indices reflecting response elimination, specifically toggle rate, predicted
worse performance in children and adults. Thus, better performance in matrix completion is
not solely due to selecting a more optimal strategy like constructive matching but also
increased use of this strategy on more difficult problems. Poor performers may lack the work-
ing memory capacity to continue implementing constructive matching on difficult problems,
leading to poorer performance, or lack the motivation to implement a more cognitively
demanding strategy on difficult problems. These findings also demonstrate the importance
of investigating variability in strategy use within individuals for understanding matrix comple-
tion performance, in both children and adults. Accuracy decreased not only with anticipated
trial difficulty but also with decreased use of more optimal strategies on difficult problems.
Further, it is unlikely that these relationships are due to fatigue or boredom or individual dif-
ferences in task learning as the task progressed; unlike many prior studies, in which item order
and difficulty are confounded, we included easy and difficult problems across the task, which
may also help increase the validity of our developmental findings (Sun et al., 2019).
Limitations and Future Directions
Eyetracking measure are informative for ascertaining strategy use in children and adults and do
well in explaining individual differences in performance; however, many questions and impor-
tant next steps remain. First, the indices used here are relatively coarse, and some indices do
not utilize all available fixation data. For example, successful integration relies not only on
encoding more than one relation within a matrix problem but also the ability to combine both
of these relations to select a response, which cannot be assured through only successive
fixations. We took a comprehensive analytic approach by examining relationships between
strategy and matrix completion performance at subject, problem, and trial levels to ensure
the robustness of our results and general conclusions. Nonetheless, replicability could be lim-
ited by the poor to adequate reliability of many of the variables derived from eyetracking, by
idiosyncrasies with specific matrix completion problems, and because some of our conclu-
sions are based on zero-order correlations without correction for multiple comparisons,
although results across analyses converged across analytic approaches. Future work should
continue to include a variety of different indices of strategies, such as eyetracking combined
with self-report, and different types of problems with different anticipated difficulties. For
example, given the overall good performance of 9-year-olds, the low incidence of encoding
and integration overall compared with the total number of problems, and the increase in
OPEN MIND: Discoveries in Cognitive Science
216
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
.
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
indices like encoding and integration with problem difficulty, these indices may not best cap-
ture strategy use for easier problems in older age children.
Further, younger children were missing a greater percentage of valid eyetracking data; this
could suggest that younger children were processing the problem differently or more intermit-
tently engaged with the problem than older children and adults, which could influence indices
of strategy derived from eyetracking. Alternatively, because we did not utilize a headrest, this
may simply indicate that younger children moved more while solving the problem, resulting in
lapses of valid eyetracking data. Because we compared 6-year-olds, some of whom did not
fully comprehend the task, with 9-year-olds, who performed well overall, we cannot directly
compare whether constructive matching was more or less beneficial for performance across
child ages. Further, we cannot make direct comparisons on the benefits of strategy use for
performance with adults because this group performed a different matrix completion task.
Because of these design decisions, we can only infer that constructive matching is beneficial
for matrix completion performance across development.
Cluster analyses and analyses applying reinforcement learning algorithms to fixation
sequences, as well as self-reported strategy use, have also shown promise in explaining matrix
completion performance in adults (Gonthier & Thomassin, 2015; Hayes et al., 2011;
Kucharský et al., 2020). Such measures may be valuable to explore across development.
Second, although we observed consistent associations between strategy use and performance
across development, longitudinal studies will be informative for answering causal questions
about strategy change, including generalizability to improvements on other cognitive assess-
ments and real-world outcomes. For example, shifts in strategy could occur concomitantly
with specific improvements in related cognitive processes like working memory and cognitive
control and improvements in academic domains. These associations between strategy use and
potentially relevant factors such as proactive control and motivation, particularly in develop-
ment, need testing. Lastly, our sample was recruited from a primarily affluent area and
included only college-attendees in the adult sample, who may have greater familiarity with
these types of cognitive assessments, limiting potential generalizability to other populations
and across time (Brouwers et al., 2009). Such explanations may explain why strategic indices
from eyetracking sometimes generalize poorly across different adult samples and different
matrix completion problems (Hayes et al., 2011).
Conclusion
Our results suggest a systematic relationship between strategy use and performance on matrix
completion that persists across development. Individuals may perform poorly on matrix com-
pletion tasks due to poor initial strategy selection or because they do not adapt their strategy to
the particular demands of a matrix problem. Strategy selection and adaptation may thus be
central to the development of fluid intelligence and individual differences in fluid intelligence,
such that understanding the factors that support strategy selection and adaptation may be more
informative than tracking changes in task performance.
ACKNOWLEDGMENTS
The authors thank Alexandra Alfaro, Sarah Dinegar, Hayden Morano, Jennifer Felker, Grace
Dostart, Rich Cheng, Kelsey Mills, and Sarah Broadbent for help in participant recruitment and
data collection, Matias Lopez-Rosenfeld and William Chapman for early assistance in data
wrangling, Corentin Gonthier, Taylor Hayes, and Linda Matzen for advice and materials,
and Tim Curran, Kristin Lagattuta, Randall O’Reilly, Hilary Traut, and members of the
OPEN MIND: Discoveries in Cognitive Science
217
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Cognitive Development Center at CU Boulder and Cognition in Context Lab and Research in
Social Cognition group at UC-Davis for helpful discussions.
AUTHOR CONTRIBUTIONS
Jesse Niebaum: Conceptualization; Data curation; Formal analysis; Funding acquisition; Inves-
tigation; Methodology; Project administration; Supervision; Visualization; Writing—Original
draft; Writing—Review & editing. Yuko Munakata: Conceptualization; Methodology; Project
administration; Resources; Writing—Review & editing.
DATA AVAILABILITY
Materials, data, and analysis scripts for this manuscript are available on the project’s Open
Science Framework repository (https://osf.io/428fh/).
FUNDING INFORMATION
J. C. N. is supported by a National Science Foundation Graduate Research Fellowship (grant
No. 1650042).
REFERENCES
Arendasy, M. E., & Sommer, M. (2013). Reducing response elimi-
nation strategies enhances the construct validity of figural matri-
ces. Intelligence, 41(4), 234–243. https://doi.org/10.1016/j.intell
.2013.03.006
Attali, D., & Baker, C. (2019). ggExtra: Add marginal histograms to
“ggplot2”, and more “ggplot2” enhancements (version 0.9).
https://CRAN.R-project.org/package=ggExtra
Bates, D., Sarkar, D., Bates, M. D., & Matrix, L. (2007). The lme4
package. R Package Version, 2(1), 74.
Bethell-Fox, C. E., Lohman, D. F., & Snow, R. E. (1984). Adaptive
reasoning: Componential and eye movement analysis of geomet-
ric analogy performance. Intelligence, 8(3), 205–238. https://doi
.org/10.1016/0160-2896(84)90009-6
Borkowski, J. G., Carr, M., & Pressley, M. (1987). “Spontaneous”
strategy use: Perspectives from metacognitive theory. Intelligence,
11(1), 61–75. https://doi.org/10.1016/0160-2896(87)90027-4
Bjorklund, D. F., Miller, P. H., Coyle, T. R., & Slawinski, J. L. (1997).
Instructing children to use memory strategies: Evidence of utiliza-
tion deficiencies in memory training studies. Developmental
Review, 17(4), 411–441. https://doi.org/10.1006/drev.1997.0440
Bors, D. A., & Stokes, T. L. (1998). Raven’s Advanced Progres-
sive Matrices: Norms for first-year university students and the
development of a short form. Educational and Psychological
Measurement, 58(3), 382–398. https://doi.org/10.1177
/0013164498058003002
Brouwers, S. A., Van de Vijver, F. J. R., & Van Hemert, D. A. (2009).
Variation in Raven’s progressive matrices scores across time and
place. Learning and Individual Differences, 19(3), 330–338.
https://doi.org/10.1016/j.lindif.2008.10.006
Carpenter, P. A., Just, M. A., & Shell, P. (1990). What one intelli-
gence test measures: A theoretical account of the processing in
the Raven progressive matrices test. Psychological Review, 97(3),
404–431. https://doi.org/10.1037/0033-295X.97.3.404,
PubMed: 2381998
Carr, M., & Jessup, D. L. (1997). Gender differences in first-grade
mathematics strategy use: Social and metacognitive influences.
Journal of Educational Psychology, 89(2), 318–328. https://doi
.org/10.1037/0022-0663.89.2.318
Chatham, C. H., Frank, M. J., & Munakata, Y. (2009). Pupillometric
and behavioral markers of a developmental shift in the temporal
dynamics of cognitive control. Proceedings of the National Acad-
emy of Sciences, 106(14), 5529–5533. https://doi.org/10.1073
/pnas.0810002106, PubMed: 19321427
Chen, Z., Honomichl, R., Kennedy, D., & Tan, E. (2016). Aiming to
complete the matrix: Eye-movement analysis of processing strat-
egies in children’s relational thinking. Developmental Psychol-
ogy, 52(6), 867–878. https://doi.org/10.1037/dev0000113,
PubMed: 27148778
Crone, E. A., Wendelken, C., Van Leijenhorst, L., Honomichl, R. D.,
Christoff, K., & Bunge, S. A. (2009). Neurocognitive development
of relational reasoning. Developmental Science, 12(1), 55–66.
https://doi.org/10.1111/j.1467-7687.2008.00743.x, PubMed:
19120413
Dauvier, B., Bailleux, C., & Perret, P. (2014). The development of
relational integration during childhood. Developmental Psychol-
ogy, 50(6), 1687–1697. https://doi.org/10.1037/a0036655,
PubMed: 24749552
Doebel, S., Dickerson, J. P., Hoover, J. D., & Munakata, Y. (2018).
Using language to get ready: Familiar labels help children
engage proactive control. Journal of Experimental Child Psychol-
ogy, 166, 147–159. https://doi.org/10.1016/j.jecp.2017.08.006,
PubMed: 28898678
Eckstein, M. K., Guerra-Carrillo, B., Singley, A. T. M., & Bunge,
S. A. (2017). Beyond eye gaze: What else can eyetracking reveal
about cognition and cognitive development? Developmental
Cognitive Neuroscience, 25, 69–91. https://doi.org/10.1016/j
.dcn.2016.11.001, PubMed: 27908561
Engel de Abreu, P. M. J., Conway, A. R. A., & Gathercole, S. E.
(2010). Working memory and fluid intelligence in young chil-
dren. Intelligence, 38(6), 552–561. https://doi.org/10.1016/j
.intell.2010.07.003
Ferrer, E., O’Hare, E. D., & Bunge, S. A. (2009). Fluid reasoning and
the developing brain. Frontiers in Neuroscience, 3(1), 46–51.
https://doi.org/10.3389/neuro.01.003.2009, PubMed: 19753096
Gathercole, S. E., Brown, L., & Pickering, S. J. (2003). Working
memory assessments at school entry as longitudinal predictors
OPEN MIND: Discoveries in Cognitive Science
218
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
of National Curriculum attainment levels. Educational and Child
Psychology, 20(3), 109–122. https://doi.org/10.53841/ bpsecp
.2003.20.3.109
Gentner, D. (1988). Metaphor as structure mapping: The relational
shift. Child Development, 59(1), 47–59. https://doi.org/10.2307
/1130388
Glady, Y., French, R. M., & Thibaut, J. P. (2017). Children’s failure in
analogical reasoning tasks: a problem of focus of attention and
information integration? Frontiers in Psychology, 8, Article 707.
https://doi.org/10.3389/fpsyg.2017.00707, PubMed: 28588516
Gonthier, C., & Roulin, J.-L. (2020). Intraindividual strategy shifts in
Raven’s matrices, and their dependence on working memory
capacity and need for cognition. Journal of Experimental Psy-
chology: General, 149(3), 564–579. https://doi.org/10.1037
/xge0000660, PubMed: 31318256
Gonthier, C., & Thomassin, N. (2015). Strategy use fully mediates
the relationship between working memory capacity and perfor-
mance on Raven’s matrices. Journal of Experimental Psychology:
General, 144(5), 916–924. https://doi.org/10.1037/xge0000101,
PubMed: 26413890
Gonthier, C., Zira, M., Colé, P., & Blaye, A. (2019). Evidencing the
developmental shift from reactive to proactive control in early
childhood and its relationship to working memory. Journal of
Experimental Child Psychology, 177, 1–16. https://doi.org/10
.1016/j.jecp.2018.07.001, PubMed: 30165288
Green, C. T., Bunge, S. A., Chiongbian, V. B., Barrow, M., & Ferrer,
E. (2017). Fluid reasoning predicts future mathematical perfor-
mance among children and adolescents. Journal of Experimental
Child Psychology, 157, 125–143. https://doi.org/10.1016/j.jecp
.2016.12.005, PubMed: 28152390
Guthrie, J. T., Wigfield, A., & VonSecker, C. (2000). Effects of inte-
grated instruction on motivation and strategy use in reading. Jour-
nal of Educational Psychology, 92(2), 331–341. https://doi.org/10
.1037/0022-0663.92.2.331
Handley, S. J., Capon, A., Beveridge, M., Dennis, I., & Evans, J. S. B.
(2004). Working memory, inhibitory control and the develop-
ment of children’s reasoning. Thinking & Reasoning, 10(2),
175–195. https://doi.org/10.1080/13546780442000051
Hayes, T. R. (2014). Mechanisms of visual relational reasoning
[Doctoral dissertation]. The Ohio State University.
Hayes, T. R., Petrov, A. A., & Sederberg, P. B. (2011). A novel method
for analyzing sequential eye movements reveals strategic influence
on Raven’s Advanced Progressive Matrices. Journal of Vision,
11(10), 10. https://doi.org/10.1167/11.10.10, PubMed: 21926182
Hayes, T. R., Petrov, A. A., & Sederberg, P. B. (2015). Do we really
become smarter when our fluid-intelligence test scores improve?.
Intelligence, 48, 1–14. https://doi.org/10.1016/j.intell.2014.10
.005, PubMed: 25395695
Hornung, C., Brunner, M., Reuter, R. A., & Martin, R. (2011). Chil-
dren’s working memory: Its structure and relationship to fluid
intelligence. Intelligence, 39(4), 210–221. https://doi.org/10
.1016/j.intell.2011.03.002
Imbo, I., & Vandierendonck, A. (2007). The development of strat-
egy use in elementary school children: Working memory and
individual differences. Journal of Experimental Child Psychology,
96(4), 284–309. https://doi.org/10.1016/j.jecp.2006.09.001,
PubMed: 17046017
Jarosz, A. F., Raden, M. J., & Wiley, J. (2019). Working memory
capacity and strategy use on the RAPM. Intelligence, 77, Article
101387. https://doi.org/10.1016/j.intell.2019.101387
Jarosz, A. F., & Wiley, J. (2012). Why does working memory capacity
predict RAPM performance? A possible role of distraction. Intelli-
gence, 40(5), 427–438. https://doi.org/10.1016/j.intell.2012.06.001
Jastrzębski, J., Ciechanowska, I., & Chuderski, A. (2018). The strong
link between fluid intelligence and working memory cannot be
explained away by strategy use. Intelligence, 66, 44–53. https://
doi.org/10.1016/j.intell.2017.11.002
Jordan, N. C., & Montani, T. O. (1997). Cognitive arithmetic and
problem solving: A comparison of children with specific and
general mathematics difficulties. Journal of Learning Disabilities,
30(6), 624–634. https://doi.org/10.1177/002221949703000606,
PubMed: 9364900
Kail, R. V. (2007). Longitudinal evidence that increases in process-
ing speed and working memory enhance children’s reasoning.
Psychological Science, 18(4), 312–313. https://doi.org/10.1111/j
.1467-9280.2007.01895.x, PubMed: 17470254
Kane, M. J., & Engle, R. W. (2002). The role of prefrontal cortex in
working-memory capacity, executive attention, and general fluid
intelligence: An individual-differences perspective. Psychonomic
Bulletin & Review, 9(4), 637–671. https://doi.org/10.3758
/BF03196323, PubMed: 12613671
Kassambara, A. (2020). ggpubr: ‘ggplot2’ based publication ready
plots (R package version 0.1). https://CRAN.R-project.org
/package=ggpubr
Kucharský, Š., Visser, I., Truțescu, G.-O., Laurence, P. G.,
Zaharieva, M., & Raijmakers, M. E. J. (2020). Cognitive strategies
revealed by clustering eye movement transitions. Journal of Eye
Movement Research, 13(1). https://doi.org/10.16910/jemr.13.1.1,
PubMed: 33828785
Laurence, P. G., Mecca, T. P., Serpa, A., Martin, R., & Macedo, E. C.
(2018). Eye movements and cognitive strategy in a fluid intelligence
test: Item type analysis. Frontiers in Psychology, 9, Article 380.
https://doi.org/10.3389/fpsyg.2018.00380, PubMed: 29619002
Lucenet, J., & Blaye, A. (2014). Age-related changes in the temporal
dynamics of executive control: A study in 5- and 6-year-old chil-
dren. Frontiers in Psychology, 5, Article 831. https://doi.org/10
.3389/fpsyg.2014.00831, PubMed: 25120523
Matzen, L. E., Benz, Z. O., Dixon, K. R., Posey, J., Kroger, J. K., &
Speed, A. E. (2010). Recreating Raven’s: Software for systematically
generating large numbers of Raven-like matrix problems with
normed properties. Behavior Research Methods, 42(2), 525–541.
https://doi.org/10.3758/BRM.42.2.525, PubMed: 20479184
Mitchum, A. L., & Kelley, C. M. (2010). Solve the problem first:
Constructive solution strategies can influence the accuracy of
retrospective confidence judgments. Journal of Experimental
Psychology: Learning, Memory, and Cognition, 36(3), 699–710.
https://doi.org/10.1037/a0019182, PubMed: 20438267
Nusbaum, E. C., & Silvia, P. J. (2011). Are intelligence and creativity
really so different? Fluid intelligence, executive processes, and
strategy use in divergent thinking. Intelligence, 39(1), 36–45.
https://doi.org/10.1016/j.intell.2010.11.002
Paris, S. G., & Oka, E. R. (1986). Children’s reading strategies,
metacognition, and motivation. Developmental Review, 6(1),
25–56. https://doi.org/10.1016/0273-2297(86)90002-X
Parker, R. K., Sperr, S. J., & Rieff, M. L. (1972). Multiple classifi-
cation: A training approach. Developmental Psychology, 7(2),
188–194. https://doi.org/10.1037/h0033017
Peng, P., Wang, T., Wang, C., & Lin, X. (2019). A meta-analysis on
the relation between fluid intelligence and reading/mathematics:
Effects of tasks, age, and social economics status. Psychological
Bulletin, 145(2), 189–236. https://doi.org/10.1037/bul0000182,
PubMed: 30652909
Perret, P., & Dauvier, B. (2018). Children’s allocation of study time
during the solution of Raven’s Progressive Matrices. Journal of
I n t e l l i g e n c e , 6 ( 1 ) , A r t i c l e 9 . h t t p s : / / d o i . o r g / 1 0 . 3 3 9 0
/jintelligence6010009, PubMed: 31162436
OPEN MIND: Discoveries in Cognitive Science
219
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
/
/
.
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
/
.
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3
Matrix Completion Strategies Across Development Niebaum and Munakata
Primi, R. (2001). Complexity of geometric inductive reasoning
tasks: Contribution to the understanding of fluid intelligence.
Intelligence, 30(1), 41–70. https://doi.org/10.1016/S0160
-2896(01)00067-8
Primi, R., Ferrão, M. E., & Almeida, L. S. (2010). Fluid intelligence
as a predictor of learning: A longitudinal multilevel approach
applied to math. Learning and Individual Differences, 20(5),
446–451. https://doi.org/10.1016/j.lindif.2010.05.001
R Core Team. (2020). R: A language and environment for statistical
computing (Version 4.0.2). R Foundation for Statistical Computing.
Raven, J. (2000). The Raven’s Progressive Matrices: Change and sta-
bility over culture and time. Cognitive Psychology, 41(1), 1–48.
https://doi.org/10.1006/cogp.1999.0735, PubMed: 10945921
Richland, L. E., & Burchinal, M. R. (2013). Early executive function
predicts reasoning development. Psychological Science, 24(1),
87–92. https://doi.org/10.1177/0956797612450883, PubMed:
23184588
Richland, L. E., Morrison, R. G., & Holyoak, K. J. (2006). Children’s
development of analogical reasoning: Insights from scene anal-
ogy problems. Journal of Experimental child Psychology, 94(3),
249–273. https://doi.org/10.1016/j.jecp.2006.02.002, PubMed:
16620867
Rivollier, G., Quinton, J.-C., Gonthier, C., & Smeding, A. (2021).
Looking with the (computer) mouse: How to unveil problem-
solving strategies in matrix reasoning without eye-tracking.
Behavior Research Methods, 53(3), 1081–1096. https://doi.org
/10.3758/s13428-020-01484-3, PubMed: 32974871
Siegler, R. S. (1987). Some general conclusions about children’s
strategy choice procedures. International Journal of Psychology,
22(5–6), 729–749. https://doi.org/10.1080/00207598708246800
Siegler, R. S., & Jenkins, E. A. (2014). How children discover
new strategies. Psychology Press. https://doi.org/10.4324
/9781315807744
Siegler, R. S., & Svetina, M. (2002). A microgenetic/cross-sectional
study of matrix completion: Comparing short-term and long-term
change. Child Development, 73(3), 793–809. https://doi.org/10
.1111/1467-8624.00439, PubMed: 12038552
Siegler, R. S., & Svetina, M. (2006).What leads children to adopt
new strategies? A microgenetic/cross-sectional study of class
inclusion. Child Development, 77(4), 997–1015. https://doi.org
/10.1111/j.1467-8624.2006.00915.x, PubMed: 16942502
Snow, R. E. (1980). Aptitude processes. In R. E. Snow, P. A. Federico, &
W. E. Montague (Eds.), Aptitude, learning and instruction: Cogni-
tive process analyses (pp. 27–63). Lawrence Erlbaum Associates.
Starr, A., Vendetti, M. S., & Bunge, S. A. (2018). Eye movements
provide insight into individual differences in children’s analogi-
cal reasoning strategies. Acta Psychologica, 186, 18–26. https://
doi.org/10.1016/j.actpsy.2018.04.002, PubMed: 29669270
Steiner, H. H., & Carr, M. (2003). Cognitive development in gifted
children: Toward a more precise understanding of emerging dif-
ferences in intelligence. Educational Psychology Review, 15(3),
215–246. https://doi.org/10.1023/A:1024636317011
Stevenson, C. E., & Hickendorff, M. (2018). Learning to solve
figural matrix analogies: The paths children take. Learning and
Individual Differences, 66, 16–28. https://doi.org/10.1016/j
.lindif.2018.04.010
Sun, S., Schweizer, K., & Ren, X. (2019). Item-position effect in
Raven’s matrices: A developmental perspective. Journal of
Cognition and Development, 20(3), 370–379. https://doi.org/10
.1080/15248372.2019.1581205
Thibaut, J.-P., & French, R. M. (2016). Analogical reasoning, control
and executive functions: A developmental investigation with
eye-tracking. Cognitive Development, 38, 10–26. https://doi.org
/10.1016/j.cogdev.2015.12.002
Tunteler, E., Pronk, C. M. E., & Resing, W. C. M. (2008). Inter- and
intra-individual variability in the process of change in the use of
analogical strategies to solve geometric tasks in children: A micro-
genetic analysis. Learning and Individual Differences, 18(1), 44–60.
https://doi.org/10.1016/j.lindif.2007.07.007
Tunteler, E., & Resing, W. C. M. (2007). Change in spontaneous
analogical transfer in young children: A microgenetic study.
Infant and Child Development, 16(1), 71–94. https://doi.org/10
.1002/icd.505
van Renswoude, D. R., Raijmakers, M. E. J., Koornneef, A., Johnson,
S. P., Hunnius, S., & Visser, I. (2018). Gazepath: An eye-tracking
analysis tool that accounts for individual differences and data
quality. Behavior Research Methods, 50(2), 834–852. https://doi
.org/10.3758/s13428-017-0909-3, PubMed: 28593606
Vendetti, M. S., Starr, A., Johnson, E. L., Modavi, K., & Bunge, S. A.
(2017). Eye movements reveal optimal strategies for analogical
reasoning. Frontiers in Psychology, 8, Article 932. https://doi
.org/10.3389/fpsyg.2017.00932, PubMed: 28626443
Vigneau, F., Caissie, A. F., & Bors, D. A. (2006). Eye-movement anal-
ysis demonstrates strategic influences on intelligence. Intelligence,
34(3), 261–272. https://doi.org/10.1016/j.intell.2005.11.003
Vodegel Matzen, L. B. L., van der Molen, M. W., Dudink, A. C. M.
(1994). Error analysis of Raven test performance. Personality and
Individual Differences, 16(3), 433–445. https://doi.org/10.1016
/0191-8869(94)90070-1
Wickham, H. (2009). ggplot2: Elegant graphics for data analysis.
Springer. https://doi.org/10.1007/978-0-387-98141-3
Wilke, C. O. (2019). cowplot: Streamlined plot theme and plot
annotations for ‘ggplot2’ (R package version 0.9). https://CRAN
.R-project.org/package=cowplot
Wong, B. (2011). Points of view: Color blindness. Nature Methods,
8(6), 441. https://doi.org/10.1038/nmeth.1618, PubMed: 21774112
OPEN MIND: Discoveries in Cognitive Science
220
l
D
o
w
n
o
a
d
e
d
f
r
o
m
h
t
t
p
:
/
/
d
i
r
e
c
t
.
m
i
t
.
/
e
d
u
o
p
m
i
/
l
a
r
t
i
c
e
-
p
d
f
/
d
o
i
/
i
.
/
/
1
0
1
1
6
2
o
p
m
_
a
_
0
0
0
7
8
2
1
3
3
8
2
3
o
p
m
_
a
_
0
0
0
7
8
p
d
.
/
i
f
b
y
g
u
e
s
t
t
o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3