RESEARCH ARTICLE

RESEARCH ARTICLE

Gene Expression Correlates of the Cortical
Network Underlying Sentence Processing

Xiang-Zhen Kong1, Nathalie Tzourio-Mazoyer2,3,4, Marc Joliot2,3,4, Evelina Fedorenko5,
Jia Liu6, Simon E. Fisher1,7, and Clyde Francks1,7

1Language and Genetics Department, Max Planck Institute for Psycholinguistics, Nijmegen, The Netherlands
2University of Bordeaux, GIN, IMN, UMR 5293, Bordeaux, France
3CNRS, GIN, IMN, UMR 5293, Bordeaux, France
4CEA, GIN, IMN, UMR 5293, Bordeaux, France
5Department of Brain and Cognitive Sciences and McGovern Institute for Brain Research, MIT, Cambridge, MA, 02139, USA
6Faculty of Psychology, Beijing Normal University, Beijing 100875, China
7Donders Institute for Brain, Cognition and Behaviour, Radboud University, Nijmegen, The Netherlands

Keywords: sentence-processing network, language, functional network, correlated gene expression
network, neural development, autism

ABSTRACT

A pivotal question in modern neuroscience is which genes regulate brain circuits that underlie
cognitive functions. However, the field is still in its infancy. Here we report an integrated
investigation of the high-level language network (i.e., sentence-processing network) in the
human cerebral cortex, combining regional gene expression profiles, task fMRI, large-scale
neuroimaging meta-analysis, and resting-state functional network approaches. We revealed
reliable gene expression–functional network correlations using three different network
definition strategies, and identified a consensus set of genes related to connectivity within the
sentence-processing network. The genes involved showed enrichment for neural development
and actin-related functions, as well as association signals with autism, which can involve
disrupted language functioning. Our findings help elucidate the molecular basis of the brain’s
infrastructure for language. The integrative approach described here will be useful for
studying other complex cognitive traits.

INTRODUCTION

A pivotal question in modern neuroscience is which genes regulate brain circuits that underlie
cognitive functions. In the past decade, imaging genetics has provided a powerful approach
for exploring this question in humans, by combining neuroimaging data and genotype infor-
mation from the same subjects and searching for associations between interindividual variabil-
ity in neuroimaging phenotypes and genotypes within a sample. Although numerous imaging
genetics studies have now been published (Avinun, Nevo, Knodt, Elliott, & Hariri, 2018; Bigos
& Weinberger, 2010; Carter et al., 2017; Thompson et al., 2014), there remain key issues that
affect the field, including sample size limitations, the need to correct for multiple comparisons,
and the small effect sizes that are typical of associations with common gene variants.

Recently, researchers have begun to probe the molecular genetic architecture of the human
brain, not only through genotypes and interindividual variability, but also using regional gene-
expression mapping in postmortem brain tissues, in combination with neuroimaging data collected
from living individuals (Bartres-Faz et al., 2019; Kong, Song, Zhen, & Liu, 2017; McColgan et al.,
2018; Richiardi et al., 2015; Romero-Garcia, Warrier, Bullmore, Baron-Cohen, & Bethlehem,

a n o p e n a c c e s s

j o u r n a l

Citation: Kong, X.-Z., Tzourio-Mazoyer,
N., Joliot, M., Fedorenko, E., Liu, J.,
Fisher, S. E., & Francks, C. (2020). Gene
expression correlates of the cortical
network underlying sentence
processing. Neurobiology of Language,
1(1), 77–103. https://doi.org/10.1162/
nol_a_00004

DOI:
https://doi.org/10.1162/nol_a_00004

Supporting Information:
https://doi.org/10.1162/nol_a_00004

Received: 10 July 2019
Accepted: 19 December 2019

Corresponding Authors:
Clyde Francks
clyde.francks@mpi.nl
Xiang-Zhen Kong
xiangzhen.kong@outlook.com

Handling Editor:
Kate E. Watkins

Competing Interests: The authors have
declared that no competing interests
exist.

Copyright: © 2020 Massachusetts
Institute of Technology. Published
under a Creative Commons Attribution
4.0 International (CC BY 4.0) license.

The MIT Press

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

2018; Romme, de Reus, Ophoff, Kahn, & van den Heuvel, 2017; Vertes et al., 2016). This approach
has brought important new advances: Transcriptional profiles have been linked to neural architecture
with respect to both functional connectivity measured during the resting state (also called intrinsic
connectivity) (Richiardi et al., 2015; Vertes et al., 2016) and structural connectivity (Fulcher
& Fornito, 2016; Richiardi et al., 2015), as well as to alterations of connectivity in brain disorders
such as schizophrenia (Romme et al., 2017), autism spectrum disorder (ASD; Romero-Garcia
et al., 2018), and Huntington’s disease (McColgan et al., 2018). A necessary assumption is that the
group-averaged gene expression map, within a given postmortem data set, is sufficiently representa-
tive to be matched with the group-averaged neuroimaging-derived map from a set of living subjects.

The present study was inspired by a number of recent studies that probed connectivity data
in relation to inter-regional gene expression similarity. Richiardi et al. (2015) used data from
resting-state functional MRI (rs-fMRI) to show that the network modularity structure, derived
from functional connectivity patterns across the cortex, was correlated with inter-regional sim-
ilarity of gene expression profiles. Specifically, regions within a module (i.e., regions with
stronger functional connectivity) showed more similar gene expression profiles than across
different modules. French and Pavlidis (2011) showed that average gene expression similarity
is higher for regions with structural connections, compared to unconnected regions, in adult
rodent brain, whereas Goel, Kuceyeski, LoCastro, and Raj (2014) examined the association
between gene expression similarity and pairwise structural connectivity in the human brain
(Goel et al., 2014). Seidlitz et al. (2018) showed a significant correlation between inter-regional
gene co-expression and edge weights based on morphometric similarity in the human brain. In
another study, Romme et al. (2017) investigated the transcriptional profiles of a set of genes
known to contain inherited variants associated with schizophrenia, and found that they were
significantly correlated with regional reductions in the strength of white matter connections in
people with schizophrenia. Whitaker et al. (2016) investigated links between transcriptome data
and adolescent cortical development, specifically cortical thickness shrinkage and intracortical
myelination, and implicated oligodendroglial genes in these processes, as well as a set of genes
associated with risk for schizophrenia. More recently, gene expression in the human brain has
also been linked to regional differences in cortical scaling (Reardon et al., 2018) and anatomical
hierarchy (Burt et al., 2018), using similar spatial correlation approaches. (For a current review
on linking neuroimaging and transcriptome, see Fornito, Arnatkeviciute, & Fulcher, 2019).

These previous studies combining gene expression and brain imaging data have provided
strong evidence that patterns of gene expression covary with the anatomical and functional
organization of the human brain. However, the ultimate goal is to gain a rich and detailed
picture of the genetic and molecular mechanisms that support each core cognitive ability.
Here we focus on the quintessential and human-unique capacity for language. Previous stud-
ies have suggested that language-related cognitive performance is highly heritable (e.g., Dale
et al., 1998; Guen, Amalric, Pinel, Pallier, & Frouin, 2018; Newbury, Bishop, & Monaco,
2005), and that brain activations associated with semantic comprehension tasks are also her-
itable (Guen et al., 2018). Moreover, genetic factors also play a substantial role in susceptibil-
ity to language-related neurodevelopmental disorders such as childhood apraxia of speech
(Eising et al., 2018), developmental language disorder (specific language impairment), and de-
velopmental dyslexia (Deriziotis & Fisher, 2017). Crucially, although a small number of genes—
such as FOXP2 (e.g., Fisher & Scharff, 2009; Lai, Fisher, Hurst, Vargha-Khadem, & Monaco,
2001)—have now been unambiguously linked to language-related disorders, these genes cannot
by themselves explain the large majority of heritable variation; and they cannot conceivably
create or maintain the necessary brain circuits underlying language without interacting with a
large number of other genes (Graham & Fisher, 2015; Konopka & Roberts, 2016).

Neurobiology of Language

78

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

In addition, linguistic deficits are often found with heritable, neurodevelopmental disorders
for which impaired language function is not necessarily diagnostic, including intellectual disabil-
ity, autism spectrum disorder (or ASD), and schizophrenia (Bearden et al., 2000; Geschwind
& Flint, 2015; Kaufman, Ayub, & Vincent, 2010; Kleinhans, Muller, Cohen, & Courchesne,
2008; Lombardo et al., 2015; Schizophrenia Working Group of the Psychiatric Genomics
Consortium, 2014; Tager-Flusberg, Paul, & Lord, 2005; Tomblin, 2011). Linguistic ability also
correlates with intelligence in the general population (Silva, Williams, & McGee, 1987). Thus,
identifying genes and molecular mechanisms associated with language will not only yield a bet-
ter understanding of the biological pathways that lead to the emergence of language phylo- and
ontogenetically, but will also help identify susceptibility factors for language impairments in neu-
ropsychiatric conditions, which could lead to improved diagnostic and treatment strategies.

To shed further light on the molecular architecture underpinning language circuits, here we
synergistically combined task fMRI data, resting-state functional connectivity, and gene transcrip-
tion profiles in the human brain. Specifically, we targeted sentence-level processing as an essen-
tial, high-level linguistic function, which has been linked to a network of regions, particularly in
the left temporal and frontal cortices (Dronkers, Wilkins, Van Valin, Redfern, & Jaeger, 2004;
Fedorenko & Thompson-Schill, 2014; Labache et al., 2018; Price, 2012; Vigneau et al., 2006),
as opposed to lower level language-related functions, which can rely, for example, on primary
auditory and motor areas. First, we defined the cortical regions for left hemispheric sentence pro-
cessing based on task fMRI data, using three different sets of criteria and data to ensure robustness
and generalizability across approaches. Then, we estimated the resting-state functional connectiv-
ity networks among these regions, using rs-fMRI data from two independent data sets. Previous
studies have shown that functionally defined language-responsive regions form an integrated sys-
tem also in the resting state (Blank, Kanwisher, & Fedorenko, 2014; Cole, Bassett, Power, Braver,
& Petersen, 2014; Labache et al., 2018). Next, we examined the correlations between these func-
tional connectivity patterns and the corresponding inter-regional similarity patterns of gene expres-
sion, based on postmortem data. Similar pairwise connectivity/similarity correlation approaches
have been applied in a number of previous studies aiming to probe connectivity data in relation
to gene-expression data (e.g., Goel et al., 2014; Seidlitz et al., 2018). We then assessed the con-
tributions of each individual gene to the observed correlations, and identified a consensus set of
top genes (showing positive contribution scores) across all six analyses (i.e., three definition strat-
egies for the sentence-processing regions by two rs-fMRI data sets for estimating the functional
connectivity). Finally, using several bioinformatics databases, we explored the biological roles
and expression specificity of these genes, and tested whether they showed an enrichment for as-
sociation signals with ASD, schizophrenia, or intelligence, using genome-wide association study
(GWAS) data. In addition, we analyzed three other functional networks by way of comparison to
these language-related networks: the spatial navigation network, frontoparietal multiple demand
network, and default mode network. Given left-hemisphere dominance of the language network
and limited postmortem gene expression data for the right hemisphere (see AHBA), we focused on
the left hemisphere in the present study.

MATERIALS AND METHODS

Figure 1 shows a schematic of our approach for measuring the correlation between functional
connectivity and gene expression profiles within a given network of brain regions. This anal-
ysis pipeline consisted of (1) defining sets of cortical regions using task-activation data (directly
or via meta-analysis), (2) estimating the resting-state functional connectivity and gene expres-
sion similarity among these regions, and (3) assessing the correlation between the functional
connectivity and gene expression networks, along with subsequently (4) estimating each

Neurobiology of Language

79

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

Figure 1.
Schematic of the pipeline for computing the correlation between resting-state functional connectivity and transcriptomic similarity within
a network of regions first defined according to task fMRI data. Regional time series from rs-fMRI were used to estimate functional connectivity
between each pair of regions (in the upper matrix, red indicates stronger functional connectivity and blue indicates weaker functional connectivity).
Regional gene expression profiles were applied to estimate inter-regional transcriptional similarity (in the lower matrix, red indicates higher similarity
and yellow indicates lower similarity; lower panel). Resting-state functional connectivity patterns (orange axis in the right-hand panel) are correlated
with patterns of transcriptional similarity (blue axis in the right-hand panel). Each gene’s contribution to this correlation is estimated by removing that
gene at the step of gene expression network construction (red cross) and repeating the subsequent analysis.

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

gene’s individual contribution using a “leave-one-out” approach (see subsequent text for de-
tails of each data set and procedure).

Data Sets

fMRI data set acquisitions were approved by the institutional review board at each site. Written
informed consent was obtained when necessary from all participants (N = 244), before they
took part.

BIL&GIN

This data set included 144 healthy right-handed adults (mean age ± SD 27 ± 6 years; 72
female) drawn from the larger BIL&GIN database, which is roughly balanced for handedness
(Mazoyer et al., 2016). Each participant completed three slow-event fMRI runs (gradient echo
planar imaging, repetition time [TR] = 2.0 s, acquisition voxel size = 3.75 × 3.75 × 3.75 mm3;
3T Philips Intera Achieva scanner) in which they were asked to complete three different sen-
tence tasks including covertly producing, listening to, or reading sentences and familiar word
lists as reference. Of these participants, 137 also completed rs-fMRI scans using the same im-
aging sequence as that used for the tasks, which lasted 8 minutes (240 volumes). Immediately
before rs-fMRI scanning, the participants were instructed to “keep their eyes closed, to relax, to
refrain from moving, to stay awake, and to let their thoughts come and go.” Note that the latter
scanning session took place around 1 year before the task fMRI scans. (For more details see
further below and Labache et al., 2018.)

NeuroSynth

Neurosynth (http://neurosynth.org) is a platform for large-scale synthesis of task fMRI data
(Yarkoni, Poldrack, Nichols, Van Essen, & Wager, 2011). It uses text-mining techniques to

Neurobiology of Language

80

Genes and the Sentence-Processing Network

detect frequently used terms as proxies for concepts of interest in the neuroimaging literature:
Terms that occur at a high frequency in a given study are associated with all activation co-
ordinates in that publication, allowing for automated term-based meta-analysis. Despite the
automaticity and potentially high noise resulting from the large-scale meta-analysis, this ap-
proach has been shown to be robust and meaningful (e.g., Helfinstein et al., 2014; Kong,
Song, et al., 2017; Kong, Wang, et al., 2017; Yarkoni et al., 2011), due to the high number of
studies included. We used database version 0.6 (current as of July 2018), which included
413,429 activation peaks reported in 11,406 studies (see Defining Cortical Regions for
Sentence Processing).

EvLabN60

This data set included statistical maps of the task fMRI contrast for passively reading sentences
versus nonwords from 60 participants (ages from 19 to 45 years; 41 female; all right-handed).
This fMRI task (TR = 2.0 s, acquisition voxel size 2.1 × 2.1 × 4.0 mm3; 3 T Siemens Trio scan-
ner) was designed to localize the sentence-processing network (for details, see Fedorenko,
Hsieh, Nieto-Castanon, Whitfield-Gabrieli, & Kanwisher, 2010). The sentences > nonwords
contrast has been shown previously to reliably activate language-sensitive regions and to be
robust to the materials, task, and modality of presentation (Fedorenko, Behr, & Kanwisher,
2011; Fedorenko et al., 2010; Mahowald & Fedorenko, 2016; Scott, Gallee, & Fedorenko,
2017). In addition, the EvLabN60 data set was used to define two other networks used as
comparisons to sentence-processing networks (below). A spatial working memory task was
designed to localize the frontoparietal multiple demand system (i.e., contrast Hard versus
Easy) (Blank et al., 2014; Fedorenko, Duncan, & Kanwisher, 2013) and the default mode net-
work (i.e., contrast Easy versus Hard) (Leech, Kamourieh, Beckmann, & Sharp, 2011;
McKiernan, Kaufman, Kucera-Thompson, & Binder, 2003; Park, Polk, Hebrank, & Jenkins,
2010). Participants were instructed to keep track of four (Easy condition) or eight (Hard con-
dition) sequentially presented locations in a 3×4 grid. In both conditions, participants per-
formed a two-alternative forced-choice task at the end of each trial to indicate the set of
locations they just saw. (For more details, see Fedorenko et al., 2011.)

GEB

GEB (http://www.brainactivityatlas.org), which is an abbreviation for “Gene-Environment-
Brain-Behavior,” provided an independent rs-fMRI data set for the present study. GEB is an
ongoing project that focuses on linking individual differences in human brain and behaviors
to environmental and genetic factors (Kong, Song, et al., 2017; Wang et al., 2016; Zhen et al.,
2017; Zhen et al., 2015). Rs-fMRI data from 40 college students (20 female; mean age ± SD
20.3 ± 0.91 years) were included in this study. The resting-state scan lasted 8 minutes and
consisted of 240 contiguous echo-planar-imaging (EPI) volumes (TR = 2.0 s; acquisition voxel
size = 3.125 × 3.125 × 3.6 mm3; 3 T Siemens Trio scanner). During the scan, participants were
instructed to relax and remain still, with their eyes closed. The data set has high quality in
terms of minimal head motion and registration errors, and has been used in several previous
studies (e.g., Kong, Wang, et al., 2017; Wang et al., 2016).

AHBA

AHBA (Allen Human Brain Atlas; http://www.brain-map.org) is a publicly available online re-
source for gene-expression data. The atlas characterizes gene expression (i.e., messenger RNA
[mRNA] quantification) in postmortem human brain based on genome-wide microarray-based

Neurobiology of Language

81

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

measurement, for more than 20,000 genes at ∼500 sampling sites distributed over the whole
brain. (See Hawrylycz et al., 2012 for more detail about the data collection.) Normalized ex-
pression data were used in the present study. To date (search conducted on March 30, 2017),
six adult donors with no history of neuropsychiatric or neurological conditions were available
in the database (ages 24, 31, 34, 49, 55, and 57 years; one female). Left hemisphere cerebral
cortical data are available for all six donors, whereas right-hemisphere data are available for
only two. Detailed information on donors and analysis methods is available at http://www.
brain-map.org. Structural brain imaging data of each donor were used to align sampling sites
into standard coordinate space. We also used gene-expression data from the Allen Institute’s
BrainSpan project (http://www.brainspan.org/), which includes human brain tissues from (em-
bryo) 8 weeks postconception to (adult) 40 years of age, sampling an average 13 regions
(range, 1–17) from one to three brains per time point, and measured using RNA sequencing
(Li et al., 2018). We used these latter data for examining the expression of specific genes of
interest across human brain development.

Genome-wide association study results for ASD, schizophrenia, and intelligence

We downloaded GWAS summary statistics from the Psychiatric Genomics Consortium (2018;
PGC) for ASD with up to 7,387 cases and 8,567 controls (Autism Spectrum Disorders Working
Group of the Psychiatric Genomics Consortium, 2017), and schizophrenia with up to 36,989
cases and 113,075 controls (Schizophrenia Working Group of the Psychiatric Genomics
Consortium, 2014). In addition, we downloaded GWAS results for intelligence, based on
78,308 individuals from the UK Biobank, CHIC consortium, and five additional cohorts
(GWAS Summary Statistics, 2018; Sniekers et al., 2017). This kind of data comprises summary
statistics from genetic association testing between each of millions of single nucleotide poly-
morphisms (SNPs) distributed over the genome, in relation to the trait or disorder of interest.

Defining Cortical Regions for Sentence Processing

We employed three strategies and data sources for defining the cortical sentence-processing
network. To refer to the regions defined under each of these three approaches, we use the
terms Supramodal Sentence Areas (SmSAs), Synthesized Sentence Areas (SSAs), and One-
contrast Sentence Areas (OcSAs). Given left-hemisphere dominance of the language network
(see Introduction) and limited postmortem gene-expression data for the right hemisphere (see
AHBA), we focused on the left hemisphere in the present study. Note that we made use of ex-
isting, previously published task-fMRI data sets for this study (see below), and do not repeat all
of the method details here.

SmSAs

We applied the definition of left-hemispheric high-order and supramodal sentence areas pro-
vided by The SENSAAS atlas (Labache et al., 2018). This atlas of language integrative and
supramodal areas, involving 142 healthy right-handers, is based on the conjunction of activa-
tion across sentence production, listening, and reading, as contrasted with activation for lists of
overlearned words (again presented as production, listening, or reading tasks) in the same par-
ticipants. Then, a second criterion was applied whereby leftward activation asymmetry was
required during the three sentence minus words contrasts. (See Labache et al., 2018 for a full
description of this definition approach.) Task fMRI data were used from the BIL&GIN data set
(see Data Sets) for this purpose. To obtain accurate measures of functional asymmetry, this
work made use of the Atlas of Intrinsic Connectivity of Homotopic Areas (AICHA) atlas,

Neurobiology of Language

82

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

including left-right homotopic regions of interest based on resting-state functional connectivity
data (Joliot et al., 2015). In the 179 left-right pairs of homotopic regions of the AICHA atlas,
fMRI signal variation and asymmetry were calculated for each task contrast and each par-
ticipant. Regions with both significant activation and leftward asymmetry across the three
sentence-level versus word-list task contrasts were identified. A significance threshold of
Bonferroni-corrected p < 0.05 was applied. Thirty-two left-hemisphere regions were obtained, including 27 cortical and 5 subcortical regions (Labache et al., 2018). SSAs SSAs were defined based on a large-scale neuroimaging meta-analysis of fMRI studies using Neurosynth (see Data Sets). A combination of terms related to sentence processing was used, including “sentence comprehension,” “sentence,” and “sentences” (411 studies). The resulting meta-analysis map (i.e., the likelihood map that shows there would be activation in some spe- cific brain regions given the presence of particular terms) was used in this study to cover re- gions that are relevant to the network of interest. To control the false-positive rate in the statistical map, a false discovery rate (FDR) threshold was used of 0.01 on a whole brain basis. As for SmSAs, the AICHA atlas was used to define the areas for functional network construction. Specifically, if a region from the AICHA atlas had more than half (50%) of its voxels showing significant specificity based on the thresholded mask from the meta-analysis (FDR-corrected p < 0.01), we included that region as one of the SSAs. The rationale for using the AICHA atlas was twofold. First, the AICHA atlas was generated using rs-fMRI connectivity data, with each region showing homogeneity of functional temporal activity within itself. Second, we wanted to ensure that the results were directly comparable with the results using SmSAs. OcSAs OcSAs were defined based on the probabilistic activation map of a single fMRI contrast from the EvLabN60 data set (see Data Sets). The passively reading task has been shown previously to reliably activate language-sensitive regions and to be robust to the materials, task, and modality of presentation (Fedorenko et al., 2011; Fedorenko et al., 2010; Mahowald & Fedorenko, 2016; Scott et al., 2017). The map was created by overlapping the statistical maps from all participants for the contrast of sentences versus nonwords (t > 2.3) onto the MNI152 template (Montreal
Neurological Institute template 152), and then dividing by the total number of participants (e.g.,
Zhen et al., 2017). The value for each voxel in the obtained map indicated the probability of
the voxel showing a significant contrast activation across the population. A probability thresh-
old of 50% was applied to identify voxels showing consistent activation (t > 2.3) across sub-
jects, and regions from the AICHA atlas with more than half of the voxels activated were
included, as for SSAs.

Exclusion of regions

We excluded one region that was relatively small (less than 150 voxels; i.e., G_Cingulum_Post-3)
and had limited gene-expression data in the AHBA (fewer than two sampling sites), as well as
subcortical areas that are known to have gene-expression profiles that were very different from
those of cerebral cortex and would swamp the analysis (e.g., hippocampus), a region that had
resting-state data missing in the GEB data set (i.e., G_Paracentral_Lobule-4), and two deep
regions where the gene-expression data were found to diverge substantially from most cerebral
cortical regions (i.e., G_ParaHippocampal-1 and G_Insula-anterior-1) (Figure S1). This resulted

Neurobiology of Language

83

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

in 21 SmSAs, 22 SSAs, and 12 OcSAs. The same criteria were also applied for defining the
comparison systems below (Table S1).

Comparison networks

The NeuroSynth term “navigation” was used to localize cortical regions involved in spatial
navigation (64 studies; search conducted on Nov. 3, 2016). An FDR threshold of 0.01 was
used to control the false-positive rate. Moreover, the default mode network and the fronto-
parietal multiple demand networks were defined using the EvLabN60 dataset (see Data
Sets), with the probabilistic activation maps of fMRI contrast Easy versus Hard, and Hard ver-
sus Easy, respectively, based on the spatial working memory task. Again, a probability thresh-
old of 50% was applied to identify voxels showing consistent activation (t > 2.3) across
subjects. Next, the AICHA atlas was used to identify regions for each functional network.
Our purpose was to define a similar number of top regions for each comparison network,
to support similarly powered analyses of all networks in the downstream analyses. To obtain
comparable numbers of top regions for each network, different thresholds of overlap were ap-
plied for each network. Specifically, we found that a threshold of 1/4 defined 19 top regions of
the spatial navigation network (SNN), a threshold of 1/2 defined 17 top regions of the multiple
demand network (MDN), and a threshold of 3/4 defined 12 top regions of the default mode
network (DMN).

Construction of the Functional Connectivity Networks

Two independent rs-fMRI datasets (i.e., BIL&GIN and GEB [above]) were used for functional con-
nectivity network construction among the sets of regions defined based on task fMRI activation.

Functional connectivity in the BIL&GIN data set

The preprocessing of the BIL&GIN data set was done by the Bordeaux group (MJ). Pre-
processing procedures included head motion correction and registration onto the anatomical
T1 image, the latter being stereotaxic registered on the MNI152 standard space. In addition,
time series for white matter and cerebrospinal fluid, the six head motion parameters, and the
temporal linear trend were removed from the stereotaxic normalized rs-fMRI data using regres-
sion analysis, and time series data were temporally filtered using a least squares linear-phase
finite impulse response (FIR) filter design bandpass (0.01–0.1 Hz). For each participant and
each region, a time series was then calculated by averaging the rs-fMRI time series of all voxels
located within that region. For each individual, we computed the Pearson’s correlation co-
efficient between the time series of each pair of cortical regions from a given task-defined
set of regions. Correlation coefficients were transformed to Gaussian-distributed z scores
via Fisher’s transformation. For each of the six networks (three sentence-processing and three
other functional networks), the functional connectivity matrix was computed by averaging data
from each individual. Negative edges were set to zero before follow-up analysis. (For a full de-
scription of the processing see Doucet et al., 2011).

Functional connectivity in the GEB data set

The preprocessing of the GEB data set was done by the Beijing group ( JL). Preprocessing pro-
cedures included head motion correction, spatial smoothing, intensity normalization, and re-
moval of linear trend, using the FEAT preprocessing workflow implemented with Nipype
(Gorgolewski et al., 2011). A temporal band-pass filter (0.01–0.1 Hz) was applied to reduce
low-frequency drifts and high-frequency noise. To eliminate physiological noise, such as

Neurobiology of Language

84

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

fluctuations caused by motion or cardiac and respiratory cycles, nuisance signals were re-
gressed out. Nuisance regressors were averaged cerebrospinal fluid signal, averaged white
matter signal, global signal averaged across the whole brain, six head realignment parameters
obtained by rigid-body head motion correction, and the derivatives of each of these signals.
The four-dimensional (4D) residual time series obtained after removing the nuisance covari-
ates were registered to MNI152 standard space. After preprocessing, a continuous time course
for each region of a given task-based functional network was extracted by averaging the time
courses of all voxels within that region. Temporal correlation coefficients between the ex-
tracted time course from a given region and those from other regions were calculated to
determine the strength of the connections between each pair of regions of a given functional net-
work at rest. Correlation coefficients were transformed to Gaussian-distributed z scores via Fisher’s
transformation to improve normality, resulting in a symmetric Z value matrix (i.e., functional con-
nectivity) for each task-defined system of each participant. Due to the ambiguous biological ex-
planation of negative correlations (Schwarz & McGonigle, 2011), we restricted our analyses to
positive edges and set negative edges to 0. We have applied the same processing procedure in
several previous studies (e.g., Kong, Wang, et al., 2017; Wang et al., 2016). After resting-state
functional connectivity networks were obtained corresponding to each task-defined set of regions,
a mean functional connectivity network for each set of regions was calculated by averaging across
participants, which was then used for subsequent analyses.

Construction of the Gene Expression Networks

Transcriptomic networks were constructed based on gene expression profiles in the human
brain from the AHBA (above). Specifically, we first extracted the normalized expression score
for each gene, from each sampling site, and each donor. For genes for which the expression
had been measured using multiple microarray probes in the AHBA data, average values were
calculated per gene at each sampling site and in each donor. Given that AICHA atlas regions
are defined in the standard MNI space, the location of each sampling site was then translated
into the standard space using alleninf, 2018. Gene expression data from within a given region
were then averaged per gene and across donors, to obtain a single expression measure of each
gene per region. We restricted our analyses to cerebral cortical regions with at least two sam-
pling sites summed across all donors, and data quality was then ensured by including only
genes showing relatively high interindividual consistency of expression levels, that is, we re-
stricted our analyses to the top 5% of all genes based on differential stability across donors as
assessed over the entire cerebral cortex (Figure S2). This was a set of 867 genes that had dif-
ferential stability greater than 0.357, as calculated previously (Hawrylycz et al., 2015). The
concept of differential stability was quantified as the averaged Pearson correlation between
pairs of brains over a set of cerebral cortical regions (Hawrylycz et al., 2015). The percentage
threshold applied (5%) seems to be strict, but the expression consistency of most genes is rel-
atively low in cortical regions in the Allen data (Hawrylycz et al., 2015), so that a 5% threshold
still includes genes with stability across donors of less than 0.4 (Figure S2). Excluding genes
with very low interindividual consistency was necessary to link average gene expression data
from the Allen brain data set to functional connectivity in other data sets. In addition, we
repeated our analysis using a lower stability threshold (i.e., top 10%, stability larger than
0.25; N = 1735 genes), to assess the robustness of our findings with respect to this threshold.
As a negative control, we repeated the analyses using the lowest 5% and lowest 10% of genes
with respect to differential stability across donors, with which we would expect null results.
We also repeated the analyses based on random sampling (repeated 1000 times) of genes from
the whole distribution of stability scores.

Neurobiology of Language

85

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

Our processing of the genetic data produced a vector of gene expression values across re-
gions. Transcriptional similarity between pairs of regions was then estimated by Pearson cor-
relation, as a measure of “transcriptomic connectivity.”

Correlation Between the Functional Networks and Gene Expression Networks

Network similarity analysis

We examined the correlations between functional connectivity networks and gene expression
networks. The essential concept is that pairs of regions of a functionally specific network that
have relatively higher similarity of their gene expression profiles could also show higher func-
tional connectivity (see Introduction). Specifically, a vector was extracted from the upper
triangle of the connectivity matrix of each network, and Spearman correlation was then cal-
culated between the vectors of the functional connectivity networks and their corresponding
gene expression networks. Unlike previous studies focusing on general organization of the
human brain, for example, modularity (Richiardi et al., 2015) and hubs (Vertes et al., 2016),
the present approach tested direct correlation between functional connectivity and gene ex-
pression similarity within the functional networks of interest. We also calculated the partial
correlations after controlling for the spatial distance between centers of regions (i.e., the
Euclidean distance of MNI coordinates) within the AICHA atlas (Joliot et al., 2015).

Gene contribution index

In addition to the overall correlations between functional connectivity and corresponding
gene expression networks, we formulated a novel index, the gene contribution index (GCI),
for estimating the contribution of each individual gene to an observed overall correlation. GCI
was defined as the difference in the overall correlation before and after removing that gene at
the step of gene expression network construction, that is, based on a “leave-one-out” ap-
proach. Note that this leave-one-out procedure was applied at the step of gene expression
network construction, rather than when correlation analysis was performed between two ma-
trices (see Figure 1). A similar approach has been applied in a recent study for the same pur-
pose (i.e., estimation of each gene’s contribution to the overall correlation; Seidlitz et al.,
2018).

Identification of “consensus genes” correlated with the sentence-processing network

We identified a set of “consensus genes” (N = 41), that is, those that had positive CGI scores in
all six analyses of the sentence-processing network: the three definition strategies SmSA, SSA,
and OcSA, by the two independent rs-fMRI data sets BIL&GIN and GEB. These were therefore
the genes that made the most consistent individual contributions to the significant overall cor-
relations between expression similarity and functional connectivity, regardless of the definition
of sentence network regions. For each of the three comparison networks (spatial navigation,
frontoparietal multiple demand, and default mode networks) we also identified the same num-
ber of genes (i.e., 41) showing the highest GCIs, based on the averaged score of each gene in
the two analyses relevant to each of those networks, that is, with the two rs-fMRI data sets.

Follow-up Analyses of the Consensus Genes

We further investigated the consensus genes correlated with the sentence-processing network
by use of literature searches and bioinformatics tools.

Neurobiology of Language

86

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

Gene ontology

Gene ontology provides a classification scheme for genes based on what is known with
respect to their molecular functions, the biological processes with which they are involved,
or the cellular components that they encode (Gene Ontology Resource, 2020). Gene ontol-
ogy analyses were performed with the Bioconductor package gProfileR (2019) using ontol-
ogies from Ensembl release 91. Gene sets containing between 25 and 1000 genes were
included. All known genes were used for determining the statistical domain size in the anal-
ysis. The default g:SCS method in the tool was used for multiple testing correction (corrected
p < 0.05). Gene expression across brain development We queried the Allen Institute’s BrainSpan project data (see data sets above) for the consensus genes, in relation to their developmental changes in expression from embryo to adult. Gene expression specificity For the 41 consensus genes correlated with connectivity in the sentence-processing network, we contrasted their expression levels between those cortical regions that were assigned to the sentence-processing network under any definition (i.e., all SmSA, SSA, and OcSA) (N = 31), versus all regions outside the system that were included among the SNN, MDN, or DMN (N = 34). More information about the lists of regions can be found in Table S1. The independent samples t-test (equal variances not assumed) was used to contrast expression levels. Because multiple comparisons were performed across 41 genes, a significance threshold of FDR- corrected p value of 0.05 was applied (Benjamini-Hochberg FDR). In terms of cell-type specificity, the expression levels of each of the 41 consensus genes that were correlated with connectivity within the sentence-processing network were queried in a published data set based on mouse cortical data (as indexed as sequence reads per kilobase of exon per million reads mapped (FPKM; Zhang et al., 2014). We also queried another single- cell gene expression database (Zeisel et al., 2015) to investigate the cell-type specificity of genes. Several recently published single-cell, or single-nucleus, gene expression data sets from human neocortex also became available while this study was in progress (Fan et al., 2018; Lake et al., 2018; Li et al., 2018; Zhong et al., 2018). We also queried these data sets. Enrichment Analysis Using GWAS Summary Statistics for ASD, Schizophrenia, and Intelligence We tested the hypothesis that the consensus set of sentence-processing network genes was enriched for genetic association signals with ASD, schizophrenia, or intelligence, in the pub- licly available results from very large-scale genome-wide association studies, based on case- control or general population cohorts. (There were no large-scale GWAS results yet available for reading/language measures in the general population, or disorders such as dyslexia or lan- guage impairment which involve linguistic deficits.) Specifically, we ran gene set analyses using MAGMA software ( 2020). MAGMA was run with default settings. Because each gene contains multiple individual polymorphisms within it, gene-based association scores were de- rived using the SNP-wise mean model, which considers the sum of -log(p-values) as derived from GWAS analysis, for SNPs located within the transcribed region of a given gene (using NCBI 37.3 gene locations). MAGMA accounts for gene size, number of SNPs in a gene, and linkage disequilibrium (LD) between SNPs when estimating gene-based association scores. LD between SNPs was based on the 1000Genomes phase 3 European ancestry Neurobiology of Language 87 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . e d u n o / l / l a r t i c e - p d f / / / / / 1 1 7 7 1 8 6 7 8 8 8 n o _ a _ 0 0 0 0 4 p d . l f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Genes and the Sentence-Processing Network samples. In this analysis, the score for a given gene therefore indicates how strongly genetic variation within, or in LD with, that gene is associated with the trait of interest (i.e., ASD, schizophrenia, or intelligence). These GWAS-based gene scores were subsequently used to compute gene set enrichment within the 41 consensus genes associated with the sentence- processing network. The enrichment analysis tests whether the genes in a given set have, on average, higher GWAS-based gene scores than the other genes in the genome. No cutoff was made on the GWAS-based gene scores, so that for all genes the degree of association with the trait of interest was taken into account. A significance threshold of FDR-corrected p value of 0.05 was applied to correct for multiple comparisons (i.e., ASD, schizophrenia, or intelligence). Similarly, we conducted exploratory gene set analyses with top genes for the other comparison functional networks, to compare with results for the sentence-processing network. RESULTS Functional Networks Given the absence of a universally agreed upon protocol for localizing brain regions that support high-level language processing (Fedorenko & Thompson-Schill, 2014), we used three different definition strategies: (1) Supramodal Sentence Areas (or SmSAs) based on the concor- dance of activation across three language fMRI tasks and leftward lateralization completed by 144 healthy right-handers (Labache et al., 2018), (2) Synthesized Sentence Areas (or SSAs) based on large-scale neuroimaging meta-analysis of fMRI studies (Yarkoni et al., 2011), and (3) One-contrast Sentence Areas (or OcSAs) based on the probabilistic activation map of a single language-task fMRI contrast (Fedorenko et al., 2010) (see Materials and Methods). Regions were defined according to the AICHA brain atlas, which is derived from rs-fMRI con- nectivity data, with each region showing homogeneity of functional temporal activity within itself (Joliot et al., 2015). The three resulting maps of the sentence-processing network included 21, 22, and 12 regions, respectively, which showed considerable overlap and were consistent with previous studies, especially with respect to core language regions such as the temporal and frontal regions (Figure 2; Table S1) (Fedorenko & Thompson-Schill, 2014). To compare to language-related networks, we also analyzed three other functional cortical networks: the spatial navigation network (SNN), the frontoparietal multiple demand network (MDN; similar to the dorsal attention network [Yeo et al., 2011]), and the default mode net- work (DMN). The sets of cortical regions defined for these networks (see Materials and Methods) appeared consistent with previous literature (Buckner, Andrews-Hanna, & Schacter, 2008; Crittenden, Mitchell, & Duncan, 2016; Kong, Wang, et al., 2017) and showed little overlap with the sentence-processing networks defined in the preceding text (Figure 2). We obtained 19, 17, and 12 areas for the SNN, MDN, and DMN, respectively. More information about the spatial distribution of each definition and their overlaps can be seen in Table S1. Connectivity within a given functionally defined set of regions was estimated based on inter-regional synchronization of rs-fMRI time courses. The connectivity patterns based on two independent rs-fMRI data sets (BIL&GIN and GEB; see Methods) were highly correlated for all functional networks (rho > 0.80; Figure 2), indicating high reproducibility of the group-
level functional connectivity pattern with respect to AICHA atlas regions. Corresponding gene
expression networks for each functionally defined set of regions were calculated based on
postmortem cortical gene expression data, and using pairwise similarities of regional gene ex-
pression profiles (see Materials and Methods). To create a reliable average brain map of cor-
tical gene expression, the analyses were restricted to 5% of all genes (i.e., 867 genes), which

Neurobiology of Language

88

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Figure 2. Functional and transcriptomic networks of the present study. First (left-side) column: the
sets of cortical regions defined based on task fMRI data; the top three (SmSA, SSA, and OcSA) cor-
respond to the three different ways of defining the network for sentence processing, and the bottom
three correspond to the three other comparison networks. Second and fourth columns: the functional
connectivity networks based on rs-fMRI from the BIL&GIN data set and GEB data set, respectively.
Third column: the transcriptomic networks based on the AHBA database. Within each matrix, the
colors indicate the strength of pairwise inter-regional correlations or transcriptomic inter-regional
similarity. Correlation values among the right three columns are the Spearman correlations (rho)
of each pair of matrices for a given network, based on the upper triangle part of each matrix. All
correlations were significant ( p < 0.01). SmSA, Supramodal Sentence Area; SSA, Synthesized Sentence Area; OcSA, One-contrast Sentence Area; SNN, Spatial Navigation Network; MDN, Multiple Demands Network; DMN, Default Mode Network. Neurobiology of Language 89 Genes and the Sentence-Processing Network showed the highest differential stability of expression levels across individual donors in cere- bral cortical data from the Allen Brain Atlas (Hawrylycz et al., 2015). This approach meant that we focused only on genes that have consistent expression with respect to the average organi- zation of the cerebral cortex. Within a given network, different pairs of regions varied in how similar they were in gene expression, although all regional pairwise correlations were high (greater than 0.9) (Figure 2). Similarity Between Functional Networks and Gene Expression Networks We used correlation analysis to test whether regions with more similar gene transcription profiles show stronger resting-state functional connectivity within each specific functional net- work. As expected, based on data from the BIL&GIN data set, we found significant correlations between functional connectivity and the corresponding transcriptomic similarity patterns for each sentence processing network definition (Figure 2; SmSA: rho = 0.19, p = 0.0048; SSA: rho = 0.26, p < 0.0001; OcSA: rho = 0.42, p = 0.00045). In addition, we obtained similar results for each of the comparison networks (Figure 2; SNN: rho = 0.24, p = 0.0020; MDN: rho = 0.39, p < 0.0001; and DMN: rho = 0.40, p = 0.00081). Using connectivity data from the independent rs-fMRI data set GEB, highly similar results were found (Figure 2; SmSA: rho = 0.19, p = 0.0054; SSA: rho = 0.24, p = 0.00018; OcSA: rho = 0.35, p = 0.0042; SNN: rho = 0.33, p < 0.0001; MDN: rho = 0.50, p < 0.0001; and DMN: rho = 0.60, p < 0.0001). All gene- brain correlations survived correction for multiple testing (FDR corrected p < 0.01). We found similar correlations when applying a more inclusive threshold for gene expression differential stability across donors (stability >0.25, i.e., the top 10% genes in stability), and as a negative
control, we also tested the bottom genes in differential stability across donors (i.e., bottom 5%
or 10% of genes), but saw little evidence of significant gene-brain correlation (Table S2), as
expected. We also repeated the gene-brain correlation analyses based on random sampling
(repeated 1000 times) of genes from the whole distribution of stability scores, and found that
the correlations based on the top genes were greater than the correlations of random gene sets
of equal size (Figure S3), which further supported the observed gene-brain associations.

For the top 5% of genes by interdonor stability, we also calculated the partial correlations
between transcriptomic and functional connectivity matrices after correcting for the spatial
proximity of regions (see Materials and Methods). These partial correlations were weaker than
those without the correction for spatial proximity, and mostly nonsignificant (Table S3), suggest-
ing that it is mostly local regional similarities that drive the overall transcriptomic-connectivity
correlations within the functional networks defined in this study.

Gene Contribution Index

With a leave-one-out procedure, we obtained gene contribution index (GCI) scores for all
individual genes, and for each functional network definition, which indicated the extent to
which each gene affected the overall connectivity-transcriptome correlation for a given net-
work (see Materials and Methods). To investigate the similarity of gene contribution patterns
across different networks, we calculated the correlation between the GCI scores of each pair of
networks. As expected, for the three different definitions of the sentence-processing network,
which involved largely overlapping sets of cortical regions, the GCI scores showed substantial
correlations, which were also reproducible across the two independent rs-fMRI data sets
(Figure 3A; BIL&GIN: mean r = 0.40, from 0.22 to 0.50, N = 3; GEB: mean CGI r = 0.50, from
0.32 to 0.61, N = 3). Regarding the comparison networks, although some limited regional
overlap existed between the networks of different functions—for example, SNN and MDN

Neurobiology of Language

90

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

Figure 3. Contributions of individual genes to overall gene-brain network correlations. A. Correlations between GCI scores derived for each
functional network, based on the two independent data sets: BIL&GIN (left) and GEB (right). The three definitions of the sentence-processing
network showed relatively high correlations of their GCI scores, whereas different functional networks showed low correlations of their GCI
scores. B. The consensus set of genes (N = 41) for the six analyses of the sentence-processing network (i.e., three functional definition strategies
by two rs-fMRI data sets). Red color indicates genes that have been linked to language-related phenotypes in previous studies (e.g., reading
deficits, intellectual disability, and autism), and genes in bold indicate significant differential expression within the sentence-processing net-
work regions compared to other cortical regions (FDR corrected p < 0.05). SmSA, Supramodal Sentence Areas; SSA, Synthesized Sentence Areas; OcSA, One-contrast Sentence Areas; SNN, Spatial Navigation Network; MDN, Multiple Demand Network; DMN, Default Mode Network. shared six regions (Table S1)—the GCI scores showed very low correlations (<0.1) across the different functional networks (i.e., between each of the comparison networks with one another, and with any definition of the sentence-processing network), which was again consistent across the two independent rs-fMRI data sets for measuring functional connectivity (Figure 3A; BIL&GIN: mean r = 0.058 ± 0.047, N = 12; GEB: mean r = 0.017 ± 0.12, N = 12). To derive a set of genes related to the sentence-processing network with the highest con- sistency, we identified the 41 “consensus genes” that had positive CGI scores in all six anal- yses of this network, that is, the three definition strategies (SmSA, SSA, and OcSA), by the two independent rs-fMRI data sets (BIL&GIN and GEB) (Figure 3B). As expected, these 41 genes showed significantly greater CGI scores when compared to all other genes, within each of the six analyses (t values > 4.16, p’s < 0.0002; equal variances not assumed). In addition, we re- calculated transcriptomic similarity matrices based on only these 41 consensus genes, and found these matrices to be significantly correlated with the corresponding functional connec- tivity matrices (r = 0.30–0.55, p’s < 0.0001) (Figure S4; also the correlations based on the 41 genes were greater than the correlations based on 1000 random gene sets of equal size). In addition, these correlations were largely unaffected after controlling for inter-regional spatial distance between centers of regions (all p’s < 0.05) (Table S4). Several of the 41 consensus genes—including ROBO1, MET, PRRX1, CNTN6, and CTXN3—have been reported to affect language- or reading-related phenotypes such as dyslexia, and/or disorders that are often ac- companied by linguistic impairments, that is, intellectual disability, ASD, and schizophrenia (see Discussion). Further information on the consensus gene set is provided in Table S5. Neurobiology of Language 91 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . e d u n o / l / l a r t i c e - p d f / / / / / 1 1 7 7 1 8 6 7 8 8 8 n o _ a _ 0 0 0 0 4 p d . l f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Genes and the Sentence-Processing Network Table 1. Gene ontology terms for which the consensus set associated with the sentence processing network shows enrichment Term names BP: neuron projection development Corrected p 0.0118 BP: actin filament-based process BP: actin cytoskeleton organization 0.0455 0.0228 BP: supramolecular fiber organization 0.0196 BP: actin filament organization 0.00183 BP, biological process. # of term genes 902 705 609 590 360 Overlap 6 Intersecting genes SERPINF1, PRRX1, ROBO1, TRPC6, SPON2, GPRIN1 5 5 5 5 MET, ZYX, ARHGAP25, VILL, IGSF22 MET, ZYX, ARHGAP25, VILL, IGSF22 MET, ZYX, ARHGAP25, VILL, IGSF22 MET, ZYX, ARHGAP25, VILL, IGSF22 Biological Roles of Consensus Genes Associated with the Sentence-Processing Network Gene ontology analysis of the consensus set of 41 genes associated with the sentence- processing network identified significant enrichment for terms mostly related to the actin cy- toskeleton ( p’s < 0.05), all driven by the same set of five genes: MET, ZYX, ARHGAP25, VILL, and IGSF22 (Table 1). Another significant enrichment was for the set “neuron projection de- velopment” ( p = 0.012), and there were six genes that drove the enrichment results, which were SERPINF1, PRRX1, ROBO1, TRPC6, SPON2, and GPRIN1 (Table 1). None of these gene sets was significant when analyzing the 41 top genes for each of the comparison net- works ( p > 0.05) (Table S6).

To examine expression across human brain development of these sets of genes associated with
sentence-processing network connectivity, we used data from the Allen Institute’s BrainSpan pro-
ject, which includes human brain tissues from embryonic stages to adulthood, measured using
RNA sequencing (Li et al., 2018). Each of these genes has detectable expression in the frontal
and temporal regions during fetal development and in early childhood (Figure 4). Some of the
genes increase in expression through development all the way to adulthood, such as MET,
SERPINF1, and PRRX1. Others decrease such as ROBO1, GPRIN1, and TRPC6, while still remain-
ing expressed through to adulthood.

Figure 4. Gene expression in human frontal and temporal cortices, from embryo (8 weeks postconception) to adulthood (40 years), from the
BrainSpan atlas. Genes are listed in the legend in order of their average expression across all samples. The y axis uses a log10 scale to visualize
change over time for each gene. The samples were dissected according to the BrainSpan Technical White Paper: Transcriptome Profiling by RNA
Sequencing and Exon Microarray (2012).

Neurobiology of Language

92

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

We found that 14 genes of the 41 consensus gene set showed differential expression when
contrasting regions defined as belonging to the sentence-processing network against those in
the comparison networks (see Materials and Methods; uncorrected p < 0.05), among which 6 genes survived correction for multiple comparisons (FDR corrected p < 0.05): C12orf23, FAM65B, LY6H, MGP, SERPINF1, and SPON2, all with higher expression in the regions as- signed to the sentence-processing network. Notably, three genes related to “neural projection development” in gene ontology analysis showed significantly higher expression within the sentence-processing regions, which were SERPINF1, SPON2, and GPRIN1 ( p’s < 0.05; e.g., SERPINF1: t(46) = 3.85, p = 0.00036; 95% CI of the difference: 0.24 to 0.76), whereas none showed lower expression in the sentence-processing regions ( p’s > 0.05). (More information
can be found in Table S7.)

Of the 41 consensus genes associated with the sentence-processing network, data on 33
were available from a single-cell RNA-sequencing study of adult mouse cerebral cortex
(Zhang et al., 2014). Among these 33 genes, a majority (i.e., 25) showed expression in one
cell type that was at least 1.5 times higher than all other cell types, although not predominantly
in neurons versus other cell types (Table S8). For example, MET and CTXN3 showed 3.57- and
1.53-fold expression in neurons compared to the maximum expression in other cell types, re-
spectively. Data on 7 of the 41 genes were available in another single-cell gene expression
database (Zeisel et al., 2015), and 2 of these showed enrichment in interneurons: ANK1 and
SHD. Among the six genes driving the gene ontology enrichment results related to neural pro-
jection development, the following four showed their highest expressions in neurons: ROBO1,
TRPC6, SPON2, and GPRIN1. In addition, among the five genes driving the gene ontology
enrichment related to the actin cytoskeleton, data for three were available: MET showed high-
est expression in neurons, and ARHGAP25 and ZYX in microglia. We also queried several
recently available single-cell or single-nucleus gene expression data sets on the human neo-
cortex (Fan et al., 2018; Lake et al., 2018; Li et al., 2018; Zhong et al., 2018). Data on 22 of the
41 genes were available in one or more of these data sets. Among these 22 genes, 8 were
suggested to be specific to neuronal cell types: ANK1, ROBO1, COX7A1, LAIR2, LY6H, GLCCI1,
CBLN2, and PGM2L1 (Table S9). Five of these neuronal genes were markers of excitatory neurons:
ROBO1, LAIR2, GLCCI1, CBLN2, and PGM2L1.

Association with ASD, Schizophrenia, and Intelligence

We were interested to test whether genes involved in the cortical language network also contain
polymorphisms in the population that affect human cognitive or behavioral variation, or suscepti-
bility to neuropsychiatric disorders. This analysis requires GWAS results from large-scale studies,
which are currently lacking for reading/language measures in the general population, and for
disorders such as dyslexia or language impairment, which involve language-related deficits. We
analyzed ASD and schizophrenia, as these disorders can involve linguistic deficits, as well as in-
telligence in the general population, which correlates with linguistic abilities (see Introduction).
Using GWAS summary statistics for ASD based on up to 7,387 cases and 8,567 controls (Autism
Spectrum Disorders Working Group of The Psychiatric Genomics Consortium, 2017), we found
that the 41 consensus genes associated with the sentence-processing network were significantly
enriched for SNPs showing association with ASD (beta = 0.32, p = 0.0080, FDR corrected). No
such signal in relation to ASD was seen for the top genes (N = 41), with highest CGI scores for each
of the comparison networks (SNN: p = 0.45; MDN: p = 0.65; DMN: p = 0.074). In addition, no
significant enrichment was found for any functional network in relation to schizophrenia (GWAS
based on up to 36,989 cases and 113,075 controls; Schizophrenia Working Group of the
Psychiatric Genomics Consortium, 2014) ( p’s > 0.50). For intelligence (GWAS based on 78,308

Neurobiology of Language

93

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

individuals; Sniekers et al., 2017), there was a significant enrichment for the top genes associated
with the MDN (beta = 0.30, p = 0.035), with no other enrichment found ( p’s > 0.30).

DISCUSSION

In this study, we combined gene transcription profiles in the human brain with task and resting-
state fMRI data and investigated the gene expression correlates of the high-level linguistic net-
work. Specifically, with six analyses based on complementary strategies and independent data
sets, we revealed a significant correlation between the pattern of functional connectivity within
the sentence-processing network and the corresponding pattern of inter-regional gene expres-
sion similarity. To our knowledge, this is the first evidence for a link between gene transcription
profiles and language networks.

Although some previous studies have suggested that transcription profiles are linked to pat-
terns of structural and functional connectivity across the brain (see Introduction), this relationship
could be driven by broad differences in gene expression between sensory and higher-order as-
sociation cortices. Here, we focus on one core human cognitive ability— language—and exam-
ine a fine-grained pattern of brain-gene relationships within the network that supports sentence
processing. Across three definitions of sentence-processing regions, and using two independent
resting-state fMRI data sets, we established and characterized a relationship with gene expres-
sion: pairs of language-sensitive brain regions that show synchronization during rest also show
more similar profiles of gene expression. We then identified a consensus set of genes most con-
sistently driving the correlation between transcriptomic similarity and functional connectivity
within the sentence-processing network, and thereby gained new insights into the molecular
bases of the language-ready brain.

The underlying basis of positive correlations between transcriptome similarity and func-
tional connectivity is unknown. One possibility is that functionally linked cortical regions
are more likely to share specific aspects of neuronal physiology and developmental trajectory,
which support their temporally synchronized activity. When we adjusted for the spatial prox-
imity of regions, the correlations based on 867 genes that comprise the top 5% for differential
stability across donors became mostly nonsignificant, which suggests that these correlations
largely reflected local similarities of nearby regions. However, the correlations based on the
41 consensus genes linked to the sentence-processing network were not diminished by adjust-
ing for spatial proximity, which indicates that these genes are involved in longer-distance inter-
regional relationships within this network.

The consensus set of 41 genes was enriched for functions relating to the actin cytoskeleton,
driven by the genes MET, ZYX, ARHGAP25, VILL, and IGSF22. Mutations in actin cytoskeleton–
related genes have been linked tentatively to rightward hemispheric language dominance, a
rearrangement of language in the brain that occurs in roughly 1% of the population (Carrion-
Castillo et al., 2019). The actin cytoskeleton plays many roles in neurons, including in regulating
the extension and direction of axon growth (Coles & Bradke, 2015), and the remodeling and main-
tenance of neuronal architecture throughout the neuron lifetime (Gordon-Weeks & Fournier,
2014). The consensus genes associated with language-related networks were also enriched for
roles in neuron projection development. Each of the six genes driving this enrichment, that is,
SERPINF1, PRRX1, ROBO1, TRPC6, SPON2, and GPRIN1, has detectable expression in frontal
and temporal regions during fetal development and early childhood, and might play especially
important roles during language network development. The fact that these genes, known for their
neurodevelopmental roles, are expressed in adult cerebral cortex in a manner linked to func-
tional connectivity within language networks, also suggests that they have continuing roles in

Neurobiology of Language

94

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

maintaining adult cortical circuitry for its regionally specialized roles. Their continued expres-
sion into adulthood certainly attests to adult functions, although these are poorly understood.
Further support for the particular importance of these genes for the sentence-processing net-
work came from the fact that some of them showed higher gene expression in sentence net-
work regions than elsewhere in the cortex, and none showed significantly lower expression in
sentence network regions.

Several of the consensus genes associated with the sentence-processing network have been
reported to contain common polymorphisms or rare mutations that affect language- and
reading-related phenotypes, or else disorders that can be accompanied by reduced linguistic
abilities such as intellectual disability, ASD, or schizophrenia. ROBO1 has well-established
roles in brain development (Hannula-Jouppi et al., 2005; St Pourcain et al., 2014), and has
been implicated in dyslexia as well as phonological short-term memory (Bates et al., 2011;
Hannula-Jouppi et al., 2005), while its homologue ROBO2 has been linked with expressive
vocabulary during early language acquisition (St Pourcain et al., 2014). Mutations in MET are a
risk factor for ASD (Campbell et al., 2006; Jackson et al., 2009; Mukamel et al., 2011; Sousa
et al., 2009), and MET is also regulated by the transcription factor FOXP2 (Mukamel et al.,
2011), which causes developmental verbal dyspraxia when mutated (Lai et al., 2001). PRRX1
is associated with intellectual disability and delayed language acquisition (Lam & Morris,
2016). Mutations of CNTN6 have been reported in patients with speech and language delays,
intellectual disability, and atypical ASD (Kashevarova et al., 2014). CTXN3 has been linked to
schizophrenia (Lewis et al., 2003; Potkin et al., 2010). These observations suggest that our
consensus gene set linked to the sentence-processing network might provide additional
candidates for future studies of language-related individual differences and neurodevelop-
mental disorders. In addition, a recent study used functional connectivity based on rs-fMRI,
in combination with Allen brain gene expression data, to identify 136 genes highly correlated
with the modular organization of intrinsic connectivity networks (although not defined on the
basis of task functional data; Richiardi et al., 2015). We found that 9 of those genes overlap with
our set of 41 genes (CNTN6, CTXN3, LAIR2, MGP, SHD, TGFBI, ASGR2, CDR2L, and IQCJ).
The overlap of these genes is encouraging despite the different data sets, methods, and brain
region definitions used in the other study.

No large-scale GWAS has yet been published for reading/language measures in the general
population, or for disorders such as dyslexia or language impairment, which involve language-
related deficits. However, through analysis of large-scale GWAS summary statistics for ASD, a
disorder that can also involve linguistic deficits (Kleinhans et al., 2008; Lombardo et al., 2015),
we found that the consensus gene set associated with connectivity in the sentence-processing
network is enriched for common SNPs that contribute to the polygenic liability with ASD in
the population. It is therefore possible that variants in these genes associate with ASD due, at
least in part, to dysfunction of high-level language networks. Note that although language-
related deficits may no longer be considered core to ASD in the latest edition of the Diagnostic
and Statistical Manual of Mental Disorders (DSM-5; American Psychiatric Association, 2013),
the earlier DSM-IV criteria were used for the large-scale GWAS, the results of which we used in
the present study (Autism Spectrum Disorders Working Group of the Psychiatric Genomics
Consortium, 2017), in which language delays were considered an important aspect of the disorder
(Grzadzinski, Huerta, & Lord, 2013). No such enrichment signal for ASD was observed for gene
sets associated with the other functional networks that we analyzed by way of comparison, that is,
spatial navigation, multiple demand, and DMNs. Of interest, a recent large-scale brain imaging
analysis of ASD found that cortical thinning was present in many of the regions included in our
definitions of the sentence-processing network (van Rooij et al., 2018). For schizophrenia and

Neurobiology of Language

95

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

intelligence we found no enrichment of association signals within the consensus gene set associ-
ated with the sentence-processing network. This pattern may relate to severity, as language deficits
in patients with schizophrenia have been reported to be less severe than in patients with ASD (again
based on earlier diagnostic criteria) (Chisholm, Lin, Abu-Akel, & Wood, 2015; Spek & Wouters,
2010). Alternatively, the genetic underpinnings of high-level language processing may be of little
relevance to schizophrenia or intelligence in the general population. However, the statistical power
to detect these relations may also be an issue.

There is at present no consensus in the field on what should constitute the precise high-level
language network and how best to define it (Fedorenko & Thompson-Schill, 2014; Labache
et al., 2018). To circumvent this problem, we focused on sentence-level processing specifically,
but used three complementary approaches for defining brain regions important for this function:
one based on a conjunction of three task contrasts and functional laterality (SmSA; Labache
et al., 2018), one based on a large-scale meta-analysis of prior neuroimaging studies (SSA;
Yarkoni et al., 2011), and one based on a single task contrast (OcSA; Fedorenko et al., 2010).
Reassuringly, our three approaches to defining sentence-level processing regions yielded similar
sets of regions, especially with respect to areas in the inferior frontal and middle temporal gyri.
Likely because of the overlap between the three network definitions, their transcriptomic corre-
lates were also similar, as well as the contributions of individual genes. This overall concordance
supports the validity of the different definitions, allowing us to define a consensus set of genes
that emerged consistently across all definitions and both of the rs-fMRI data sets, and is thus not
dependent solely on any individual approach for defining the functional network.

Although multiple cognitive processes are likely to be involved in sentence processing, at least
two of the key elements, syntax and semantics, are tightly integrated in the brain, whereas com-
binatorial processing is a key part of both of them (Fedorenko, Mineroff, Siegelman, & Blank,
2018). Regardless, we do not assume any straightforward mapping of high-level linguistic process-
ing to a small subset of underlying genes. As noted in the Introduction, the creation and mainte-
nance of brain circuits underlying language networks surely depend on the coordinated actions of
very many genes. Here we show that the overall transcriptional similarity of regions involved in
sentence processing is correlated with their functional connectivity, and we identify a relatively
small number of genes that contribute to that correlation most strongly and consistently, based on
current data. We expect that many more genes will be identifiable in relation to this functional
network, as higher quality gene expression data sets are generated, and that some combinations of
genes may be more or less important for particular cognitive subprocesses.

Functional connectivity of regions during task performance is correlated with that observed
at rest, which suggests an “intrinsic” architecture of functional organization (Blank et al., 2014;
Cole et al., 2014; Labache et al., 2018), which may also be linked to individual differences in
behavior (Arnold, Protzner, Bray, Levy, & Iaria, 2014; Kong, Wang, et al., 2017). Thus, the set
of regions for sentence processing defined here could be used in future studies of very large
data sets, such as the UK Biobank (Sudlow et al., 2015) or ENIGMA Consortium (Thompson
et al., 2014), which include neuroanatomical and/or intrinsic connectivity data but limited task
fMRI data for language functions. For instance, the functional regions defined here could be
used to investigate structural/functional variability during maturation, aging, and/or patholog-
ical processes in the general population, including those associated with developmental lan-
guage disorders.

For the comparison networks included in this study, that is, the spatial navigation, multiple
demand, and default mode networks, we also found similar overall correlations between the
functional connectivity networks and the corresponding transcriptomic networks, although

Neurobiology of Language

96

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

with largely different individual genes contributing. This was indicated by relatively low cor-
relations of GCI scores across different networks, and also that different functional networks
produced distinct results in gene ontology analysis. There were individual genes such as MET
that showed a relatively high contribution to the connectivity-transcriptomic correlation for the
sentence-processing network, but negligible contributions to other functional networks.
Therefore, although the existence of connectivity-transcriptome correlations appears to be
ubiquitous across functional networks, our study yields the important basic insight that differ-
ent functional networks can involve differently weighted genetic contributions at the level of
cortical gene expression. This is broadly consistent with genetic correlations between different
cognitive abilities such as verbal and nonverbal performance, as assessed in population genetic
analysis, which often indicate shared but also independent genetic effects on such pairs of traits
(Trzaskowski, Shakeshaft, & Plomin, 2013). Our study therefore suggests a novel approach to
complement existing genetic epidemiological approaches, for understanding the general versus
specific influences on diverse cognitive abilities. Note that, in the present study, in order to
achieve comparable statistical power in the network similarity analysis across different networks,
we purposely defined similar numbers of top regions for each comparison network, even though
this meant using different thresholds for including regions in each comparison network. Further
studies may investigate how using more or less inclusive definitions of these functional networks
affect relationships with gene expression.

The present study was based on gene expression data from a small number of donors in the
Allen Brain data base. Postmortem brain tissues suitable for transcriptomic analysis are difficult
to collect from individuals who were healthy immediately before death, which is necessary
because RNA degrades within hours after death. Thus, the availability of high-quality gene
expression data from the human brain is necessarily limited. Moreover, most of the Allen brain
data, and all the data used for the present study, are based on the older and relatively noisy
transcriptomic technology of microarrays, rather than the more accurate, latest method of RNA
sequencing. In addition, some genes known to be involved in language, such as FOXP2 (Lai
et al., 2001), had to be excluded from our analyses because of the inclusion criterion of sta-
bility across donors. FOXP2 showed a low interdonor differential stability of 0.22 across cere-
bral cortex samples, especially low across frontal samples (−0.07) and across temporal
samples (0.16) in the Allen brain data (Hawrylycz et al., 2015). Thus, it is likely that data of
sufficient quality were not available for other genes too that might have been of relevance to
language networks, such that future studies using RNA sequencing in larger numbers of indi-
viduals, and with more sampling per cortical region, would be well motivated. In addition,
interindividual variabilities have been observed in the precise locations of language-sensitive
regions (Fedorenko et al., 2010; Steinmetz & Seitz, 1991) as well as the cytoarchitectonic fea-
tures of higher-level cognitive areas (e.g., BA 44 and 45) (Fischl et al., 2008). To address these
issues, we focused on only a subset of genes with the highest differential stability across do-
nors, in order to ensure a reliable and representative, average map of gene expression, which
was necessary for this study. Interdonor differential stability is particularly low for cortical re-
gions in the Allen data set, for the majority of individual genes (Figure S2), so that only a mi-
nority of genes have a useful degree of consistency for group-based analysis. Having made a
selection based on high stability across donors, the reproducibility of our genetic findings with
respect to two independent rs-fMRI data sets supported their validity. Gene expression data
from a larger number of individuals, ideally when both anatomical and functional data from
the same individuals are available, would help to further improve upon this aspect. However,
the prospective enrollment of eventual donors of postmortem tissue into brain-imaging studies
during life would be extremely difficult to achieve, within a typical project duration.

Neurobiology of Language

97

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

As noted in the Introduction, several recent studies have examined the relationship between
postmortem brain transcriptomics and in vivo neuroimaging in different data sets (e.g., Anderson
et al., 2018; Reardon et al., 2018; Richiardi et al., 2015; Seidlitz et al., 2018; Whitaker et al., 2016).
Although these studies have provided strong evidence that patterns of gene expression covary with
anatomical and functional organization of the human brain, none of them focused on specific core
cognitive functions such as language. Unlike these previous studies, the present study provided
new evidence for the gene expression correlates of a high-level language network, and additionally
identified a set of individual genes likely to be important for cortical language functions.

A limitation of our study is the use of transcriptomic data from blocks of cerebral cortical
tissues, which comprised many cell types. Future databases based on single-cell transcripto-
mics would likely provide further insights. Although efforts to produce such data are underway
for a limited number of human cerebral cortical regions, no database currently exists with
broad mapping over the cerebral cortex. It will be a major undertaking in the future for the
human brain science community to achieve a widespread cerebral cortical gene expression
map at single cell resolution. For the time being, we queried our consensus genes associated
with the language-related network using single-cell or single-nucleus transcriptomic data from
brain tissues from a restricted number of regions/structures, which gave information about
whether specific genes are relatively more highly expressed in neurons versus some major
classes of glial cells. However, even if a gene of interest was expressed at a comparable or
higher level in glia than in neurons, it might still influence neuronal physiology and circuit
properties, either directly through its expression in neurons, or indirectly via the interactions
of surrounding glial cells with neurons.

As regards head motion during scanning, both signal artifacts (e.g., Friston, Williams, Howard,
Frackowiak, & Turner, 1996; Kong, 2014; Power, Barnes, Snyder, Schlaggar, & Petersen, 2012;
Satterthwaite et al., 2012) and functionally meaningful and reliable variability (e.g., Couvy-
Duchesne et al., 2014; Hodgson et al., 2017; Kong et al., 2014; Zeng et al., 2014) have been
suggested to relate to this. In the present study, we carried out traditional preprocessing in relation
to head motion, including motion correction, quality control in terms of the amount of head mo-
tion, and confound regression, but we did not apply further preprocessing steps proposed more
recently, such as the “scrubbing” method (Power et al., 2012). Motion issues mainly affect data
from young children, the elderly, and disorder groups (Kong et al., 2014; Satterthwaite et al.,
2012), although most of the patients involved in the present study were healthy young adults, with
a good capacity to stay still in scanners. More importantly, we found high correlations (up to 0.92)
of the functional connectivity patterns extracted from two independent rs-fMRI data sets (i.e.,
BIL&GIN and GEB), and we focused on consensus genes arising from the analysis of these two
data sets. Thus, any motion-induced artifacts would have minimal impact on our analyses.

In summation, we provide a first description of the overall transcriptomic correlates of brain
networks underlying high-level linguistic processing, as well as identify a set of individual genes
likely to be most important. These findings help elucidate the molecular basis of language net-
works as distinct from functional networks important for other aspects of cognition. A link of this
genetic infrastructure to ASD is also suggested by our data. Finally, we propose functional con-
nectivity and gene expression analysis as a complementary approach to existing genetic epide-
miological and genetic association approaches, for understanding complex cognitive traits.

ACKNOWLEDGMENTS

We thank the NeuroSynth, Allen Human Brain Atlas, and the Psychiatric Genomics Consortium
for data sharing.

Neurobiology of Language

98

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

AUTHOR CONTRIBUTIONS

Xiang-Zhen Kong: Conceived and designed the study; Analyzed the data; Contributed data/
materials/analysis tools; Contributed to the writing of the manuscript. Nathalie Tzourio-
Mazoyer: Contributed data/materials/analysis tools; Contributed to the writing of the manu-
script. Marc Joliot: Analyzed the data; Contributed data/materials/analysis tools; Contributed
to the writing of the manuscript. Evelina Fedorenko: Contributed data/materials/analysis tools;
Contributed to the writing of the manuscript. Jia Liu: Contributed data/materials/analysis tools;
Contributed to the writing of the manuscript. Simon E. Fisher: Contributed to the writing of the
manuscript. Clyde Francks: Conceived and designed the study; Contributed to the writing of
the manuscript.

FUNDING INFORMATION

Simon Fisher, Clyde Francks, and Xiang-Zhen Kong; Max-Planck-Gesellschaft (Max Planck
Society, Germany) (http://dx.doi.org/10.13039/501100004189). Evelina Fedorenko was
funded by NIH awards R01-DC016607 and R01-DC016950.

REFERENCES

Alleninf. (2018). [Code]. Retrieved from https://github.com/chrisgorgo/

alleninf

American Psychiatric Association. (2013). Diagnostic and statistical
manual of mental disorders (5th ed.). Washington, DC: Author.
Anderson, K. M., Krienen, F. M., Choi, E. Y., Reinen, J. M., Yeo, B.
T. T., & Holmes, A. J. (2018). Gene expression links functional
networks across cortex and striatum. Nature Communications,
9(1), 1428. https://doi.org/10.1038/s41467-018-03811-x

Arnold, A. E., Protzner, A. B., Bray, S., Levy, R. M., & Iaria, G. (2014).
Neural network configuration and efficiency underlies individual dif-
ferences in spatial orientation ability. Journal of Cognitive Neuro-
science, 26(2), 380–394. https://doi.org/10.1162/jocn_a_00491
Autism Spectrum Disorders Working Group of the Psychiatric
Genomics Consortium. (2017). Meta-analysis of GWAS of over
16,000 individuals with autism spectrum disorder highlights a
novel locus at 10q24.32 and a significant overlap with schizo-
phrenia. Molecular Autism, 8, 21. https://doi.org/10.1186/s13229-
017-0137-9

Avinun, R., Nevo, A., Knodt, A. R., Elliott, M. L., & Hariri, A. R.
(2018). Replication in imaging genetics: The case of threat-
related amygdala reactivity. Biological Psychiatry, 84(2), 148–159.
https://doi.org/10.1016/j.biopsych.2017.11.010

Bartres-Faz, D., Gonzalez-Escamilla, G., Vaque-Alcazar, L.,
Abellaneda-Perez, K., Valls-Pedret, C., Ros, E., & Grothe, M. J.
(2019). Characterizing the molecular architecture of cortical re-
gions associated with high educational attainment in older indi-
viduals. Journal of Neuroscience, 39(23), 4566–4575. https://doi.
org/10.1523/JNEUROSCI.2370-18.2019

Bates, T. C., Luciano, M., Medland, S. E., Montgomery, G. W.,
Wright, M. J., & Martin, N. G. (2011). Genetic variance in a com-
ponent of the language acquisition device: ROBO1 polymor-
phisms associated with phonological buffer deficits. Behavior
Genetics, 41(1), 50–57. https://doi.org/10.1007/s10519-010-9402-9
Bearden, C. E., Rosso, I. M., Hollister, J. M., Sanchez, L. E., Hadley,
T., & Cannon, T. D. (2000). A prospective cohort study of
childhood behavioral deviance and language abnormalities as
predictors of adult schizophrenia. Schizophrenia Bulletin, 26(2),
395–410.

Bigos, K. L., & Weinberger, D. R. (2010). Imaging genetics—days of
future past. Neuroimage, 53(3), 804–809. https://doi.org/10.1016/
j.neuroimage.2010.01.035

Blank, I., Kanwisher, N., & Fedorenko, E. (2014). A functional dis-
sociation between language and multiple-demand systems
revealed in patterns of BOLD signal fluctuations. Journal of
Neurophysiology, 112(5), 1105–1118. https://doi.org/10.1152/jn.
00884.2013

Brainspan. (2012). Technical white paper: Transcriptome profiling
by RNA sequencing and exon microarray (Version 5). Retrieved
from help.brain-map.org/download/attachments/3506181/
Transcriptome_Profiling.pdf

Buckner, R. L., Andrews-Hanna, J. R., & Schacter, D. L. (2008). The
brain’s default network: Anatomy, function, and relevance to dis-
ease. Annals of the New York Academy of Sciences, 1124, 1–38.
https://doi.org/10.1196/annals.1440.011

Burt, J. B., Demirtas, M., Eckner, W. J., Navejar, N. M., Ji, J. L.,
Martin, W. J., … Murray, J. D. (2018). Hierarchy of transcriptomic
specialization across human cortex captured by structural neuro-
imaging topography. Nature Neuroscience, 21(9), 1251–1259.
https://doi.org/10.1038/s41593-018-0195-0

Campbell, D. B., Sutcliffe, J. S., Ebert, P. J., Militerni, R., Bravaccio, C.,
Trillo, S., … Levitt, P. (2006). A genetic variant that disrupts MET tran-
scription is associated with autism. Proceedings of the National
Academy of Sciences of the United States of America, 103(45),
16834–16839. https://doi.org/10.1073/pnas.0605296103

Carrion-Castillo, A., Van der Haegen, L., Tzourio-Mazoyer, N.,
Kavaklioglu, T., Badillo, S., Chavent, M., … Francks, C. (2019).
Genome sequencing for rightward hemispheric language domi-
nance. Genes, Brain and Behavior, e12572. https://doi.org/10.
1111/gbb.12572

Carter, C. S., Bearden, C. E., Bullmore, E. T., Geschwind, D. H.,
Glahn, D. C., Gur, R. E., … Weinberger, D. R. (2017). En-
hancing the informativeness and replicability of imaging geno-
mics studies. Biological Psychiatry, 82(3), 157–164. https://doi.
org/10.1016/j.biopsych.2016.08.019

Chisholm, K., Lin, A., Abu-Akel, A., & Wood, S. J. (2015). The as-
sociation between autism and schizophrenia spectrum disorders:

Neurobiology of Language

99

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

A review of eight alternate models of co-occurrence. Neuro-
science and Biobehavioral Reviews, 55, 173–183. https://doi.
org/10.1016/j.neubiorev.2015.04.012

Cole, M. W., Bassett, D. S., Power, J. D., Braver, T. S., & Petersen,
S. E. (2014). Intrinsic and task-evoked network architectures of
the human brain. Neuron, 83(1), 238–251. https://doi.org/10.1016/
j.neuron.2014.05.014

Coles, C. H., & Bradke, F. (2015). Coordinating neuronal actin-
microtubule dynamics. Current Biology, 25(15), R677–691. https://
doi.org/10.1016/j.cub.2015.06.020

Couvy-Duchesne, B., Blokland, G. A., Hickie, I. B., Thompson, P. M.,
Martin, N. G., de Zubicaray, G. I., … Wright, M. J. (2014).
Heritability of head motion during resting state functional MRI in
462 healthy twins. Neuroimage, 102P2, 424–434. https://doi.org/
10.1016/j.neuroimage.2014.08.010

Crittenden, B. M., Mitchell, D. J., & Duncan, J. (2016). Task en-
coding across the multiple demand cortex is consistent with a
frontoparietal and cingulo-opercular dual networks distinction.
Journal of Neuroscience, 36(23), 6147–6155. https://doi.org/10.
1523/JNEUROSCI.4590-15.2016

Dale, P. S., Simonoff, E., Bishop, D. V. M., Eley, T. C., Oliver, B., Price,
T. S., … Plomin, R. (1998). Genetic influence on language delay in
two-year-old children. Nature Neuroscience, 1(4), 324–328.
https://doi.org/10.1038/1142

Deriziotis, P., & Fisher, S. E. (2017). Speech and language:
Translating the genome. Trends in Genetics, 33(9), 642–656.
https://doi.org/10.1016/j.tig.2017.07.002

Doucet, G., Naveau, M., Petit, L., Delcroix, N., Zago, L., Crivello,
F., … Joliot, M. (2011). Brain activity at rest: A multiscale hierar-
chical functional organization. Journal of Neurophysiology, 105(6),
2753–2763. https://doi.org/10.1152/jn.00895.2010

Dronkers, N. F., Wilkins, D. P., Van Valin, R. D., Jr., Redfern, B. B.,
& Jaeger, J. J. (2004). Lesion analysis of the brain areas involved
in language comprehension. Cognition, 92(1–2), 145–177. https://
doi.org/10.1016/j.cognition.2003.11.002

Eising, E., Carrion-Castillo, A., Vino, A., Strand, E. A., Jakielski, K. J.,
Scerri, T. S., … Fisher, S. E. (2018). A set of regulatory genes co-
expressed in embryonic human brain is implicated in disrupted
speech development. Molecular Psychiatry, 24(7), 1065–1078.
https://doi.org/10.1038/s41380-018-0020-x

Fan, X., Dong, J., Zhong, S., Wei, Y., Wu, Q., Yan, L., … Tang,
F. (2018). Spatial transcriptomic survey of human embryonic
cerebral cortex by single-cell RNA-seq analysis. Cell
Research, 28(7), 730–745. https://doi.org/10.1038/s41422-
018-0053-3

Fedorenko, E., Behr, M. K., & Kanwisher, N. (2011). Functional
specificity for high-level linguistic processing in the human brain.
Proceedings of the National Academy of Sciences of the United
States of America, 108(39), 16428–16433. https://doi.org/
10.1073/pnas.1112937108

Fedorenko, E., Duncan, J., & Kanwisher, N. (2013). Broad domain
generality in focal regions of frontal and parietal cortex.
Proceedings of the National Academy of Sciences of the
United States of America, 110(41), 16616–16621. https://doi.org/
10.1073/pnas.1315235110

Fedorenko, E., Hsieh, P. J., Nieto-Castanon, A., Whitfield-Gabrieli,
S., & Kanwisher, N. (2010). New method for fMRI investigations
of language: Defining ROIs functionally in individual subjects.
Journal of Neurophysiology, 104(2), 1177–1194. https://doi.org/10.
1152/jn.00032.2010

Fedorenko, E., Mineroff, Z., Siegelman, M., & Blank, I. (2018). Word
meanings and sentence structure recruit the same set of fronto-
temporal regions during comprehension. bioRxiv.

Fedorenko, E., & Thompson-Schill, S. L. (2014). Reworking the lan-
guage network. Trends in Cognitive Sciences, 18(3), 120–126.
https://doi.org/10.1016/j.tics.2013.12.006

Fischl, B., Rajendran, N., Busa, E., Augustinack, J., Hinds, O., Yeo,
B. T., … Zilles, K. (2008). Cortical folding patterns and predicting
cytoarchitecture. Cerebral Cortex, 18(8), 1973–1980. https://doi.
org/10.1093/cercor/bhm225

Fisher, S. E., & Scharff, C. (2009). FOXP2 as a molecular window
into speech and language. Trends in Genetics, 25(4), 166–177.
https://doi.org/10.1016/j.tig.2009.03.002

Fornito, A., Arnatkeviciute, A., & Fulcher, B. (2019). Bridging the gap
between connectome and transcriptome. Trends in Cognitive
Sciences, 23(1), 34–50. https://doi.org/10.1016/j.tics.2018.10.005
French, L., & Pavlidis, P. (2011). Relationships between gene ex-
pression and brain wiring in the adult rodent brain. PLoS Compu-
tational Biology, 7(1), e1001049. https://doi.org/10.1371/journal.
pcbi.1001049

Friston, K. J., Williams, S., Howard, R., Frackowiak, R. S., & Turner,
R. (1996). Movement-related effects in fMRI time-series. Mag-
netic Resonance in Medicine, 35(3), 346–355.

Fulcher, B. D., & Fornito, A. (2016). A transcriptional signature of
hub connectivity in the mouse connectome. Proceedings of the
National Academy of Sciences of the United States of America,
113(5), 1435–1440. https://doi.org/10.1073/pnas.1513302113
The Gene Ontology Resource. (2020). [Database]. Retrieved from

http://geneontology.org/

Geschwind, D. H., & Flint, J. (2015). Genetics and genomics of psy-
chiatric disease. Science, 349(6255), 1489–1494. https://doi.org/
10.1126/science.aaa8954

Goel, P., Kuceyeski, A., LoCastro, E., & Raj, A. (2014). Spatial patterns
of genome-wide expression profiles reflect anatomic and fiber con-
nectivity architecture of healthy human brain. Human Brain
Mapping, 35(8), 4204–4218. https://doi.org/10.1002/hbm.22471
Gordon-Weeks, P. R., & Fournier, A. E. (2014). Neuronal cyto-
skeleton in synaptic plasticity and regeneration. Journal of
Neurochemistry, 129(2), 206–212. https://doi.org/10.1111/jnc.12502
Gorgolewski, K., Burns, C. D., Madison, C., Clark, D., Halchenko,
Y. O., Waskom, M. L., & Ghosh, S. S. (2011). Nipype: A flexible,
lightweight and extensible neuroimaging data processing frame-
work in python. Frontiers in Neuroinformatics, 5, 13. https://doi.
org/10.3389/fninf.2011.00013

G:Profiler. (2019). [Analysis tool]. Retrieved from https://biit.cs.ut.

ee/gprofiler

Graham, S. A., & Fisher, S. E. (2015). Understanding language from a
genomic perspective. Annual Review of Genetics, 49, 131–160.
https://doi.org/10.1146/annurev-genet-120213-092236

Grzadzinski, R., Huerta, M., & Lord, C. (2013). DSM-5 and autism
spectrum disorders (ASDs): An opportunity for identifying ASD
subtypes. Molecular Autism, 4(1), 12. https://doi.org/10.1186/
2040-2392-4-12

Guen, Y. L., Amalric, M., Pinel, P., Pallier, C., & Frouin, V. (2018).
Shared genetic aetiology between cognitive performance and
brain activations in language and math tasks. bioRxiv. https://
doi.org/10.1101/386805

GWAS Summary Statistics. (2018). [Summary statistics]. Retrieved

from https://ctg.cncr.nl/software/summary_statistics

Hannula-Jouppi, K., Kaminen-Ahola, N., Taipale, M., Eklund, R.,
Nopola-Hemmi, J., Kaariainen, H., & Kere, J. (2005). The axon
guidance receptor gene ROBO1 is a candidate gene for develop-
mental dyslexia. PLoS Genetics, 1(4), 467–474. https://doi.org/10.
1371/journal.pgen.0010050

Hawrylycz, M., Miller, J. A., Menon, V., Feng, D., Dolbeare, T.,
Guillozet-Bongaarts, A. L., … Lein, E. (2015). Canonical genetic

Neurobiology of Language

100

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

signatures of the adult human brain. Nature Neuroscience, 18(12),
1832–1844. https://doi.org/10.1038/nn.4171

Hawrylycz, M. J., Lein, E. S., Guillozet-Bongaarts, A. L., Shen, E. H.,
Ng, L., Miller, J. A., … Jones, A. R. (2012). An anatomically com-
prehensive atlas of the adult human brain transcriptome. Nature,
489(7416), 391–399. https://doi.org/10.1038/nature11405

Helfinstein, S. M., Schonberg, T., Congdon, E., Karlsgodt, K. H.,
Mumford, J. A., Sabb, F. W., … Poldrack, R. A. (2014). Predicting
risky choices from brain activity patterns. Proceedings of the
National Academy of Sciences of the United States of America,
111(7), 2470–2475. https://doi.org/10.1073/pnas.1321728111
Hodgson, K., Poldrack, R. A., Curran, J. E., Knowles, E. E., Mathias, S.,
Goring, H. H. H., … Glahn, D. C. (2017). Shared genetic factors in-
fluence head motion during MRI and body mass index. Cerebral
Cortex, 27(12), 5539–5546. https://doi.org/10.1093/cercor/bhw321
Jackson, P. B., Boccuto, L., Skinner, C., Collins, J. S., Neri, G.,
Gurrieri, F., & Schwartz, C. E. (2009). Further evidence that the
rs1858830 C variant in the promoter region of the MET gene is as-
sociated with autistic disorder. Autism Research, 2(4), 232–236.
https://doi.org/10.1002/aur.87

Joliot, M., Jobard, G., Naveau, M., Delcroix, N., Petit, L., Zago, L.,
… Tzourio-Mazoyer, N. (2015). AICHA: An atlas of intrinsic con-
nectivity of homotopic areas. Journal of Neuroscience Methods,
254, 46–59. https://doi.org/10.1016/j.jneumeth.2015.07.013
Kashevarova, A. A., Nazarenko, L. P., Schultz-Pedersen, S.,
Skryabin, N. A., Salyukova, O. A., Chechetkina, N. N., …
Lebedev, I. N. (2014). Single gene microdeletions and microdupli-
cation of 3p26.3 in three unrelated families: CNTN6 as a new can-
didate gene for intellectual disability. Molecular Cytogenetics, 7.
https://doi.org/10.1186/s13039-014-0097-0

Kaufman, L., Ayub, M., & Vincent, J. B. (2010). The genetic basis of
non-syndromic intellectual disability: A review. Journal of Neuro-
developmental Disorders, 2(4), 182–209. https://doi.org/10.1007/
s11689-010-9055-2

Kleinhans, N. M., Muller, R. A., Cohen, D. N., & Courchesne, E.
(2008). Atypical functional lateralization of language in autism
spectrum disorders. Brain Research, 1221, 115–125. https://doi.
org/10.1016/j.brainres.2008.04.080

Kong, X. Z. (2014). Association between in-scanner head motion with
cerebral white matter microstructure: A multiband diffusion-weighted
MRI study. PeerJ, 2, e366. https://doi.org/10.7717/peerj.366

Kong, X. Z., Song, Y., Zhen, Z., & Liu, J. (2017). Genetic variation in
S100B modulates neural processing of visual scenes in Han
Chinese. Cerebral Cortex, 27(2), 1326–1336. https://doi.org/
10.1093/cercor/bhv322

Kong, X. Z., Wang, X., Pu, Y., Huang, L., Hao, X., Zhen, Z., & Liu, J.
(2017). Human navigation network: The intrinsic functional
organization and behavioral relevance. Brain Structure and
Function, 222(2), 749–764. https://doi.org/10.1007/s00429-
016-1243-8

Kong, X. Z., Zhen, Z., Li, X., Lu, H. H., Wang, R., Liu, L., … Liu, J.
(2014). Individual differences in impulsivity predict head motion
during magnetic resonance imaging. PLoS One, 9(8), e104989.
https://doi.org/10.1371/journal.pone.0104989

Konopka, G., & Roberts, T. F. (2016). Insights into the neural and
genetic basis of vocal communication. Cell, 164(6), 1269–1276.
https://doi.org/10.1016/j.cell.2016.02.039

Labache, L., Joliot, M., Saracco, J., Jobard, G., Hesling, I., Zago,
L., … Tzourio-Mazoyer, N. (2018). A SENtence Supramodal
Areas AtlaS (SENSAAS) based on multiple task-induced activa-
tion mapping and graph analysis of intrinsic connectivity in
144 healthy right-handers. Brain Structure and Function, 224(2),
859–882. https://doi.org/10.1007/s00429-018-1810-2

Lai, C. S. L., Fisher, S. E., Hurst, J. A., Vargha-Khadem, F., &
Monaco, A. P. (2001). A forkhead-domain gene is mutated
in a severe speech and language disorder. Nature, 413(6855),
519–523. https://doi.org/10.1038/35097076

Lake, B. B., Chen, S., Sos, B. C., Fan, J., Kaeser, G. E., Yung, Y.
C., … Zhang, K. (2018). Integrative single-cell analysis of transcrip-
tional and epigenetic states in the human adult brain. Nature
Biotechnology, 36(1), 70–80. https://doi.org/10.1038/nbt.4038
Lam, F., & Morris, C. (2016). Nine year old boy with chromosome
1q23.3-q25.1 deletion. American Journal of Medical Genetics
Part A, 170(11), 3013–3017. https://doi.org/10.1002/ajmg.
a.37843

Leech, R., Kamourieh, S., Beckmann, C. F., & Sharp, D. J. (2011).
Fractionating the default mode network: Distinct contributions of
the ventral and dorsal posterior cingulate cortex to cognitive con-
trol. Journal of Neuroscience, 31(9), 3217–3224. https://doi.org/
10.1523/JNEUROSCI.5626-10.2011

Lewis, C. M., Levinson, D. F., Wise, L. H., DeLisi, L. E., Straub, R. E.,
Hovatta, I., … Helgason, T. (2003). Genome scan meta-analysis
of schizophrenia and bipolar disorder, part II: Schizophrenia.
American Journal of Human Genetics, 73(1), 34–48. https://doi.
org/10.1086/376549

Li, M., Santpere, G., Imamura Kawasawa, Y., Evgrafov, O. V.,
Gulden, F. O., Pochareddy, S., … Sestan, N. (2018). Integrative
functional genomic analysis of human brain development and
neuropsychiatric risks. Science, 362(6420). https://doi.org/10.1126/
science.aat7615

Lombardo, M. V., Pierce, K., Eyler, L. T., Barnes, C. C., Ahrens-
Barbeau, C., Solso, S., … Courchesne, E. (2015). Different func-
tional neural substrates for good and poor language outcome in
autism. Neuron, 86(2), 567–577. https://doi.org/10.1016/j.neuron.
2015.03.023

MAGMA. (Version 1.6). (2020). [Analysis tool]. Retrieved from

https://ctg.cncr.nl/software/magma

Mahowald, K., & Fedorenko, E. (2016). Reliable individual-level
neural markers of high-level language processing: A necessary
precursor for relating neural variability to behavioral and genetic
variability. Neuroimage, 139, 74–93. https://doi.org/10.1016/j.
neuroimage.2016.05.073

Mazoyer, B., Mellet, E., Perchey, G., Zago, L., Crivello, F., Jobard,
G., … Tzourio-Mazoyer, N. (2016). BIL&GIN: A neuroimaging,
cognitive, behavioral, and genetic database for the study of hu-
man brain lateralization. Neuroimage, 124(Pt B), 1225–1231.
https://doi.org/10.1016/j.neuroimage.2015.02.071

McColgan, P., Gregory, S., Seunarine, K. K., Razi, A., Papoutsi, M.,
Johnson, E., … Track-On HD Investigators. (2018). Brain regions
showing white matter loss in Huntington’s disease are enriched
for synaptic and metabolic genes. Biological Psychiatry, 83(5),
456–465. https://doi.org/10.1016/j.biopsych.2017.10.019
McKiernan, K. A., Kaufman, J. N., Kucera-Thompson, J., & Binder,
J. R. (2003). A parametric manipulation of factors affecting task-
induced deactivation in functional neuroimaging. Journal of Cog-
nitive Neuroscience, 15(3), 394–408. https://doi.org/10.1162/
089892903321593117

Mukamel, Z., Konopka, G., Wexler, E., Osborn, G. E., Dong, H.,
Bergman, M. Y., … Geschwind, D. H. (2011). Regulation of
MET by FOXP2, genes implicated in higher cognitive dysfunction
and autism risk. Journal of Neuroscience, 31(32), 11437–11442.
https://doi.org/10.1523/JNEUROSCI.0181-11.2011

Newbury, D. F., Bishop, D. V. M., & Monaco, A. P. (2005). Genetic
influences on language impairment and phonological short-term
memory. Trends in Cognitive Sciences, 9(11), 528–534. https://
doi.org/10.1016/j.tics.2005.09.002

Neurobiology of Language

101

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

Park, D. C., Polk, T. A., Hebrank, A. C., & Jenkins, L. J. (2010). Age
differences in default mode activity on easy and difficult spatial
judgment tasks. Frontiers in Human Neuroscience, 3, 75. https://
doi.org/10.3389/neuro.09.075.2009

Potkin, S. G., Macciardi, F., Guffanti, G., Fallon, J. H., Wang, Q.,
Turner, J. A., … Xie, X. (2010). Identifying gene regulatory net-
works in schizophrenia. Neuroimage, 53(3), 839–847. https://
doi.org/10.1016/j.neuroimage.2010.06.036

Power, J. D., Barnes, K. A., Snyder, A. Z., Schlaggar, B. L., & Petersen,
S. E. (2012). Spurious but systematic correlations in functional
connectivity MRI networks arise from subject motion. Neuro-
image, 59(3), 2142–2154. https://doi.org/10.1016/j.neuroimage.
2011.10.018

Price, C. J. (2012). A review and synthesis of the first 20 years of PET
and fMRI studies of heard speech, spoken language and reading.
Neuroimage, 62(2), 816–847. https://doi.org/10.1016/j.neuroimage.
2012.04.062

Psychiatric Genomics Consortium. (2018). [Statistical summary].

Retrieved from https://www.med.unc.edu/pgc

Reardon, P. K., Seidlitz, J., Vandekar, S., Liu, S., Patel, R., Park, M. T. M.,
… Raznahan, A. (2018). Normative brain size variation and brain
shape diversity in humans. Science, 360(6394), 1222–1227. https://
doi.org/10.1126/science.aar2578

Richiardi, J., Altmann, A., Milazzo, A. C., Chang, C., Chakravarty,
M. M., Banaschewski, T., … Greicius, M. D. (2015). BRAIN
NETWORKS. Correlated gene expression supports synchronous
activity in brain networks. Science, 348(6240), 1241–1244. https://
doi.org/10.1126/science.1255905

Romero-Garcia, R., Warrier, V., Bullmore, E. T., Baron-Cohen, S., &
Bethlehem, R. A. I. (2018). Synaptic and transcriptionally down-
regulated genes are associated with cortical thickness differences
in autism. Molecular Psychiatry. https://doi.org/10.1038/s41380-
018-0023-7

Romme, I. A., de Reus, M. A., Ophoff, R. A., Kahn, R. S., & van den
Heuvel, M. P. (2017). Connectome disconnectivity and cortical
gene expression in patients with schizophrenia. Biological Psy-
chiatry, 81(6), 495–502. https://doi.org/10.1016/j.biopsych.2016.
07.012

Satterthwaite, T. D., Wolf, D. H., Loughead, J., Ruparel, K.,
Elliott, M. A., Hakonarson, H., … Gur, R. E. (2012). Impact
of in-scanner head motion on multiple measures of functional
connectivity: Relevance for studies of neurodevelopment in
youth. Neuroimage, 60(1), 623–632. https://doi.org/10.1016/
j.neuroimage.2011.12.063

Schizophrenia Working Group of the Psychiatric Genomics Con-
sortium. (2014). Biological insights from 108 schizophrenia-
associated genetic loci. Nature, 511(7510), 421–427. https://doi.
org/10.1038/nature13595

Schwarz, A. J., & McGonigle, J. (2011). Negative edges and soft
thresholding in complex network analysis of resting state func-
tional connectivity data. Neuroimage, 55(3), 1132–1146. https://
doi.org/10.1016/j.neuroimage.2010.12.047

Scott, T. L., Gallee, J., & Fedorenko, E. (2017). A new fun and ro-
bust version of an fMRI localizer for the frontotemporal language
system. Cognitive Neuroscience, 8(3), 167–176. https://doi.org/
10.1080/17588928.2016.1201466

Seidlitz, J., Vasa, F., Shinn, M., Romero-Garcia, R., Whitaker, K. J.,
Vertes, P. E., … Bullmore, E. T. (2018). Morphometric similarity
networks detect microscale cortical organization and predict
inter-individual cognitive variation. Neuron, 97(1), 231–247.
https://doi.org/10.1016/j.neuron.2017.11.039

Silva, P. A., Williams, S., & McGee, R. (1987). A longitudinal-study
of children with developmental language delay at age three:

Later intelligence, reading and behaviour problems. Devel-
opmental Medicine and Child Neurology, 29(5), 630–640.
https://doi.org/10.1111/j.1469-8749.1987.tb08505.x

Sniekers, S., Stringer, S., Watanabe, K., Jansen, P. R., Coleman, J. R.
I., Krapohl, E., … Posthuma, D. (2017). Genome-wide associa-
tion meta-analysis of 78,308 individuals identifies new loci and
genes influencing human intelligence. Nature Genetics, 49(7),
1107–1112. https://doi.org/10.1038/ng.3869

Sousa, I., Clark, T. G., Toma, C., Kobayashi, K., Choma, M., Holt,
R., … International Molecular Genetic Study of Autism
Consortium. (2009). MET and autism susceptibility: Family and
case-control studies. European Journal of Human Genetics, 17(6),
749–758. https://doi.org/10.1038/ejhg.2008.215

Spek, A. A., & Wouters, S. G. M. (2010). Autism and schizophrenia
in high functioning adults: Behavioral differences and overlap.
Research in Autism Spectrum Disorders, 4(4), 709–717. https://
doi.org/10.1016/j.rasd.2010.01.009

St Pourcain, B., Cents, R. A., Whitehouse, A. J., Haworth, C. M.,
Davis, O. S., O’Reilly, P. F., … Davey Smith, G. (2014). Com-
mon variation near ROBO2 is associated with expressive vocabu-
lary in infancy. Nature Communications, 5, 4831. https://doi.org/
10.1038/ncomms5831

Steinmetz, H., & Seitz, R. J. (1991). Functional anatomy of language
processing: Neuroimaging and the problem of individual vari-
ability. Neuropsychologia, 29(12), 1149–1161.

Sudlow, C., Gallacher, J., Allen, N., Beral, V., Burton, P., Danesh,
J., … Collins, R. (2015). UK Biobank: An open access resource
for identifying the causes of a wide range of complex diseases of
middle and old age. PLoS Medicine, 12(3). https://doi.org/
10.1371/journal.pmed.1001779

Tager-Flusberg, H., Paul, R., & Lord, C. (2005). Language and com-
munication in autism. In F. R. Volkmar, R. Paul, A. Klin & D.
Cohen (Eds.), Handbook of Autism and Pervasive Develop-
mental Disorders (Vol. 1). New York: Wiley.

Thompson, P. M., Stein, J. L., Medland, S. E., Hibar, D. P., Vasquez,
A. A., Renteria, M. E., … Saguenay Youth Study Group. (2014).
The ENIGMA Consortium: Large-scale collaborative analyses of
neuroimaging and genetic data. Brain Imaging and Behavior, 8(2),
153–182. https://doi.org/10.1007/s11682-013-9269-5

Tomblin, B. (2011). Co-morbidity of autism and SLI: Kinds, kin and
complexity. International Journal of Language & Communication
Disorders, 46(2), 127–137. https://doi.org/10.1111/j.1460-6984.
2011.00017.x

Trzaskowski, M., Shakeshaft, N. G., & Plomin, R. (2013).
Intelligence indexes generalist genes for cognitive abilities.
Intelligence, 41(5), 560–565. https://doi.org/10.1016/j.intell.2013.
07.011

van Rooij, D., Anagnostou, E., Arango, C., Auzias, G., Behrmann,
M., Busatto, G. F., … Buitelaar, J. K. (2018). Cortical and subcortical
brain morphometry differences between patients with autism spec-
trum disorder and healthy individuals across the lifespan: Results
from the ENIGMA ASD Working Group. American Journal of
Psychiatry, 175(4), 359–369. https://doi.org/10.1176/appi.ajp.2017.
17010100

Vertes, P. E., Rittman, T., Whitaker, K. J., Romero-Garcia, R., Vasa,
F., Kitzbichler, M. G., … Bullmore, E. T. (2016). Gene transcrip-
tion profiles associated with inter-modular hubs and connec-
tion distance in human functional magnetic resonance imaging
networks. Philosophical Transactions of the Royal Society B:
Biological Sciences, 371(1705). https://doi.org/10.1098/rstb.
2015.0362

Vigneau, M., Beaucousin, V., Herve, P. Y., Duffau, H., Crivello, F.,
Houde, O., … Tzourio-Mazoyer, N. (2006). Meta-analyzing left

Neurobiology of Language

102

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Genes and the Sentence-Processing Network

hemisphere language areas: Phonology, semantics, and sentence
processing. Neuroimage, 30(4), 1414–1432. https://doi.org/
10.1016/j.neuroimage.2005.11.002

Wang, X., Zhen, Z., Song, Y., Huang, L., Kong, X., & Liu, J.
(2016). The hierarchical structure of the face network revealed
by its functional connectivity pattern. Journal of Neuroscience,
36(3), 890–900. https://doi.org/10.1523/ JNEUROSCI.2789-
15.2016

Whitaker, K. J., Vertes, P. E., Romero-Garcia, R., Vasa, F., Moutoussis,
M., Prabhu, G., … Bullmore, E. T. (2016). Adolescence is associated
with genomically patterned consolidation of the hubs of the human
brain connectome. Proceedings of the National Academy of
Sciences of the United States of America, 113(32), 9105–9110.
https://doi.org/10.1073/pnas.1601745113

Yarkoni, T., Poldrack, R. A., Nichols, T. E., Van Essen, D. C., &
Wager, T. D. (2011). Large-scale automated synthesis of human
functional neuroimaging data. Nature Methods, 8(8), 665–670.
https://doi.org/10.1038/nmeth.1635

Yeo, B. T., Krienen, F. M., Sepulcre, J., Sabuncu, M. R., Lashkari,
D., Hollinshead, M., … Buckner, R. L. (2011). The organization
of the human cerebral cortex estimated by intrinsic functional
connectivity. Journal of Neurophysiology, 106(3), 1125–1165.
https://doi.org/10.1152/jn.00338.2011

Zeisel, A., Munoz-Manchado, A. B., Codeluppi, S., Lonnerberg, P.,
La Manno, G., Jureus, A., … Linnarsson, S. (2015). Cell types in
the mouse cortex and hippocampus revealed by single-cell RNA-

seq. Science, 347(6226), 1138–1142. https://doi.org/10.1126/
science.aaa1934

Zeng, L. L., Wang, D., Fox, M. D., Sabuncu, M., Hu, D., Ge, M., …
Liu, H. (2014). Neurobiological basis of head motion in brain im-
aging. Proceedings of the National Academy of Sciences of the
United States of America, 111(16), 6058–6062. https://doi.org/
10.1073/pnas.1317424111

Zhang, Y., Chen, K., Sloan, S. A., Bennett, M. L., Scholze, A. R.,
O’Keeffe, S., … Wu, J. Q. (2014). An RNA-sequencing transcriptome
and splicing database of glia, neurons, and vascular cells of the cere-
bral cortex. Journal of Neuroscience, 34(36), 11929–11947. https://
doi.org/10.1523/JNEUROSCI.1860-14.2014

Zhen, Z., Kong, X. Z., Huang, L., Yang, Z., Wang, X., Hao, X., …
Liu, J. (2017). Quantifying the variability of scene-selective re-
gions: Interindividual, interhemispheric, and sex differences.
Human Brain Mapping, 38(4), 2260–2275. https://doi.org/
10.1002/hbm.23519

Zhen, Z., Yang, Z., Huang, L., Kong, X. Z., Wang, X., Dang, X., …
Liu, J. (2015). Quantifying interindividual variability and asym-
metry of face-selective regions: A probabilistic functional atlas.
Neuroimage, 113, 13–25. https://doi.org/10.1016/j.neuroimage.
2015.03.010

Zhong, S., Zhang, S., Fan, X., Wu, Q., Yan, L., Dong, J., … Wang,
X. (2018). A single-cell RNA-seq survey of the developmental
landscape of the human prefrontal cortex. Nature, 555(7697),
524–528. https://doi.org/10.1038/nature25980

l

D
o
w
n
o
a
d
e
d

f
r
o
m
h

t
t

p

:
/
/

d
i
r
e
c
t
.

m

i
t
.

e
d
u
n
o

/

l
/

l

a
r
t
i
c
e

p
d

f
/

/

/

/

/

1
1
7
7
1
8
6
7
8
8
8
n
o
_
a
_
0
0
0
0
4
p
d

.

l

f

b
y
g
u
e
s
t

t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Neurobiology of Language

103RESEARCH ARTICLE image
RESEARCH ARTICLE image
RESEARCH ARTICLE image
RESEARCH ARTICLE image
RESEARCH ARTICLE image
RESEARCH ARTICLE image

Download pdf