METHODS - Ricerca sull'intelligenza artificiale specializzata al MIT

METHODS

Connectome sorting by consensus
clustering increases separability
in group neuroimaging studies

Javier Rasero
Daniele Marinazzo

, Ibai Diez

, Jesus M. Cortes
, and Sebastiano Stramaglia

2,3,4

1,5,6

,
8,9,10

1Biocruces Health Research Institute, Hospital Universitario de Cruces, Barakaldo, Spain
2Functional Neurology Research Group, Department of Neurology, Massachusetts General Hospital,
Harvard Medical School, Boston, MA, USA
3Gordon Center, Department of Nuclear Medicine, Massachusetts General Hospital,
Harvard Medical School, Boston, MA, USA
4Neurotechnology Laboratory, Tecnalia Health Department, Derio, Spain
5Department of Cell Biology and Histology, University of the Basque Country, Leioa, Spain
6Ikerbasque, The Basque Foundation for Science, Bilbao, Spain
7Faculty of Psychology and Educational Sciences, Department of Data Analysis, Ghent University, Ghent, Belgium
8Dipartimento di Fisica, Universitá degli Studi “Aldo Moro” Bari, Italy
Istituto Nazionale di Fisica Nucleare, Sezione di Bari, Italy
10TIRES-Center of Innovative Technologies for Signal Detection and Processing, Universitá degli Studi “Aldo Moro” Bari, Italy

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Keywords: Unsupervised learning, Consensus clustering, Brain connectivity, Classiﬁcation

ABSTRACT

A fundamental challenge in preprocessing pipelines for neuroimaging datasets is to increase
the signal-to-noise ratio for subsequent analyses. In the same line, we suggest here that the
application of the consensus clustering approach to brain connectivity matrices can be a
valid additional step for connectome processing to ﬁnd subgroups of subjects with reduced
intragroup variability and therefore increasing the separability of the distinct subgroups when
connectomes are used as a biomarker. Inoltre, by partitioning the data with consensus
clustering before any group comparison (for instance, between a healthy population vs. UN
pathological one), we demonstrate that unique regions within each cluster arise and bring
new information that could be relevant from a clinical point of view.

INTRODUCTION

In the supervised classiﬁcation of human connectome data (Sporns, 2011), subjects are usu-
ally grouped based on high-level clinical categories (per esempio., patients and controls, early vs. late
cognitive impairment, eccetera.), and typical approaches aim at deducing a decision function from
the labeled training data (Vedere, per esempio., Fornito & Bullmore, 2010). Likewise, unsupervised anal-
ysis is also usually performed with the researcher being blind to any phenotypic factors and
aiming to ﬁnd subgroups of subjects/features with similar characteristics. The existing litera-
ture reports several clustering algorithms dealing with these issues (Vedere, per esempio., Xu & Wunsch,
2005, and references within). The emergence of substructures underlying the data is very often
due to the fact that the populations to be studied (healthy subjects, patients, eccetera.) are usually
highly heterogeneous. Consequently, stratiﬁcation of groups may be a useful preliminary step,
allowing the subsequent supervised analysis to exploit the knowledge of the data’s structure.

A convenient strategy for group stratiﬁcation involves using subjects’ phenotypic variables
when available (for instance, the presence or absence of one speciﬁc gene mutation). More

a n o p e n a c c e s s

j o u r n a l

Citation: Rasero, J., Diez, I., Cortes,
J. M., Marinazzo, D., & Stramaglia, S.
(2019). Connectome sorting by
consensus clustering increases
separability in group neuroimaging
studies. Network Neuroscience, 3(2),
325–343. https://doi.org/10.1162/
netn_a_00074

DOI:
https://doi.org/10.1162/netn_a_00074

Supporting Information:
https://doi.org/10.1162/netn_a_00074

Received: 18 Giugno 2018
Accepted: 9 novembre 2018

Competing Interests: The authors have
declared that no competing interests
exist.

Corresponding Author:
Sebastiano Stramaglia
sebastiano.stramaglia@ba.infn.it

Handling Editor:
Olaf Sporns

Copyright: © 2018
Istituto di Tecnologia del Massachussetts
Pubblicato sotto Creative Commons
Attribuzione 4.0 Internazionale
(CC BY 4.0) licenza

The MIT Press

Connectome sorting by consensus clustering

Brain connectivity network
(connectome):
A network in which the nodes are
brain regions and the links are
anatomical connections
(“anatomical/structural connectivity”)
or statistical dependencies
(“functional connectivity”).

Distance matrix:
For each node, a distance matrix for
the set of subjects is constructed as
follows. For each pairs of subjects,
the corresponding distance is
(cid:2)

2(1 − r), where r is the Pearson

correlation between the nodal
connectivity patterns of the given
node in the two subjects.

Consensus matrix:
Given several partitions of a given set
of nodes, for each pair of nodes the
consensus matrix assumes, as the
corresponding entry, the fraction of
partitions in which the two nodes
belong to the same subset.

interestingly, stratiﬁcation may also rely on the measured variables, like the human connec-
tome data itself, applying clustering algorithms to ﬁnd natural groupings in the data.

An effective supervised approach, named multivariate distance matrix regression (MDMR),
has been proposed in Zapala & Schork (2006) for the analysis of gene expression patterns;
it tests the relationship between variation in a distance matrix and predictor information col-
lected on the samples whose gene expression levels have been used to construct the matrix.
The same method was also applied to the cross-group analysis of brain connectivity matrices
(Shehzad et al., 2014), as an alternative to the very commonly used method of connectome-
wide association, questo è, mass-univariate statistical analyses, in which the association with
a phenotypic variable of each entry of the brain connectivity matrix across subjects is tested.
While MDMR has found wide application (Vedere, Per esempio, Rasero et al., 2017), these ﬁndings
may be certainly affected by the heterogeneity of classes.

Recently an unsupervised method (Rasero et al., 2017), rooted on the notion of consensus
clustering (Lancichinetti & Fortunato, 2012), has been developed for community detection in
complex networks (Barabasi & Frangos, 2002), when a connectivity matrix is associated with
each item to be classiﬁed. In this method, the different features, extracted from connectivity
matrices, are not combined into a single vector to feed the clustering algorithm; Piuttosto, IL
information coming from the various features is combined by constructing a consensus network
(Lancichinetti & Fortunato, 2012). Consensus clustering is commonly used to generate stable
results out of a set of partitions delivered by different clustering algorithms (and/or parameters)
applied to the same data (Strehl & Ghosh, 2002). In the method developed in Rasero et al.
(2017), instead the consensus strategy was used to combine the information about the data
structure arising from different features so as to summarize them in a single consensus matrix.
Inoltre, such a matrix not only provides a partition of subjects in communities, but also a
geometrical representation of the set of subjects. It has been shown that this technique can be
effective in disentangling the heterogeneity of a population. The main similarity with MDMR
is that in both methods a distance matrix in the space of subjects is introduced.

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

The purpose of this work is to propose the consensus clustering approach from Rasero et al.
(2017) as a step to perform prior to MDMR in exploratory analysis, so as to cope with the het-
erogeneity present in user-deﬁned groups. Primo, we will test the robustness of our consensus
clustering by using noise to introduce heterogeneity in the data. Secondo, we will show that
extracting the natural classes present in the user-deﬁned groups and subsequently performing
the supervised analysis between the subgroups found by consensus clustering allows us to
identify variables whose patterns are altered in group comparisons, which are not identiﬁed
when the groups are used as a whole. Di conseguenza, the proposed approach leads to an increase
in separability.

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

MATERIALS AND METHODS

Materials

The robustness of the consensus clustering algorithm to disentangle the real structure of data
was tested using simulated data for two groups of 15 subjects each, generated using simTB
(Erhardt, Allen, Wei, Eichele, & Calhoun, 2012), a toolbox written in MATLAB that allows sim-
ulation of functional magnetic resonance imaging (fMRI) datasets under a model of spatiotem-
poral separability. Given the number of sources/components (nC), repetition time (TR), and a
hemodynamic model, the program yields the time course proﬁle subject to the experimental
= 20 components
progetto (block- and/or event-related) provided. Our scenario consists of nC

Network Neuroscience

326

Connectome sorting by consensus clustering

and TR = 2s, such that in an event-related experiment the ﬁrst 10 components of the co-
hort of group 1 have high probability (90%) of becoming activated, whereas for the remaining
components the activation is rare (10 %). For subjects of group 2 the situation is the opposite.
This scenario could be then considered as the same group of subjects performing two orthog-
onal tasks, where different components get involved. The length of the simulated time series is
148 points.

The advantages of using the consensus clustering method as a sorting preliminary step were
explored on two real and public neuroimaging datasets. The ﬁrst dataset comprises healthy
controls (HC) and individuals diagnosed with three pathologies: attention-deﬁcit hyperactivity
disorder (ADHD), bipolar disorder (BD), and schizophrenia (SCH). This dataset is provided
by the Consortium for Neuropsychiatric Phenomics LA5C Study and was downloaded from
the OpenfMRI database with accession number ds000030 (Poldrack et al., 2016). We used
resting functional MRI (rfMRI) Di 255 subjects: 40 ADHD, 47 BD, 50 SCH, E 118 HC. Dem-
ographics information about this cohort can be found in Table 1. Data were preprocessed with
FSL (FMRIB Software Library v5.0). All volume images were corrected for motion, after which
slice timing correction was applied to correct for temporal alignment. All voxels were spatially
smoothed with a 6-mm full width at half-maximum isotropic Gaussian kernel, and a band-
pass ﬁlter was applied between 0.01 E 0.08 Hz after intensity normalization. Inoltre,
linear and quadratic trends were removed. We next regressed out the motion time courses, IL
average CSF signal and the average white matter signal. Global signal regression was not per-
formed. Data were transformed to the MNI152 template, such that a given voxel had a volume
Di 3 mm × 3 mm × 3 mm. Finalmente, we obtained 278 time series, each corresponding to an
anatomical region of interest by averaging the voxel signals according to the functional atlas
described in (Shen, Tokoglu, Papademetris, & Constable, 2013). Finalmente, UN 278 × 278 matrix of
Pearson coefﬁcients among time series for each subject was obtained.

As for the second dataset, it contains resting-state fMRI scans from a population of autism
spectrum disorder (ASD) subjects and a population of typically developing (TD) ones, from
the Autism Brain Imaging Data Exchange (ABIDE) repository (Di Martino et al., 2014), an
initiative that aims at collecting data from laboratories around the world to accelerate the
understanding of the neural bases of this disorder. Since it is unclear what effect that different
scanner machines can have on the observable results, we decided to avoid this additional
possible confounder by considering only a sample of 75 subjects for each group acquired at
the same site (NYU). For this subset, we matched age and sex between samples (Wilcoxon rank
sum p = 0.27 and χ2
test p = 0.29, rispettivamente). Preprocessing of the data was done using FSL,
AFNI, and MATLAB. Primo, slice-time correction was applied. Then, each volume was aligned
to the middle volume to correct for head motion artifacts followed by intensity normalization.
We next regressed out 24 motion parameters, the average cerebrospinal ﬂuid (CSF) and the
average white matter signal. A band-pass ﬁlter was applied between 0.01 E 0.08 Hz, E

Tavolo 1. Demographic information for LA5C dataset

Resting-state fMRI:
Functional magnetic resonance
imaging acquired while the subject
is simply instructed to stay awake.

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

HC (N = 118)
ADHD (N = 40)
BD (N = 47)
SCH (N = 50)
Note. FWD = framewise displacement; HC = healthy controls; ADHD = attention-deﬁcit hyper-
activity disorder; BD = bipolar disorder; SCH = schizophrenia.

Age
31.56 ± 8.83
32.05 ± 10.4
35.47 ± 9.16
36.46 ± 8.88

FWD
0.17 ± 0.15
0.15 ± 0.11
0.19 ± 0.13
0.25 ± 0.20

Sex (M/F)
64/54
21/19
26/21
38/12

Network Neuroscience

327

Connectome sorting by consensus clustering

linear and quadratic trends were removed. All voxels were spatially smoothed with a 6-mm
FWHM. Finalmente, FreeSurfer software was used for brain segmentation and cortical parcellation.
An 86 region atlas was generated with 68 cortical regions from the Desikan-Killiany Atlas (34
in each hemisphere) E 18 subcortical regions (9 in each hemisphere: thalamus, caudate,
putamen, pallidum, hippocampus, amygdala, accumbens, ventralDC and cerebellum). Each
subject parcellation was projected to individual functional data, and the mean functional time
series of each region was computed.

Consensus clustering method

Given a set of matrices of distance among subjects, the consensus clustering proposed in
(Rasero, Pellicoro, et al., 2017) can be summarized as follows: (io) cluster each distance matrix
using a known clustering algorithm, (ii) build the consensus network from the correspond-
ing partitions and (iii) extract groups of subjects by ﬁnding the communities of the consensus
network thus obtained.

Regarding the distance matrices calculation in the case of fMRI data, let us consider m
subjects such that each one has a N × N matrix of features, which can for example represent
the brain functional connectivity matrix. We will denote this matrix as {UN(io, j)α}, where α =
1, . . . , m and i, j = 1, . . . , N. For each row i, we build a distance matrix for the set of subjects as
follows. Consider a pair of subjects α and β, and consider the corresponding patterns {UN(io, :)α}
E {UN(io, :)β}; let r be their Pearson correlation. As the distance between the two subjects,
for the node i we take dαβ = 1 − r; other choices for the distance can be used, like, for
2(1 − r), where r is the Pearson correlation. The m × m distance matrix dαβ
esempio, dαβ =
corresponding to row i will be denoted by Di, with i = 1, . . . , N. The set of D matrices may
be seen as corresponding to layers of a multilayer network (Boccaletti et al., 2014), each brain
node representing a layer.

(cid:2)

Each distance matrix Di is then partitioned into k groups of subjects using the k-medoids
method (Brito, Bertrand, Cucumel, & Carvalho, 2007), and this cluster information is encoded
into an adjacency matrix among subjects. Subsequently, an m × m consensus matrix C is evalu-
ated by averaging this different information across the nodes. Hence, the entries of Cαβ indicate
the number of partitions in which subjects α and β are assigned to the same group, divided by
the number of partitions N. Eventually, the consensus matrix is averaged over k ranging in the
interval (2-20) so as to combine, in the ﬁnal consensus matrix, information about structures at
different resolutions (for more details, see Rasero, Pellicoro, et al., 2017).

The consensus matrix obtained as explained before provides a weighted graph that contains
all the community information, whose structure can be further examined by an appropriate
method. Following Rasero, Pellicoro, et al. (2017), we perform this by modularity optimiza-
zione, for which we compared this consensus matrix with the ensemble of all consensus matri-
ces that one may obtain randomly and independently by permuting the cluster labels obtained
after applying the k-medoids algorithm to each of the set of distance matrices. More precisely,
a modularity matrix is evaluated as

B = C − γ · P,

(1)

K-medoids algorithm:
A classical partitioning technique of
clustering that groups the data set of
n objects into k clusters known
a priori. Like the k-means, k-medoids
algorithms attempts to minimize the
distance between points labeled to
be in a cluster and a point designated
as the center of that cluster; In
contrast to the k-means algorithm,
k-medoids chooses data points as
centers (medoids).

Modularity matrix:
For each entry in the consensus
matrix, the null ensemble of all
possible conﬁgurations by label
permutation at a given resolution
is subtracted.

where P is the expected co-assignment matrix, uniform as a consequence of the null ensemble
here chosen, obtained repeating many times the permutation of labels, and γ is the resolution
parameter that all modularity matrices implicitly or explicitly carry. As in the Newman and
Girvan scenario, we hereafter set this parameter to be 1, which corresponds to the maximum
modularity case (Reichardt & Bornholdt, 2006).

Network Neuroscience

328

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Connectome sorting by consensus clustering

A modularity matrix of this kind encodes all the information at different levels about the
interaction strength among subjects. Di conseguenza, one could now just as easily deﬁne a distance
matrix based on this matrix by computing any metric distance between pairs of rows and
follow with k-medoids to perform a clustering analysis. We instead submitted this matrix to a
modularity optimization algorithm, which does not require a predeﬁned number of clusters,
to obtain the output partition by the proposed approach (we used the Community Louvain
routine in the Brain Connectivity Toolbox (Rubinov & Sporns, 2010), which admits modularity
matrices instead of connectivity matrices as input.

Multivariate Distance Matrix Regression

The cross-group analysis of brain connectivity matrices has been performed using the MDMR
approach as described in Shehzad et al. (2014) and Zapala & Schork (2006). This method
allows for an identiﬁcation of voxels/regions whose whole-brain connectivity patterns vary
signiﬁcantly with a given set of regressor variables. In contrast to the usual general linear re-
gression model, the variation on the total sum of squares here is tested on the Distance matrix,
constructed from the connectivity patterns for each node, rather than directly on the observed
dati. This approach can assess signiﬁcance of multiple connections at the same time and is
especially useful when the number of dependent variables—in this case, the entries of the
connectivity matrices—exceeds the number of observations.

For a ﬁxed brain node i, the distance between connectivity patterns of i with the rest of
the brain was calculated per pair of subjects (tu,v), by calculating Pearson correlation between
connectivity vectors of subject pairs, thus leading to a distance matrix in the subject space for
each i investigated. In particular, the following formula was applied

(cid:3)

(cid:4)

(cid:5)

1 − ri
uv

di
uv

(2)

where ri
uv is the Pearson correlation between connectivity patterns of node i for subjects u and
v. Prossimo, MDMR was applied to perform cross-group analysis as implemented in R (McArtor,
n.d.).

MDMR yielded a pseudo-F estimator (analogous to that F estimator in standard ANOVA
analysis), which addresses signiﬁcance due to between-group variation as compared with
within-group variations (McArdle & Anderson, 2001) on a distance matrix calculated from
a set of variables. In the particular case when the only regressor variable is categorical (cioè.,
the group label), given a distance matrix for a certain node, one can calculate the total sum of
squares as

SST

N
∑
u=1

N
∑
v=u+1

2
uv

(3)

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

with N being the total number of subjects. Allo stesso modo, the within-group sum of squares can be
written as

SSW

= ∑ 1
ng

∑
u=1

N
∑
v=u+1

2
uv

(cid:6)G
uv

(4)

where ng is the number of subjects per group and (cid:6)G
uv a variable equal to 1 if subjects u and v
belong to group g and 0 otherwise. The between-group variation is simply SSB = SST − SSW,
which leads to a pseudo-F statistic as follows

F = (N − 1) SSA
SSW

(5)

329

Network Neuroscience

Connectome sorting by consensus clustering

where m is the number of groups. As it was acknowledged in Zapala & Schork (2006), IL
pseudo-F statistic is not distributed like the usual Fisher’s F-distribution under the null hy-
pothesis. Accordingly, we randomly shufﬂed the subject indexes and computed the pseudo-F
statistic each time. A p value is computed by counting those pseudo-F statistic values from per-
muted data greater than that from the original data respect to the total number of performed
permutations.

Nevertheless, in our cross-group analyses, in addition to group label, age, sex, and mean
framewise displacement (FWR) were also considered as covariates because of their possible
confounding effect in distance variation between subjects. Finalmente, we controlled for I errors
by false discovery rate corrections and set signiﬁcance threshold at 1%.

The proposed approach

The application of MDMR to identify altered patterns is hampered by the heterogeneity of
subjects in the same group (healthy controls or patients). Therefore, we propose here the use
of the consensus clustering approach as a sorting preliminary tool. In a ﬁrst step, the consensus
clustering is to be applied to extract the natural classes present in the group of controls (and/or
in the group of patients). In a subsequent step, the supervised analysis of MDMR is to be
performed between the pairs of subgroups found by consensus clustering. In this way one can
identify variables whose pattern is altered in subgroups comparisons, but not in the whole
group.

RESULTS

Simulated data

Our approach aims to use cluster information from each node instead of using the whole set
of nodes across the brain. We can visualize this in Figure 1, where the group reconstruction
provided by our method is compared with the one yielded by considering the whole corre-
lation matrix as pattern connectivity from which one calculates the distance matrix, so the
average is only carried out over different resolutions κ when applying k-medoids. Therefore,
the introduction of a clustering step at a node-level resolution seems to permit a better cluster
reconstruction, conﬁrming the ﬁndings in Rasero et al. (2017).

We also assessed the robustness of our method in group reconstruction when Gaussian

noise is added to the time series of group 1 (TS1) and group 2 (TS2) come segue:

ij
1 = TS

ij
2 = TS

1 + (cid:6)ij
ij

1 · A · N ij

1 (0, 1)

2 + (cid:6)ij
ij

2 · A · N ij

2 (0, 1),

(6)

for i = 1 . . . 15 subjects and j = 1 . . . 20 components. A is the amplitude of the perturbations
E (cid:6)ij
1,2 is a binary matrix allowing us to play with different noise conﬁguration scenarios. In
our notation, subscript index represents group label and superscript indices i, j subject and
component, rispettivamente.

In a ﬁrst scenario, different values of the noise amplitude A = {0.1, 0.3, 0.5} were applied to
all components and subjects (cid:6)i=1..15,j=1..20
= 1. As we can see in Figure 2, only for large ampli-
tude, noise starts to dominate so much that the consensus algorithm renders group more com-
pact and mix them together making it incapable of distinguishing between groups perfectly.

1,2

Network Neuroscience

330

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Connectome sorting by consensus clustering

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Figura 1. The modularity matrix among subjects of both groups and its cluster structure reconstruc-
tion provided by the Louvain community detection algorithm on this matrix is depicted in two
different scenarios: Application of the consensus clustering to the distance matrix from the whole
pattern connectivity matrix (UN) or to the set of distance matrices from each node pattern connec-
attività (B).

In a second scenario, variation of noise across subsets of components was studied. To am-
plify this effect, we restrict noise amplitudes to a ﬁxed large value A = 0.5. Four subsets of
components affected by noise were taken j = {(1), (1 . . . 3), (1 . . . 6), (1 . . . 10)}. Inoltre, In
order to add more complexity, we considered the application of noise to ﬁve subjects of group 1
E 10 of group 2, rispettivamente. As a consequence, this effect is stronger in group 2, since for the
ﬁrst 10 components the activity is lower and therefore more sensitive to noise. When applied to
a different number of components, this leads to a decrease of intragroup distance of subjects
affected (yellowish areas in the modularity matrices of Figure 3) and also a slight approxi-
mation between groups as a consequence of the synchronization among the noised compo-
nents. Nevertheless, the consensus clustering is pretty robust when reconstructing both groups.

Finalmente, variation of number of subjects was taken into account (Guarda la figura 4). To simplify
this scenario, we considered that noise only affected group 2 and the ﬁrst 10 components with
a large amplitude A = 0.5. The subset of number of subjects are i = {(1), (1 . . . 5), (1 . . . 10),
(1 . . . 15)}. The consensus turns out to be again robust when reconstructing both groups. For
this case, the effect of changing the number of subjects of group 2 affected by noise makes
this group more compact as more subjects are involved since noise makes them look more
similar. This also makes these subjects of group 2 more similar with those of group 1 for the

Network Neuroscience

331

Connectome sorting by consensus clustering

Figura 2. The modularity matrix among subjects of both groups and its cluster structure reconstruction provided by a Louvain community
detection algorithm on this matrix is depicted when changing the amplitude noise A in Equation 6, that affects all the simulated time series
for both groups: A = 0.1 (UN), A = 0.3 (B), A = 0.5 (C).

ﬁrst 10 components, but not enough to mix them together to make them indistinguishable by
the consensus method.

LA5C dataset

The application of the consensus clustering method to the brain matrices in the four groups
yields the modularity matrices depicted in Figure 5, which have been ordered according to the

Figura 3. The modularity matrix among subjects of both groups and its cluster structure reconstruction provided by a Louvain community
detection algorithm on this matrix is depicted when changing the number of components, cioè., the index j of (cid:6)ij in Equation 6 affected by large
amplitude noise (A = 0.5): j = (1) (UN), j = (1 . . . 3) (B), j = (1 . . . 6) (C), j = (1 . . . 10) (D).

Network Neuroscience

332

D
o
w
N
o
UN
D
e
D

F
R
o
M
H

T
T

:
/
/

D
io
R
e
C
T
.

io
T
.

e
D
tu
N
e
N
UN
R
T
io
C
e
–
P
D

F
/

3
2
3
2
5
1
0
9
2
6
2
6
N
e
N
_
UN
_
0
0
0
7
4
P
D

B
sì
G
tu
e
S
T

o
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Connectome sorting by consensus clustering

Figura 4.
The modularity matrix among subjects of both groups and its cluster structure reconstruction provided by a Louvain community
detection algorithm on this matrix is depicted when changing the subjects affected, cioè., the index i of (cid:6)ij in Equation 6 affected by large
amplitude noise (A = 0.5): i = (1) (UN), i = (1 . . . 5) (B), i = (1 . . . 10) (C), i = (1 . . . 15) (D).

membership conﬁguration found by the community Louvain routine. For the healthy group,
this algorithm detects three communities of 55, 54, E 8, rispettivamente. Such clusters are sta-
tistically differentiated by FWD (Kruskal-Wallis test, P < 0.0001) and sex (χ2 test, p = 0.0201). For the ADHD group, it ﬁnds two clusters of 21 and 15 subjects. For the BD group it detects three communities of 17, 21, and 8 subjects, and for the SCH group, three clusters of 14, 25, and 9, these SCH clusters being statistically different with respect to FWD ( Kruskal-Wallis test, p = 0.0003). In all cases, communities with fewer subjects than ﬁve have been considered as outliers. Likewise, the consensus clustering algorithm allows extraction of more homogeneous sub- groups of subjects, as can be noticed from Figure 5, where the mean intragroup distance distri- bution produced by the collection of distance matrix for each pattern connectivity is exhibited. We stress that the intersubjects distance is highly and signiﬁcantly smaller for the clusters with respect to the whole groups, as estimated pairwise by means of a Wilcoxon signed-rank test. 2 Furthermore, having more homogeneous clusters is translated into a variability decrease, leading in some cases to a gain in separability, when clusters are introduced in between-group comparison studies. As can be seen in Figure 6, the partition of individual groups by consensus clustering in general enhances the effect size (here evaluated through the pseudo-R measure in MDMR) per node associated with class difference. Furthermore, this also allows us to reject the null model even though the sample size decreases when a partitioned group is involved, providing a better control for type I errors compared with the unpartitioned case (see p values distribution Supporting Information Figure S2, Rasero, Diez, Cortes, Marinazzo, & Stramaglia, 2019). Concerning partition of the healthy group, such an enhancement is notorious, showing, for example, that a procedure selecting and recruiting subjects with similar characteristics of those subjects within the cluster 2 in our healthy control sample would optimize the node association with the pathology. In addition, for the ADHD group, which lacks of signiﬁcant results when being involved in group comparisons as a whole, size effect ampliﬁcation leads to development of a signiﬁcant gain improvement when clustered using the consensus algorithm. In contrast, for both BD and SCH clusters such an improvement takes place specially in some 2 measure: Pseudo-R Measure that accounts for the proportion of sum of squared pairwise distances explained by the predictors and that can be therefore used to assess the effect size of association between the dependent and independent variables in the multivariate distance matrix regression. Network Neuroscience 333 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Connectome sorting by consensus clustering l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 5. Modularity matrix and the reduction of the mean intragroup distance per node after applying the consensus clustering method for each group in the LA5C dataset. of their clusters and as a consequence, the consensus algorithm can also help ﬁlter out “noisy” subjects, which have less in common with the pathology in study. On the other hand, application of consensus clustering to brain connectivity matrices of pathologies can also help unmask substructures whose underlying properties are unique and Network Neuroscience 334 Connectome sorting by consensus clustering l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 6. The change in the effect size measure R provided by MDMR across nodes, where subscript term “whole” involves both whole groups comparison and “cluster” one partitioned group by consensus clustering. An enhancement is developed above zero, here displayed by a gray thick line. Median value point reaching the maximum and minimum values are depicted here in black. 2 that might be worth studying. For example, concerning the comparisons with healthy subjects of the ADHD group after having been partitioned, we obtain for cluster 1, 59 signiﬁcant nodes that are present neither in cluster 2 nor when this group is taken as a whole. We will here- after denote these nodes as unique nodes for cluster 1, meaning that only the comparison with cluster 1 leads one to point out their altered pattern with respect to pathological conditions. Likewise, seven nodes are unique for cluster 2 of the same disorder group. Regarding compar- ison with the BD group, there are 61 unique nodes that are recognized as signiﬁcant for cluster 1 and 46 for cluster 2. Finally, 113 nodes arise as being signiﬁcantly associated with the whole group of schizophrenia, whereas substructure inspection increases this number to 128 and 132 for clusters 1 and 2, exhibiting both 27 different unique nodes. Both cluster 1 of BD and cluster 3 of SCH keep subjects associated with both pathologies, and therefore no association is found whatsoever. Regions affected by unique nodes in each pathology are represented in Figure 7. Higher signiﬁcant regions for cluster 1 of ADHD lie in cortical regions from the lateral occipital cortex, superior division, the cingulate gyrus, posterior division and lingual gyrus, Network Neuroscience 335 Connectome sorting by consensus clustering and involve functions from the visual and dorsal attention system. Cluster 2 has also incidence from parts of the lingual gyrus in addition to postcentral gyrus, the superior parietal lobule and the supramarginal gyrus, anterior division—all associated with the dorsal attention system— and a small portion of the cerebellum. On the other hand, regarding the BD group, the default mode system fundamentally emerges in cluster 2 with cortical regions in the frontal pole, temporal pole, inferior temporal gyrus, anterior division and the precuneus. Cluster 3 includes the postcentral gyrus and the left thala- mus, which take part in sensorimotor functions, and parts of the cerebellum and the occipital fusiform gyrus. Finally, SCH community representation obtained through the consensus algorithm exhibits a well-differentiated structure, with cluster 1 focusing mainly on the posterior parts of the brain with the postcentral gyrus, precuneus and lingual gyrus, somatosensory, and visual systems; l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 7. Unique nodes for each cluster within the disorder groups, i.e., signiﬁcant nodes not presented in the whole group or the rest of cluster comparisons, which emerge when confronting with the whole healthy control group. Network Neuroscience 336 Connectome sorting by consensus clustering Figure 8. The modularity matrix B ordered after label structure provided by community detection is depicted for typically developing (left) and autism spectrum disorder (right) group in the ABIDE dataset considered. and cluster 2 involving basically the prefrontal cortex and some parts of the superior frontal gyrus (mainly associated to the default mode network). ABIDE dataset Application of the consensus clustering algorithm and subsequent community detection al- gorithm to both groups leads to the ordered community structure provided by the modularity matrices in Figure 8. For the TD group, the modularity matrix B yields three modules of 18, 36, and 19 subjects, respectively. In contrast, division of the ASD group consists of two modules of 30 and 38 patients, respectively. Moreover, a Wilcoxon rank sum test performed on ASD community partition separates subjects statistically by age (p = 0.0028) and framewise dis- placement (p < 0.0001) (see Figure 9). Figure 9. Age and FWD points distribution for the subjects within the clusters of ASD group found by the consensus clustering algorithm. In red, the dot displays the median, and the vertical lines reach the maximum and minimum dispersion of the points in each case. Network Neuroscience 337 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Connectome sorting by consensus clustering On the other hand, we can see in Figure 10 that overall, the effect size of each node pattern connectivity for distinguishing between both group labels is larger when a partitioned case is involved. This is also translated into an increase of statistical signiﬁcance (see Supporting Information Figure S3, Rasero et al., 2019). In particular, it is notable how cluster 3 of the TD population shows all the beneﬁts of applying the consensus clustering method since it gathers the TD subjects with the highest connectome difference in comparison with the ASD subjects. Moreover, this pronounced case corresponds to the scenario where the decrease of intragroup distance (cluster is more homogeneous) also follows an increase of the intergroup distance (left panel of Figure 11). This effect is not reproduced in the rest of the cases, even though clusters become more compact. We are indeed clustering each group separately, thus maximum separation between classes does not need to be guaranteed. Likewise, the intragroup distance reduction also yields an effect size enhancement in both autistic subpopulations, more clearly for cluster 1 where this reduction is more pronounced. Consequently, the increase in effect size and the subsequent gain in statistical signiﬁcance for some of the cases might allow to unmask regions not observed signiﬁcantly different at ﬁrst and therefore be further exploited by a more thorough exploratory analysis. l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 10. The change in the effect size measure R provided by MDMR across nodes, where subscript term “whole” involves both whole groups comparison and “cluster” one partitioned group. An enhancement is developed above zero, here displayed by a gray thick line. Median value point reaching the maximum and minimum values are depicted here in black. 2 Network Neuroscience 338 Connectome sorting by consensus clustering Figure 11. Mean intra- and intergroup distance distribution provided by the collection of distance matrices per each node. On the left, this is shown for the TD group, and on the right for the ASD subjects. The intergroup distance is always evaluated considering the other group as a whole. l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 When considering both groups each as a whole, MDMR yields only two signiﬁcant regions (region 57 and 83) as explained by group label variation between subjects, whereas this num- ber increases to 13 and 70 for age and FWR, respectively, showing an important effect of the observed variance based on these last covariates. However, applying ﬁrst consensus cluster- ing to the TD group produces three clusters such that when comparing with the ASD group, one cluster yields 52 signiﬁcant nodes, with 48 nodes uniquely assigned to it. Likewise, both clusters obtained by partitioning the ASD group lead to 16 and 9 signiﬁcant nodes when com- pared with the group of typically developing subjects and, most importantly, 14 and 6 of them, respectively, only observable in each cluster. Therefore, not only we do gain separability by means of an increase of signiﬁcant nodes, but also our approach allows us to unmask unique regions not visible whatsoever when using standard whole groups comparisons. From a clinical point of view the emergence of unique nodes can evidence brain regions whose implications on the development of the pathology should be further examined. In our case, concentrating on the ASD case, cluster 1 has 14 regions not present neither in clus- ter 2 versus TD, nor in whole group comparison. These regions are prominently localized on the right brain hemisphere with the highest overall signiﬁcance in the pars triangularis of fdr = 0.0004). Areas in the same hemisphere involving the frontal the inferior frontal gyrus (p fdr = 0.0014) and fdr = 0.0036) and postcentral (p pole (p fdr = 0.0019) in the opposite hemisphere are also associated to greater paracentral regions (p separation. On the other hand, the six unique signiﬁcant nodes of cluster 2 belong to areas in fdr = 0.0021 in the left both hemispheres involving the banks of the superior temporal sulcus (p fdr = 0.0085 hemisphere and p fdr = 0.0069) and and p the supramarginal (p fdr = 0.0075, respectively), and regions in the inferior temporal lobe (p fdr = 0.0079) in the right hemisphere. fdr = 0.0078 in the right one) and the superior temporal lobe (p fdr = 0.0018) and the precuneus (p DISCUSSION New techniques aimed to increase group separability in neuroimaging studies are constantly being developed to gain deeper and better insight on brain function and malfunctioning. The use of connectomes as a biomarkers is also increasingly prominent. Our clustering method Network Neuroscience 339 Connectome sorting by consensus clustering is naturally located in this latter ﬁeld and helps unveil more compact subclasses beyond the group labels. In addition, the application of consensus clustering to different regions of the brain allows us to capture differences in the groups unseen when the whole connectivity matrix is used. To our best knowledge, such an approach has not been yet explored in neuroimaging. Additionally, the consensus matrix gathers information from different resolutions and nodes and encodes them into the consensus matrix. As a consequence, some possible arbitrariness such as choosing a priori the number of clusters to be obtained disappears. Moreover, clusters found by our method are fairly robust with respect to moderate changes in amplitude noise. Of note, clinical and control groups have been shown to share functional connectivity patterns (Easson, Fatima, & McIntosh, in press; Spronk et al., 2018). Here we do not go after a wholly data driven clustering of subjects irrespective of their main clinical label, but rather on reﬁning these user-deﬁned groups based on their FC patterns. Preliminary analysis of groups of labeled subjects by consensus clustering leads to more compact partitions and a subsequent increase in separability of classes when performing group comparisons. This effect can be simply understood by looking at the comparison of two uni- variate distributions, where the separability of both groups can be expressed in terms of the mean difference (intergroup distance) divided by the variability of both groups (intragroup dis- tance). As a consequence, a reduction of the intragroup distance can help increase separability. Nonetheless, it is important to note that application of our clustering method to an individual group only optimizes the intragroup distance, and therefore it might happen that for some parti- tions the intergroup distances also decrease, compensating the beneﬁt of having more compact groups. Different strategies to preprocess the data and to address motion confounders mod- ify to some extent the output of virtually any subsequent analysis. One of the most debated choices in this respect, and likely one of the most inﬂuential on the results, is global signal regression. In our original analyses we decided not to apply global signal regression because of the expected network-speciﬁc differential effects evidenced in Gotts et al. (2013). On the other hand, there are also reasons to use global signal regression (Murphy & Fox, 2017), and this choice was made in a recent paper proposing a similar subgroup classiﬁcation strategy (Easson et al., in press). We repeated our analysis after regressing the global signal out, and found that the results were qualitatively similar (see Supporting Information Figure S4, Rasero et al., 2019 for the ABIDE dataset). In addition to variability reduction that allows us to elucidate regions with distinctive con- nectivity patterns in group comparisons, each cluster found also provides unique information. In particular, we have shown this for two different datasets that include four different mental disorders: ADHD, BD, SCH, and ASD. Most of the studies aimed at ﬁnding brain alterations in ADHD subjects have been focused on children, since the symptoms related to this disorder such as inattention and/or hyperactiv- ity/impulsivity need to be treated at the beginning of their appearance. In our case, however, our cohort consists of adult subjects, among which we have found two different clusters car- rying both different and unique information. The ﬁrst cluster comprises a vast anterior brain area, in addition to the lateral occipital cortex and the lingual gyrus, mainly associated with visual attention. Similar results were reported using structural data (Ahrendts et al., 2011). The second one covers a small portion of the brain associated with the dorsal attention and the cerebellum, possibly capturing the cognitive functions of the latter in attention tasks (Berquin et al., 1998). Network Neuroscience 340 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Connectome sorting by consensus clustering Our clustering method clearly evidences the default mode network as one of the relevant systems in one of the subgroups in bipolar disorder and schizophrenia. Such a fundamental network is likely very sensitive to most modulations and already reported to be affected in both mental disorders (V. Calhoun et al., 2012; Ongür et al., 2010). In our speciﬁc clusters, regions with a disrupted connectivity pattern concentrate in the frontal cortex (Deakin et al., 1989; Johnston-Wilson et al., 2000), with also some areas in the temporal lobe, which may account for differences between bipolar disorder and schizophrenia (Altshuler et al., 2000; V. D. Calhoun, Maciejewski, Pearlson, & Kiehl, 2008; Johnstone et al., 1989). The present study evidenced areas involved in different functional systems, such as the cerebellum (DelBello, Strakowski, Zimmerman, Hawkins, & Sax, 1999; Forlim et al., 2017; McDonald et al., 2005), and the postcentral gyrus was connected to the appearance of delusion and hallucinations associated with a dysfunction of the sensory system (Song et al., 2015), and the bilateral lingual gyrus, whose hyperactivation has been connected to lack of empathy in schizophrenic subjects (Kühn & Gallinat, 2013). On the other hand, two clear macroregions characterize our results on ASD subjects. The ﬁrst one has a clear right asymmetry, with the participation of areas around the pars opercu- laris and pars triangularis in the inferior frontal gyrus, usually connected to lack of emotions (Dapretto et al., 2005; De Foss et al., 2004; Herbert et al., 2002). This domain also includes sensorimotor areas such as the postcentral and paracentral gyrus and precuneus and conﬁrm the results observed in a larger ABIDE cohort of subjects when comparing ASD with typically developing controls (Lee, Kyeong, Kim, & Cheon, 2016). The second set of regions comprises some parts of the temporal lobe that, in addition to regions in the orbitofrontal cortex and the amygdala, deﬁne the “social brain.” In this case, autistic subjects develop abilities for verbally labeling complex visual stimuli and processing faces and eyes, to compensate for amygdala abnormality (Baron-Cohen et al., 1999). It is worth stressing that the connectivity patterns among brain regions are inﬂuenced by respiration, movement, cardiac phase, and other physiological variables, and what we observe and can infer here (and in most fMRI studies) is just a higher level view in which the effects of all these actors are conﬂated. CONCLUSIONS In this work we have proposed the use of the consensus clustering approach for exploratory analysis in order to cope with the heterogeneity of subjects. Extracting the natural classes present in data and subsequently performing the supervised analysis by MDMR on these al- lows us to identify variables whose adjacency pattern is altered in group comparisons, and which are not highlighted when considering the groups as a whole. As a result, the proposed approach leads to an increase in the effect sizes. We presented an application to public fMRI data, evidencing groups of subjects in which speciﬁc regions contributed to a higher separa- bility between classes. In sum, we have shown an additional beneﬁt of using the consensus clustering algorithm developed in Rasero et al. (2017). Such an algorithm is based on the idea of condensing information on structure at different levels, which in general works well when extracting the inherent data subclasses. Future studies could test alternative strategies when this is not the case. AUTHOR CONTRIBUTIONS Javier Rasero: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Software; Validation; Visualization; Writing – original draft. Ibai Diez: Data curation. Jesus Network Neuroscience 341 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Connectome sorting by consensus clustering M Cortes: Conceptualization; Methodology; Supervision; Writing – original draft. Daniele Marinazzo: Conceptualization; Data curation; Formal analysis; Investigation; Methodology; Supervision; Visualization; Writing – original draft. Sebastiano Stramaglia: Conceptualization; Formal analysis; Investigation; Methodology; Supervision; Writing – original draft. FUNDING INFORMATION Sebastiano Stramaglia, University of Bari “Aldo Moro” (http://doi.org/10.13039/501100005362), Award ID: “Progetti di Ricerca su base competitiva, f.di ex 60% - residuo 2015.” Javier Rasero, Department of Education (Basque Country Government), Award ID: Doctoral Research Staff Improvement Programme. REFERENCES Ahrendts, J., Rüsch, N., Wilke, M., Philipsen, A., Eickhoff, S. B., Glauche, V., . . . van Elst, L. T. (2011). Visual cortex abnormal- ities in adults with ADHD: A structural MRI study. The World Journal of Biological Psychiatry, 12(4), 260–270. https://doi.org/ 10.3109/15622975.2010.518624 Altshuler, L. L., Bartzokis, G., Grieder, T., Curran, J., Jimenez, T., Leight, K., . . . Mintz, J. (2000). An MRI study of temporal lobe structures in men with bipolar disorder or schizophrenia. Biolog- ical Psychiatry, 48(2), 147–162. https://doi.org/10.1016/S0006- 3223(00)00836-2 Barabasi, A.-L., & Frangos, J. (2002). Linked: The New Science of Networks (1st ed.). New York: Perseus Books Group. Baron-Cohen, S., Ring, H. A., Wheelwright, S., Bullmore, E. T., Brammer, M. J., Simmons, A., & Williams, S. C. R. (1999). So- cial intelligence in the normal and autistic brain: An fMRI study. European Journal of Neuroscience, 11(6), 1891–1898. http:// doi.org/10.1046/j.1460-9568.1999.00621.x Berquin, P. C., Giedd, J. N., Jacobsen, L. K., Hamburger, S. D., Krain, A. L., Rapoport, J. L., & Castellanos, F. X. (1998). Cerebel- lum in attention-deﬁcit hyperactivity disorder. Neurology, 50(4), 1087–1093. http://n.neurology.org/content/50/4/1087 Boccaletti, S., Bianconi, G., Criado, R., del Genio, C., Gómez- Gardeñes, J., Romance, M., . . . Zanin, M. (2014). The structure and dynamics of multilayer networks. Physics Reports, 544(1), 1–122. http://doi.org/10.1016/j.physrep.2014.07.001 Brito, P., Bertrand, P., Cucumel, G., & Carvalho, F. D. (2007). Clus- tering by Means of Medoids. Selected Contributions in Data Analysis and Classiﬁcation. Berlin: Springer Science & Business Media. Calhoun, V., Sui, J., Kiehl, K., Turner, J., Allen, E., & Pearlson, G. (2012). Exploring the psychosis functional connectome: Aber- rant intrinsic networks in schizophrenia and bipolar disorder. Frontiers in Psychiatry, 2, 75. https://www.frontiersin.org/article/ 10.3389/fpsyt.2011.00075 Calhoun, V. D., Maciejewski, P. K., Pearlson, G. D., & Kiehl, K. A. (2008). Temporal lobe and default hemodynamic brain modes discriminate between schizophrenia and bipolar dis- order. Human Brain Mapping, 29(11), 1265–1275. http://doi.org/ 10.1002/hbm.20463 Dapretto, M., Davies, M. S., Pfeifer, J. H., Scott, A. A., Sigman, M., Bookheimer, S. Y., & Iacoboni, M. (2005). Understanding emo- tions in others: Mirror neuron dysfunction in children with autism spectrum disorders. Nature Neuroscience, 9(1), 2830. https://doi. org/10.1038/nn1611 Deakin, J. F. W., Slater, P., Simpson, M. D. C., Gilchrist, A. C., Skan, W. J., Royston, M. C., . . . Cross, A. J. (1989). Frontal cortical and left temporal glutamatergic dysfunction in schizophrenia. Journal of Neurochemistry, 52(6), 1781–1786. http://doi.org/ 10.1111/j.1471-4159.1989.tb07257.x De Foss, L., Hodge, S. M., Makris, N., Kennedy, D. N., Caviness, V. S., McGrath, L., . . . Harris, G. J. (2004). Language-association cortex asymmetry in autism and speciﬁc language impairment. Annals of Neurology, 56(6), 757–766. http://doi.org/10.1002/ ana.20275 DelBello, M. P., Strakowski, S. M., Zimmerman, M. E., Hawkins, J. M., & Sax, K. W. (1999). MRI analysis of the cerebellum in bipo- lar disorder: A pilot study. Neuropsychopharmacology, 21(1), 63–68. https://doi.org/10.1016/S0893-133X(99)00026-3 Di Martino, A., Yan, C. G., Li, Q., Denio, E., Castellanos, F. X., Alaerts, K., . . . Milham, M. P. (2014). The autism brain imag- ing data exchange: Towards a large-scale evaluation of the in- trinsic brain architecture in autism. Molecular Psychiatry, 19(6), 659–667. http://doi.org/10.1038/mp.2013.78 Easson, A. K., Fatima, Z., & McIntosh, A. R. (in press). Func- tional connectivity-based subtypes of individuals with and with- out autism spectrum disorder. Network Science. Retrieved from https://doi.org/10.1162/netn_a_00067 Erhardt, E. B., Allen, E. A., Wei, Y., Eichele, T., & Calhoun, V. D. (2012). SimTB, a simulation toolbox for fmri data under a model of spatiotemporal separability. NeuroImage, 59(4), 4160–4167. https://doi.org/10.1016/j.neuroimage.2011.11.088 Forlim, C., Klock, L., Bchle, J., Stoll, L., Giemsa, P., Fuchs, M., . . . Kühn, S. (2017). Disrupted cerebrum-cerebellum network in schizophrenia revealed by network-based statistics and graph theory. BMC Neuroscience, 18(1), 138. Fornito, A., & Bullmore, E. T. (2010). What can spontaneous ﬂuc- tuations of the blood oxygenation-level-dependent signal tell us about psychiatric disorders? Current Opinion in Psychiatry, 23(3), 239–249. http://doi.org/10.1097/yco.0b013e328337d78d Gotts, S., Saad, Z., Jo, H. J., Wallace, G., Cox, R., & Martin, A. (2013). The perils of global signal regression for group com- parisons: A case study of autism spectrum disorders. Frontiers Network Neuroscience 342 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Connectome sorting by consensus clustering in Human Neuroscience, 7, 356. https://www.frontiersin.org/ article/10.3389/fnhum.2013.00356 Herbert, M. R., Harris, G. J., Adrien, K. T., Ziegler, D. A., Makris, N., Kennedy, D. N., . . . Caviness, V. S. (2002). Abnormal asymmetry in language association cortex in autism. Annals of Neurology, 52(5), 588–596. http://doi.org/10.1002/ana.10349 Johnstone, E. C., Owens, D. G., Crow, T. J., Frith, C. D., Alexandropolis, K., Bydder, G., & Colter, N. (1989). Temporal lobe structure as determined by nuclear magnetic resonance in schizophrenia and bipolar affective disorder. Journal of Neurol- ogy, Neurosurgery & Psychiatry, 52(6), 736–741. http://jnnp.bmj. com/content/52/6/736 Johnston-Wilson, N. L., Sims, C. D., Hofmann, J.-P., Anderson, L., Shore, A. D., Torrey, E. F., & Yolken, R. H. (2000). Disease-speciﬁc alterations in frontal cortex brain proteins in schizophrenia, bipo- lar disorder and major depressive disorder. Molecular Psychiatry, 5(2), 142149. https://doi.org/10.1038/sj.mp.4000696 Kühn, S., & Gallinat, J. (2013). Resting-state brain activity in schizo- phrenia and major depression: A quantitative meta-analysis. Schizophrenia Bulletin, 39(2), 358–365. http://doi.org/10.1093/ schbul/sbr151 Lancichinetti, A., & Fortunato, S. (2012). Consensus clustering in complex networks. Scientiﬁc Reports, 2:336. http://doi.org/10. 1038/srep00336 Lee, J. M., Kyeong, S., Kim, E., & Cheon, K.-A. (2016). Abnormalities of inter- and intra-hemispheric functional connectivity in autism spectrum disorders: A study using the autism brain imaging data exchange database. Frontiers in Neuroscience, 10, 191. https:// www.frontiersin.org/article/10.3389/fnins.2016.00191 McArdle, B. H., & Anderson, M. J. (2001). Fitting multivariate models to community data: A comment on distance-based re- dundancy analysis. Ecology, 82(1), 290–297. http://doi.org/10. 1890/0012-9658(2001)082[0290:FMMTCD]2.0.CO;2 McArtor, D. B. (n.d.). MDMR: Multivariate distance matrix re- gression R package. Retrieved from https://CRAN.R-project.org/ package=MDMR McDonald, C., Bullmore, E., Sham, P., Chitnis, X., Suckling, J., Maccabe, J., . . . Murray, R. M. (2005). Regional volume de- viations of brain structure in schizophrenia and psychotic bipo- lar disorder: Computational morphometry study. British Journal of Psychiatry, 186(5), 369377. https://doi.org/10.1192/bjp.186.5.369 Murphy, K., & Fox, M. D. (2017). Towards a consensus regard- ing global signal regression for resting state functional connec- tivity MRI. NeuroImage, 154, 169–173. https://doi.org/10.1016/ j.neuroimage.2016.11.052 Ongür, D., Lundy, M., Greenhouse, I., Shinn, A. K., Menon, V., Cohen, B. M., & Renshaw, P. F. (2010). Default mode network abnormalities in bipolar disorder and schizophrenia. Psychia- try Research: Neuroimaging, 183(1), 59–68. https://doi.org/10. 1016/j.pscychresns.2010.04.008 Poldrack, R. A., Congdon, E., Triplett, W., Gorgolewski, K. J., Karlsgodt, K. H., Mumford, J. A., . . . Bilder, R. M. (2016). A phenome-wide examination of neural and cognitive function. Sci Data, 3, 160110. Rasero, J., Alonso-Montes, C., Diez, Remaki, L., Escudero, I., . . I., Olabarrieta-Landa, L., . Alzheimer’s Disease Neuro- imaging Initiative. (2017). Group-level progressive alterations in brain connectivity patterns revealed by diffusion-tensor brain net- works across severity stages in Alzheimer’s disease. Frontiers in Aging Neuroscience, 9, 215. https://www.frontiersin.org/article/ 10.3389/fnagi.2017.00215 Rasero, J., Diez, I., Cortes, J. M., Marinazzo, D., & Stramaglia, S. (2019). Supporting Information for “Connectome sorting by con- sensus clustering increases separability in group neuroimaging studies.” Network Neuroscience, 3(2), 325–343. https://doi.org/ 10.1162/netn_a_00074 Rasero, J., Pellicoro, M., Angelini, L., Cortes, J. M., Marinazzo, D., & Stramaglia, S. (2017). Consensus clustering approach to group brain connectivity matrices. Network Neuroscience, 1(3), 242–253. https://doi.org/10.1162/NETN_a_00017 Reichardt, J., & Bornholdt, S. (2006). Statistical mechanics of com- munity detection. Physics Review E, 74, 016110. https://link.aps. org/doi/10.1103/PhysRevE.74.016110 Rubinov, M., & Sporns, O. (2010). Complex network measures of brain connectivity: Uses and interpretations. NeuroImage, 52(3), 1059–1069. https://doi.org/10.1016/j.neuroimage.2009.10.003 Shehzad, Z., Kelly, C., Reiss, P. T., Craddock, R. C., Emerson, J. W., McMahon, K., . . . Milham, M. P. (2014). A multivariate distance- based analytic framework for connectome-wide association studies. NeuroImage, 93(Part 1), 74–94. https://doi.org/10.1016/ j.neuroimage.2014.02.024 Shen, X., Tokoglu, F., Papademetris, X., & Constable, R. (2013). Groupwise whole-brain parcellation from resting-state fMRI data for network node identiﬁcation. NeuroImage, 82(Supplement C), 403–415. https://doi.org/10.1016/j.neuroimage.2013.05.081 Song, J., Han, D., Kim, S., Hong, J., Min, K. J., Cheong, J., & Kim, B. (2015). Differences in gray matter volume corresponding to delusion and hallucination in patients with schizophrenia compared with patients who have bipolar disorder. Neuropsy- chiatric Disease and Treatment, 1211. https://doi.org/10.2147/ ndt.s80438 Sporns, O. (2011). Networks of the Brain. Cambridge, MA: MIT Press. Spronk, M., Kulkarni, K., Ji, J. L., Keane, B., Anticevic, A., & Cole, M. W. (2018). A whole-brain and cross-diagnostic perspective on functional brain network dysfunction. bioRxiv, 326728. https:// doi.org/10.1101/326728 Strehl, A., & Ghosh, J. (2002). Cluster ensembles - A knowledge In Eighteenth reuse framework for combining partitionings. National Conference On Artificial Intelligence (AAAI-02)/Fourteenth Innovative Applications of Artiﬁcial Intelligence Conference (IAAI-02), Proceedings (p. 93–98). Cambridge, MA: MIT PRESS. Xu, R., & Wunsch, D. (2005). Survey of clustering algorithms. IEEE Transactions on Neural Networks, 16(3), 645–678. https://doi. org/10.1109/TNN.2005.845141 Zapala, M. A., & Schork, N. J. (2006). Multivariate regression analysis of distance matrices for testing associations between gene expression patterns and related variables. Proceedings of the National Academy of Sciences of the United States of America, 103(51), 19430–19435. http://www.pnas.org/content/ 103/51/19430.abstract l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / / 3 2 3 2 5 1 0 9 2 6 2 6 n e n _ a _ 0 0 0 7 4 p d . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Network Neuroscience 343 METHODS image

Scarica il pdf