MÉTHODES - Recherche en IA spécialisée au MIT

MÉTHODES

Increasing robustness of pairwise methods for
effective connectivity in magnetic resonance
imaging by using fractional moment series
of BOLD signal distributions

Natalia Z. Bielczyk

1,3, Jan K. Buitelaar
Jeffrey C. Glennon1,2, and Christian F. Beckmann 1,2,3,4

1,2, Alberto Llera

1,2,

un accès ouvert

journal

1Donders Institute for Brain, Cognition and Behavior, Nijmegen, the Netherlands
2Department of Cognitive Neuroscience, Radboud University Nijmegen Medical Centre, Nijmegen, the Netherlands
3Radboud University Nijmegen, Nijmegen, the Netherlands
4Oxford Centre for Functional MRI of the Brain, University of Oxford, Oxford, ROYAUME-UNI

Mots clés: Causal
Pairwise causal inference

inference, Effective connectivity, Functional magnetic resonance imaging,

ABSTRAIT

Estimating causal interactions in the brain from functional magnetic resonance imaging
(IRMf) data remains a challenging task. Multiple studies have demonstrated that all current
approaches to determine direction of connectivity perform poorly when applied to synthetic
fMRI datasets. Recent advances in this ﬁeld include methods for pairwise inference, lequel
involve creating a sparse connectome in the ﬁrst step, and then using a classiﬁer in order to
determine the directionality of connection between every pair of nodes in the second step. Dans
this work, we introduce an advance to the second step of this procedure, by building a
classiﬁer based on fractional moments of the BOLD distribution combined into cumulants.
The classiﬁer is trained on datasets generated under the dynamic causal modeling (DCM)
generative model. The directionality is inferred based on statistical dependencies between
the two-node time series, Par exemple, by assigning a causal link from time series of low
variance to time series of high variance. Our approach outperforms or performs as well as
other methods for effective connectivity when applied to the benchmark datasets. Surtout, it
is also more resilient to confounding effects such as differential noise level across different
areas of the connectome.

RÉSUMÉ DE L'AUTEUR

Estimating causal interactions from functional magnetic resonance imaging (IRMf) data is a
formidable task. Recent advances in this ﬁeld include methods for pairwise inference. In the
ﬁrst step of this procedure, connections are revealed by means of functional connectivity. Dans
the second step, every detected connection is analyzed separately to reveal the direction of
the causal links. We introduce an advance to the second step of this procedure by building a
classiﬁer based on the novel concept of fractional moments of the BOLD distribution
combined into cumulants. The classiﬁer is trained on datasets generated under the dynamic
causal modeling (DCM) generative model. Using fractional cumulants gives a measure
resilient to confounding effects such as differential noise levels across different areas of the
connectome.

Citation: Bielczyk, N. Z., Llera, UN.,
Buitelaar, J.. K., Glennon, J.. C., &
Beckmann, C. F. (2019). Increasing
robustness of pairwise methods for
effective connectivity in magnetic
resonance imaging by using fractional
moment series of BOLD signal
distributions. Neurosciences en réseau,
3(4), 1009–1037. https://est ce que je.org/
10.1162/netn_a_00099

EST CE QUE JE:
https://doi.org/10.1162/netn_a_00099

Informations complémentaires:
https://doi.org/10.1162/netn_a_00099

Reçu: 11 May 2018
Accepté: 3 Juin 2019

Intérêts concurrents: Les auteurs ont
a déclaré qu'aucun intérêt concurrent
exister.

Auteur correspondant:
Natalia Bielczyk
natalia.bielczyk@gmail.com

Éditeur de manipulation:
Shella Keilholz

droits d'auteur: © 2019
Massachusetts Institute of Technology
Publié sous Creative Commons
Attribution 4.0 International
(CC PAR 4.0) Licence

La presse du MIT

D
o
w
n
o
un
d
e
d

F
r
o
m
h

t
t

:
/
/

d
je
r
e
c
t
.

je
t
.

e
d
toi
n
e
n
un
r
t
je
c
e
–
p
d

F
/

3
4
1
0
0
9
1
8
6
6
8
0
3
n
e
n
_
un
_
0
0
0
9
9
p
d

b
oui
g
toi
e
s
t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

Improving pairwise inference in fMRI using fractional moments of BOLD

Effective connectivity:
Causal relations between nodes of
the investigated network. In fMRI,
effective connectivity (as opposed to
directed functional connectivity) est
typically derived from a model that
additionally considers the underlying
neuronal processes (voir: dynamic
causal modeling).

Dynamic Causal Modeling:
The most popular generative model
in the fMRI connectivity research. Il
assumes that the observable BOLD
fMRI signal is a product of an
underlying neuronal communication
smoothed with a slow hemodynamic
response.

Functional connectivity:
Statistical associations between
activity in nodes of the investigated
réseau. In fMRI, functional
connectivity is most often evaluated
by means of Pearson/partial
correlation or mutual information.

Pairwise inference:
A two-step causal inference
procedure that reduces causal
inference in a large graph to studying
two-node interactions.

INTRODUCTION

In the context of functional magnetic resonance research, effective connectivity refers to the
process of estimating causal interactions between distinct regions within the brain. Several
characteristics of fMRI data impose severe restrictions on the possibility of estimating such
effective connectivity (Valdes-Sosa, Roebroeck, Daunizeau, & Friston, 2011; Friston, 2011;
Bielczyk et al., 2019). D'abord, the temporal resolution of the image acquisition is low (sampling
rate typically <1 Hz). Furthermore, blood oxygen level–dependent (BOLD) activity is delayed with respect to neuronal ﬁring, with a delay of 3–6 s in the adult human brain (Arichi et al., 2012). The delayed hemodynamic response can also induce spurious cross-correlations between two BOLD time series (Ramsey al., 2010; Bielczyk, Llera, Buitelaar, Glennon, & Beckmann, 2017). Both subject-to-subject and region-to-region variability shape hemodynamic response (Devonshire 2012) provide general limitation methods for effective connectivity research fMRI: when one region faster than in another, temporal precedence peak easily be mistaken causation. Secondly, fMRI data characterized by relatively low signal-to- noise ratio. Within gray matter at ﬁeld strengths 3 T, task-induced signal changes range within 2–3% mean depending on task (Kruger, 2018). Furthermore, the stochastic noise has been shown have scale-free spectral characteristic (He, 2014; Bédard, Kröger, Destexhe, 2006; Dehghani, Cash, Halgren, Destexhe, 2010), which additionally hinders identiﬁability causal structures derived from data (Bielczyk, Beckmann, 2017). Moreover, typical protocols in- volve short (a few hundred samples), estimation conditional probabilities between variables, higher order statistic becomes difﬁcult. Multiple were proposed estimate effective connectivities (Friston, 2011). In computational study Smith al. (2011), range con- nectivity tested synthetic datasets dynamic modeling forward model (DCM; Friston, Harrison, Penny, 2003). this study, most estimating causal interactions remained chance level. One method highlighted as successful at identifying links based Patel’s tau measure (PT; Patel, Bowman, Rilling, 2006; Smith 2011). PT entails two-step approach ﬁrst step involves identifying the (undirected) connections means functional connectivity. This achieved basis of correlations different regions, referred Patel’s kappa (Patel 2011). One note make that, other pairwise inference procedures assume that causation implies correlation. assumption necessary perform infer- ence procedure, is, select reliable further classiﬁcation into upstream and downstream regions. However, although often true, not always the case under certain circumstances (e.g., control systems) causation might be as- sociated correlation (Kennaway, 2015). recent Di Biswal (2019), the authors investigated task-modulated whole-brain connectivity six block-design and event-related cognitive task. By using psychophysiological pairs of regions interest (ROIs) whole brain, authors identiﬁed statistically signiﬁcant task modulations connectivity, reported, “task modulated was found only activated during but were not or deactivated.” suggests considering pairs which activity correlated neglect some underlying connections. There are multiple strategies implementing thresholding connectivity estimates (Varoquaux Craddock, 2013). Since Pearson typically returns dense Network Neuroscience 1010 l D o w n o a d e d f r o m h t t p : > 0).

Fait intéressant, the range of high discriminability is different for real and imaginary compo-
nents. Maximal discriminability in both real and imaginary cumulants exists in the range of k
+ l > 5.0. En plus, in real cumulants there is a region of high discriminability in the range
k + je < 2.0, and in imaginary cumulants there is such region in the range 2.0 < k + l < 5.0 (Figure 3, both boundary lines marked with a white line). This different characteristic along the imaginary axis illustrates that moving from integer to fractional moments of the distribution provides additional predictive power. Note also that as we are deriving the discriminability maps from the DCM generative model. These maps are ﬁngerprints of the particular problem of effective connectivity in fMRI; when derived from another generative model simulating another dataset, the maps would be different. Network Neuroscience 1016 Improving pairwise inference in fMRI using fractional moments of BOLD Figure 3. Discriminative power for all cumulants in range k, l in [0.0, 5.0], in the ideal case of a very long BOLD time series and no background neuronal noise. Maximal discriminability in both real and imaginary cumulants exists in the range of k + l > 5.0. En plus, in real cumulants
there is a region of high discriminability in the range k + je < 2.0, and in imaginary cumulants there is such region in the range 2.0 < k + l < 5.0 (both boundary lines marked with a white line). In order to investigate how the performance of classiﬁcation based on single cumulants changes when a single connection is embed in a bigger network, we evaluated their success rate in estimating connectivity for benchmark synthetic datasets (Smith et al., 2011). Figure 4 presents the grand mean success rate achieved by every cumulant separately, across all 28 benchmark synthetic datasets (Smith et al., 2011). l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . t / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Success rate for all the individual cumulants, averaged over 28 simulations from the Figure 4. synthetic benchmark datasets (Smith et al., 2011). White-edged square denotes a single cumulant used by Hyvärinen and Smith (2013). The performance of this cumulant is shown in the color bar, white band. Black-edged squares denote cumulants that give the highest performance on this dataset. Their performance is presented in the color bar, black band. For the cumulants of high indexes k + l > 4.0 (white line), the success rate is not as high as the discriminability presented
in Figure 3 would suggest. The high success rate is not preserved for high-indexed cumulants that
achieved high discriminability on two-node simulations. The maximal grand mean performance
equals 0.847 for the real components and 0.814 for imaginary components.

Neurosciences en réseau

1017

Improving pairwise inference in fMRI using fractional moments of BOLD

The success rate for each of the 28 separate synthetic datasets (Smith et al., 2011) is pre-
sented in Supporting Information 4. The maps of simulation-dependent success rate relate to
the maps of discriminative power (Chiffre 3), but they are not identical and differ between sim-
ulations. One difference is that for the cumulants of high indexes k + l > 4.0, the success
rate is not as high as the discriminability presented in Figure 3 would suggest. This is because
Chiffre 3 represents the limit of a system of two isolated nodes with inﬁnite SNR, and a very
long BOLD time series, whereas benchmark synthetic datasets refer to a more realistic case
when for each pair of nodes, the time series is short, there are confounding signals from
other nodes in the network, and there is a certain degree of noise in the communication (voir
Informations complémentaires 2). Altogether, these factors make the high moments hard to estimate
in practice.

Combining Fractional Cumulants into a Classiﬁer

We propose to combine information contained in multiple cumulants by building the classiﬁer
based on a “voting” scheme between the cumulants. This classiﬁer determines whether the
map of cumulants obtained for a pair of time series X(t), Oui(t) is closer to the benchmark maps
presented in Figure 2A (which is an evidence for a connection X → Y), or their inverse (lequel
is an evidence for a ﬂipped connection Y → X). Each of the cumulants Ckl votes due to sign
Srk,je, Sik,je (Figure 2C). If the sign of the cumulant is the same as in Figure 2C, it adds to the
evidence for a connection X → Y, and against this connection otherwise.

Since in realistic conditions (short datasets, large TRs), high index cumulants, k + l > 3.0,
yield the aforementioned estimation problem, we discount their contribution in the voting by
using a nonlinearity of a form (further referred to as weighting throughout the manuscript):

F (X) = log(cosh(maximum(X, 0)))

(3)

A similar function was proposed to discount the outliers present in the BOLD time series in
the work by Hyvärinen and Smith (2013). The ﬁnal classiﬁer yields:

{ X → Y i f ∑k,je[Srk,l f (Crk,je) + Sik,l f (Cik,je)] ≥ 0

Y → X

otherwise

in the weighted case, et

{ X → Y i f ∑k,je[Srk,lCrk,je + Sik,lCik,je] ≥ 0

Y → X

otherwise

(4)

(5)

D
o
w
n
o
un
d
e
d

F
r
o
m
h

t
t

:
/
/

d
je
r
e
c
t
.

je
t
.

e
d
toi
n
e
n
un
r
t
je
c
e
–
p
d

F
/

3
4
1
0
0
9
1
8
6
6
8
0
3
n
e
n
_
un
_
0
0
0
9
9
p
d

b
oui
g
toi
e
s
t

o
n
0
7
S
e
p
e
m
b
e
r
2
0
2
3

in the unweighted case.

Supervised Learning Using Synthetic Benchmark Datasets

In this work, we derive the classiﬁer by using sign maps Srk,je, Sik,je (Figure 2C) developed using
multiple realizations of a two-node simulation under the DCM generative model, which is a
form of supervised learning. In a different application and under a different generative model,
these maps would look differently.

Neurosciences en réseau

1018

Improving pairwise inference in fMRI using fractional moments of BOLD

We know that cumulants differ with respect to discriminability (Chiffre 3), and that the suc-
cess rate of cumulants differs depending on the range k + je <= Indmax (Figure 4). Therefore, we optimize the performance of the classiﬁer with respect to these two dimensions by ﬁnding a combination that gives the best grand mean performance across the 28 simulations from the synthetic benchmark datasets, as they represent the variety of experimental conditions en- countered in real-life fMRI setups. First, we ﬁx Indmax = max(k, l) = 3.1, and consider cumulants on a triangle k; l >= 0.1, k +
je < Indmax. We then choose only cumulants of discriminability exceeding a particular value to be fed into the classiﬁer. For instance, a cutoff value of 0.1 means that we include the vote from all cumulants for which the discriminative value is not less than 0.1. We can then evaluate the grand mean success rate (as the mean success rate over all 28 benchmark synthetic datasets) in the function of the thresholding discriminability value. Figure 5A, demonstrates that including l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 5. Dependence of the grand mean performance on synthetic datasets on the choice of cumulants. (A) Grand mean performance for unweighted cumulants in range k + l <= 3.1, in the function of the cutoff discriminative value. The higher cutoff, the less cumulants we take into account while voting for the directionality of the link. The results clearly show that in order to maximize the success rate in estimating effective connectivity, all the cumulants should be taken into account, except for the diagonal of k = l. (B) The grand mean performance based on cumulants of indexes k, l between 0.1 and k + l <= Indmax, in the function of that maximal index. The optimal performance in unweighted case equals 0.835 for [IndRmax; IndImax] = (2.4, 1.7), and 0.886 for [IndRmax; IndImax] = (2.1, 3.7) in weighted case. Network Neuroscience 1019 Improving pairwise inference in fMRI using fractional moments of BOLD all the cumulants with a positive discriminative value (all cumulants except for k = l, for which discriminability is always zero) gives the best classiﬁcation performance. Second, we optimize the window Indmax for indexes k, l and compare between classiﬁer with and without weighting with the discount function introduced in Equation 3. Since dis- criminability is generally higher for low indexes k, l (Figure 3), we will evaluate the grand mean performance based on cumulants of indexes between 0 and a maximum Indmax in the function of that maximum. We consider the maximal indexes along real and imaginary di- mension separately. The results are presented in Figure 5B. The optimal performance in an unweighted case equals 0.835 for [IndRmax, IndImax] = (2.4, 1.7), and 0.886 for [IndRmax, IndImax] = (2.1, 3.7) in a weighted case, which exceeds both the grand mean performance of the “PW-LR skew r” method by Hyvärinen (0.845) and the maximal grand mean performance of any single cumulant in our study (Figure 4, the maximum of 0.847). Selection of Other Approaches for Effective Connectivity Research in fMRI In order to benchmark our classiﬁer, we compare the performance against other methods. As mentioned in the Introduction, the ﬁeld of effective connectivity in fMRI is very wide (Smith et al., 2011); therefore, we restricted ourselves to the most popular approaches (other than DCM itself; Friston et al., 2003): 1. State-space implementation of Granger causality (GC; Granger, 1969; Seth, Barrett, & Barnett, 2015) is a multivariate method inferring effective connectivity between a pair of time series under the assumption that both of them can be expressed as autoregressive processes. We used a simple version of GC featuring ordinary least square regression with lag of 1, implemented in Multivariate Granger Causality Toolbox (Barnett & Seth, 2014), obtained from http://www.sussex.ac.uk/sackler/mvgc. For GC based on VAR pro- cess, as in our study, the state-space implementation is more robust than spectral GC (Geweke, 1982, 1984), because the frequency-domain version has a bias-variance trade- off (a function of the VAR model order that can induce spurious conditional GG causality estimates, such as erroneous peaks in the frequency domain, as indicated in the recent work by Stokes and Purdon, 2017). Furthermore, the state-space formulation of GC is the most robust, mitigating effects of bias and variance due to the fact that the reduced model is VAR rather than VARMA (Barnett & Seth, 2015). In order to compare performance with the methods for pairwise inference, we used GC in a bivariate rather than multivariate fashion: by applying GC to each of the previously found connections separately 2. Partial Directed Coherence (PDC; Baccalá & Sameshima, 2001) is known as a method conceptually close to Directed Transfer Function (Kami ´nski & Blinowska, 1991; Baccalá & Sameshima, 2001). Both these methods are derivatives from Geweke spectral measures of GC (Geweke, 1982, 1984), and all three methods have similar limitations (Chicharro, 2011). However, PDC is used substantially more often in fMRI studies than the other two methods, especially when compared with spectral GC. For this reason, we chose PDC as a method representing this class of approaches. We used PDC implementation from the Extended Multivariate Autoregressive Modelling Toolbox (Faes, Erla, Porta, & Nollo, 2013; http://www.lucafaes.net/emvar.html). As in case of GC, we applied PDC in a bivariate fashion 3. Patel’s tau (PT; Patel et al., 2006), as described in the Introduction, is implemented sim- ilarly as in Smith et al. (2011) by recalculating each time series into the range [0, 1], setting samples under the 10th percentile to 0, over the 90th percentile to 1, and linearly Network Neuroscience 1020 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD mapping the remaining samples to the range [0, 1]. Then, we infer the directionality of connection from the difference between P (X|Y) and P (Y|X). In addition to the previous implementation, however, we also integrate the results over all the possible thresholds in order to eliminate the thresholding problem while calculating the conditional proba- bilities P (X|Y), P (Y|X). 4. Pairwise likelihood ratios methods (PW-LR; Hyvärinen & Smith, 2013) are represented by “PW-LR r skew,” as it gives the highest performance among all the PW-LR methods when applied to synthetic fMRI data. We obtained the codes for the PW-LR methods from https://www.cs.helsinki.ﬁ/u/ahyvarin/code/pwcausal/ (Hyvärinen used the cumulant k; l = (2; 1) weighted with covariance for synthetic benchmark datasets). Therefore, for this comparison, we use the classiﬁer based on fractional cumulants weighted with covari- ance. For our study, we chose a PW-LR r skew version of the method, which involves an inference based on a third cumulant with a discount for outliers (Hyvärinen & Smith, 2013) As in Hyvärinen & Smith (2013), we performed the ﬁrst step of the inference as inverse co- variance thresholded with permutation testing. All the methods, including multivariate meth- ods such as GC and PDC, were then applied in a pairwise fashion (i.e., separately for each two-node network representing a single connection found in the previous step). Furthermore, we did not include the DCM procedure in this comparison, for the same reasons as Smith et al. (2011): DCM is not an exploratory method and using it in this context, namely for exploratory causal research on the set of benchmark synthetic datasets (where the smallest network consists of ﬁve nodes) is not computationally feasible. Furthermore, the characteristics of this synthetic benchmark dataset is that input signals (Figure 1A) represent random events and can therefore emulate all types of fMRI experiments: classic task-fMRI studies, event-related responses or resting-state BOLD time series. In DCM, however, the inputs must be strictly speciﬁed in the block design, otherwise DCM inference cannot be initiated. Therefore, assumptions of DCM do not ﬁt a research problem formulated in this particular way. l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . t / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Testing Robustness of the Methods Against Confounds In addition to evaluating our approach against the existing simulations from Smith et al. (2011), we further evaluate the performance under additional yet typical modes of variation in the data. Speciﬁcally we are interested in characterizing the discriminative performance relative to (i) more complex forms of stochastic noise in the data and (ii) unequal levels of SNR per node. The benchmark synthetic datasets involve temporally uncorrelated, white background noise of a low magnitude on the neuronal level (Smith et al., 2011). This type of noise is not physio- logically plausible, as it is known from physiological studies that in the neuronal net- works, the background noise has a scale-free power spectrum (He, 2014; Bédard et al., 2006; Dehghani et al., 2010; Bielczyk et al., 2017). Therefore, we simulated a two-node system and introduced scale-free (pink) noise to the system. Then, we varied the variance of the noise in the range of [0.2, 5.0] while keeping the amplitude of the inputs si(t) ﬁxed to 1.0. We per- formed 500 realizations of 10 min simulation at high temporal resolution of Fs = 200 Hz, for each conﬁguration of the noise variances. Furthermore, in the original version of the DCM procedure (Friston et al., 2003), as well as in most computational studies (Smith et al., 2011), equal stimulus strengths to both nodes si(t) are assumed. This assumption might not hold true in the real fMRI datasets. Therefore, we performed another, noiseless simulation, in which we varied signal strengths between the Network Neuroscience 1021 Improving pairwise inference in fMRI using fractional moments of BOLD upstream and downstream region. We performed 500 realizations of 10-min simulation at high temporal resolution of Fs = 200 Hz for each conﬁguration of input strengths in the range of [0.2, 5.0]. RESULTS Supervised Learning Using Synthetic Benchmark Datasets The best version of the classiﬁer was obtained for voting between cumulants in the range [IndRmax; IndImax] = (2.1; 3.7), and with the discount for high moment indexes (Equation 3). The comparison of this classiﬁer against four other methods, GC, PDC, PT, and PW-LR r skew, on the benchmark simulation no. 2 is presented in Figure 6. The violin plots denote the distri- bution of the z-scores for connections as compared to the null distribution. Blue dots denote the percentage of correct assignments for the true connections, as in Smith et al. (2011). In most of the other 27 benchmark datasets, our classiﬁer outperforms all the other methods (Support- ing Information 5. As in the original study by Smith et al. (2011), lagged methods, GC, and PDC perform worse than the structural methods. PW-LR r skew and fractional cumulants both outperform PT, most probably because PT is based on the thresholded signal and therefore contains a free parameter. In general, in the benchmark synthetic datasets, fractional cumulants outperform all other techniques in almost all cases, although the difference between performance of the fractional cumulants and PW-LR methods is small. l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . t / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 6. Comparison between the classiﬁer based on the fractional cumulants and several other methods on benchmark simulation no. 2. The violin plots denote the distribution of the z-scores for connections as compared with the null distribution. Blue dots denote the percentage of correct assignments for the true connections (Smith et al., 2011). The difference in performance between the classiﬁer based on fractional cumulants and PW-LR r skew (Hyvärinen & Smith, 2013) is small. Network Neuroscience 1022 Improving pairwise inference in fMRI using fractional moments of BOLD Robustness of the Methods with Respect to Confounds Figure 7 presents the comparison between the fractional cumulant classiﬁer and various other methods on a two-node simulation under the addition of varying levels of physiologically plausible, scale-free neuronal noise across various levels of SNR for the upstream and down- stream regions. The results suggest that all previously tested methods show low levels of ro- bustness toward such additional sources of variability in the data. GC as well as PDC are at the lowest performance, and give results on a chance level across the whole parameter space. In PT, the success rate in proper classiﬁcation between upstream and downstream node rapidly drops toward 50% along with a decrease in SNR in the upstream node. However, in case SNR in the upstream node is higher than 1 (left half of the Patel’s tau panel, Figure 7), PT is resilient to the variance of SNR in the downstream node. The performance of PW-LR r skew drops down to the chance level with respect to both the noise level in the upstream and downstream region, whereas GC and PDC are performing almost equally poorly under any combination of noise variances (probably because the variance of the noise from the ﬁtted autoregressive model is used to establish the directionality of the causal inﬂuence in GC). l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 7. Robustness of the methods against the background scale-free neuronal noise. The variance of the background noise s(t) differs between upstream and downstream region in the range of [0:2; 5:0]. As signal magnitude is constant and equal to 1 in these simulations, the signal-to-noise ratio (SNR) was calculated as 1/var(σ). Network Neuroscience 1023 Improving pairwise inference in fMRI using fractional moments of BOLD l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 8. Robustness of different methods to the change in signal strengths. The variance of the signal differs between upstream and down- stream region, both in the range of [0:2, 5:0]. GC and PDC give performance around the chance level across the whole parameter space, whereas PW-LR r skew and PT exhibit certain resilience toward this variability in the inputs. However, the classiﬁer based on fractional cumulants is the only method whose performance does not fall toward the chance level within the parameter space. Figure 8 presents the comparison between the classiﬁer based on fractional cumulants and other methods, given noiseless simulation and varying signal magnitudes. The classiﬁer based on fractional cumulants is the only method where the performance does not decrease down to a chance level within the chosen parameter space. GC and PDC give performance around the chance level across the whole parameter space, whereas PW-LR r skew and PT exhibit certain resilience toward this variability in the inputs. However, the performance breaks down to the chance level in cases when signal magnitudes between the upstream and downstream node becomes highly disproportionate (higher than 3.0). DISCUSSION Summary This work provides an advance to the effective connectivity research in fMRI, by utilizing the additional information contained in the BOLD time series with use of fractional moments of the BOLD distribution combined into cumulants. Usage of this additional information (embedded within a classiﬁer) signiﬁcantly increases the robustness toward plausible sources of variability Network Neuroscience 1024 Improving pairwise inference in fMRI using fractional moments of BOLD in fMRI, namely presence of physiologically realistic (scale-free) background noise as well as disproportion in the inputs strength, either due to differences in the amount of neuronal activity locally induced and/or due to effective differences induced by, for example, regional variations in the coil sensitivity proﬁles. This is where the value of added information coming from fractional cumulants becomes apparent: among the methods tested in this work, only the classiﬁer based on the fractional cumulants gives a performance better than chance across the whole parameter space in these experimentally realistic conditions. Effective connectivity is a research problem directly related to the notion of causality. Causal- ity is, in general, difﬁcult both to deﬁne (Pearl, 2000) and to measure. In the most basic for- mulation, in X causes Y, it means that without X, Y would not occur. The picture is far less clear in complex dynamic systems such as brain networks: for any event, a high number of potential causes can be deﬁned, and these causes most often interfere with each other. This research problem was recently discussed by Albantakis, Marshall, Hoel, and Tononi (2017) who decomposed causality into independent dimensions: realization, composition, informa- tion, integration, and exclusion. Also, interpretation of a causal component in a given process depends on the context. For example, respiratory movement is typically considered a con- found in fMRI experiments, unless we are interested in the inﬂuence of respiration speed on the activity of neuronal populations. Furthermore, in brain networks, temporal ordering of the cause of cause and effect is hard to maintain, as information is circulating in recurrent rather than feed forward networks (Schurger & Uithol, 2015). Furthermore, If we have an interventional study at disposal, establishing causality becomes straightforward, but this is rarely the case in human brain research. In human fMRI, all the studies are observational rather than interventional. In such cases, causation can never be observed directly, just correlation (Hume, 1772). When a correlation is highly stable, we are inclined to infer a causal link. Additional information is then needed to assess the direction of the assumed causal link, as correlation indicates for association and not for causation (Altman & Krzywinski, 2015). l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . t / In the simulations, we used multiple realizations of the dynamics for every network pattern. We needed to run multiple instantiations of a noisy system in order to evaluate the mean success rate of the methods under noisy circumstances. We also simulated long runs of the dynamics aiming to ﬁnd an upper bound on the methods’ performance in a function of the SNR disproportion and background noise levels. We did not investigate the effects of the sample length on the results. This is because in our study, we focused on the confounding factors which—unlike the duration of the study—cannot be inﬂuenced by the researcher. f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Although fractional moments of a distribution, as a mathematical concept, were studied before (Dremlin, 1994; Matsui& Pawlas, 2013), this concept was not applied to biomedical sciences to date. One reason for this lack of applications might be that the fractional moments become complex numbers for the normalized time series, and that subsequently, the features characterized by these moments cannot be conceptualized as easily as the features character- ized by the integer moments (e.g., skewness can be interpreted as a measure of “asymmetry” of the distribution, and kurtosis can be interpreted as its “ﬂatness”). However, although the fractional moments of a distribution are a mathematical concept with limited practical in- terpretation, nevertheless they could still contain valuable, in our case causal, information. In this work, we demonstrate that fractional moments provide important additional informa- tion about the distribution of BOLD values. We ﬁrst perform supervised learning of the clas- siﬁer on the set of benchmark synthetic datasets, and then validate the classiﬁer on two-node Network Neuroscience 1025 Improving pairwise inference in fMRI using fractional moments of BOLD simulations with biologically realistic confounds. We believe that confounding factors such as a physiological background noise of a magnitude varying between the nodes is important to overcome for any method for causal inference in fMRI. This is because every network in the brain is embedded in larger networks, and therefore the background activity of other inter- connected networks can be interpreted as “noise” (Deco, Jirsa, & McIntosh, 2011). We demon- strate that our approach can increase the robustness of the methods for pairwise inference in fMRI to the main sources of variability in BOLD fMRI. therefore it Unlike the previous methods for pairwise inference in fMRI (Hyvärinen & Smith, 2013), the classiﬁer deﬁned in this study is informed by the dynamic causal modeling generative incorporates the priors derived from the neurophysiological studies model, (Buxton, Wong, & Frank, 1998). Deriving benchmark signum maps for the classiﬁer from the multiple instantiations of the DCM generative model allows for marginalizing out all the param- eters unimportant for the effective connectivity research: the classiﬁcation procedure focuses only on classifying a pair of regions into upstream and downstream instead of ﬁtting all the hyperparameters as is done in the classic DCM inference procedure. Therefore, this approach is a reduction of the problem of effective connectivity in a large network to a two-node classi- ﬁcation problem on one hand, and an extension of the feature space from integer to fractional moments on the other hand. The signum maps derived in the training process are dependent on the generative model. In this study, we chose the canonical, original formulation of the DCM (Friston et al., 2003). There are also newer formulations of the DCM, for example, the canonical microcircuit ap- proach (Pinotsis et al., 2017; Friston et al., 2017), in which layer-speciﬁc neuronal populations in the cortex are modeled with use of a neural mass model. In this case, we assume that the inference is performed on mesoscale level, in which ROIs represent brain regions (cortex re- gions or subcortical nuclei) rather than cortex components. Furthermore, although versions of DCM containing higher order, nonlinear effects (Stephan et al., 2008) are also developed, we believe that a (bi)-linear model is a good simpliﬁcation to describe the underlying neuronal dynamics, as it refers to the linear part of the sigmoidal transfer functions between neuronal populations in the brain (Silver, 2010; Bielczyk, Buitelaar, Glennon, & Tiesinga, 2015). Mod- eling communication between nodes in the network with use of linear transfer functions is a common practice in modeling effective connectivity in fMRI (see, e.g., structural equation models (Mclntosh & Gonzalez-Lima, 1994) or LiNGAM-ICA (Shimizu et al., 2006; Smith et al., 2011). The bilinear version of DCM, often referred to as the general linear model approach, is still a very popular tool for ﬁnding effective connectivity patterns from fMRI in clinical practice (see, e.g., recent work by Zhang et al., 2018; Nackaerts et al., 2018; Arioli et al., 2018; Pool et al., 2018). We reproduced the DCM generative model after Smith et al. (2011). This implementation is acknowledged in the ﬁeld as a benchmark setting for testing new methods for functional and effective connectivity in fMRI (see, e.g., Hyvärinen & Smith, 2013; Hinne et al., 2015; Bielczyk et al., 2019). In this implementation, bilinear effects (namely, modulation of connections by experimental inputs) are not modeled. In pairwise inference this omission is justiﬁed, as mod- ulation of connection only affects the strength of connection weight A12, which will inﬂuence cumulant values quantitatively and not qualitatively, which does not inﬂuence the outcome signum maps. Evaluating methods with the use of synthetic datasets as the ground truth is typically the ﬁrst step in the validation of any new data analytic framework. Validating new methods with use of Network Neuroscience 1026 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . / t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD synthetic datasets is a state of the art technique across the whole ﬁeld of neuroimaging, from single cell imaging to whole-brain imaging with fMRI or EEG/MEG. This tradition has a long history, starting from the Nobel-winning Hodgkin and Huxley model for initiation and propa- gation of action potential (Hodgkin & Huxley, 1952). Today, methods for effective connectivity between neuronal assemblies measured with multielectrode arrays are still validated on syn- thetic datasets generated from this classical model, including recent approaches: nonlinear data assimilation (Hamilton, Berry, Peixoto, & Sauer, 2013) and differential covariance (Lin, Das, Krishnan, Bazhenov, & Sejnowski, 2017). In cognitive neuroimaging, testing methods on synthetic datasets from generative models is also a standard. In EEG/MEG research, there are multiple classes of generative models generating different type of dynamics, depending on the purpose of the modeling study, for example, the nonlinear lumped-parameter model for generating alpha rhythms and its neural mass extension by (David & Friston, 2003), the Wong-Wang model for winner-take-all dynamics (Wong & Wang, 2006), the Hindmarsh-Rose model for epileptor dynamics (Hindmarsh & Rose, 1984), and DCM for EEG/MEG (Kiebel, Garrido, Moran, Chen, & Friston, 2009; Steen, Almgren, Razi, Friston, & Marinazzo, 2018; Moran, Pinotsis, & Friston, 2013). Furthermore, the Human Neocortical Neurosolver simula- tor developed at Brown University (HNN, https://hnn.brown.edu) is a complex tool simulating local ﬁeld potentials measured with EEG/MEG by bottom-up modeling of clusters of neurons. All these tools can serve to validate new methods for functional and effective connectivity in EEG/MEG (Valdes-Sosa et al., 2011; Wang et al., 2014). In fMRI, the selection of generative models is narrower than in EEG/MEG: DCM (Friston, Moran, & Seth, 2013; Smith et al., 2011) achieved a status of the standard generative model. With use of this synthetic data generated from this model, new methods for effective connectiv- ity in fMRI are validated, for example, the third- and fourth-cumulant method by Hyvärinen and Smith (2013) and artiﬁcial immune algorithm combined with the Bayes net method (AIAEC; Ji, Liu, Liang, & Zhang, 2016). To date, DCM is the most biologically relevant generative model proposed in the ﬁeld of functional magnetic resonance imaging. The implementation of benchmark synthetic datasets based on DCM by Smith et al. (2011) has gained a lot of attention and following in the ﬁeld, but it has also gained its critics. For instance, according to Smith’s results, PT (Patel, Bowman, & Rilling, 2006) is one of the methods giving best performance in retrieving directed con- nectivity patterns from synthetic benchmark datasets. In a recent work, Wang, David, Hu, and Deshpande (2017) performed a modeling study on datasets derived from an experiment by David et al. (2008) in which fMRI activity was measured in genetically modiﬁed rats suffering from epilepsy. Activity from the same set of regions was recorded in an associated intracere- bral EEG study in order to provide ground truth information ﬂow. The authors chose primary somatosensory cortex barrel ﬁeld (S1BF), thalamus, and striatum (caudate-putamen; CPu) as ROIs and demonstrated that Patel’s tau is no better than chance in recovering directional con- nectivity patterns from this data in both raw and deconvolved fMRI datasets. On the contrary, DCM and Granger causality proved to correctly estimate the directionality of the information ﬂow on the group level and on the deconvolved data. There are more caveats to be noted with respect to the benchmark DCM simulations by Smith et al. (2011). First, the synthetic datasets derived by the authors of the study involve a low noise condition, in which the background noise in the networks in as low as 5% of the signal magnitude. Given that the background activity in the brain networks includes not only noise but also echo of cognitive processes unrelated to the experiment, 5% of back- ground activity seems to be on the lower end of the spectrum of possibilities. Second, networks Network Neuroscience 1027 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD investigated by Smith et al. are sparse (a number of connections in a network of size N is of order of N) and almost acyclic, which also seems to be a very optimistic scenario. Third, Smith et al. used a TR of 3 s and time series of 200 data points. This TR is too long, and the time series length too short for generalizing the empirical datasets used today. Today, shorter TRs (1 s or less, e.g., 0.72 s as in Human Connectome Project datasets; Van Essen et al., 2013), and longer time series have become the norm (e.g., 4,800 samples in resting-state datasets from Human Connectome Project datasets; Van Essen et al., 2013). Last, Smith et al. used a ﬁxed delays of 50 ms in the ﬁrst layer of the DCM model, representing the underlying neuronal commu- nication. This delay represents synaptic transmission delays and axonal transmission delays between nodes of the network. The constant value of delay is a crude estimation, especially given that pairs of brain regions positioned at different distances from each other should have different axonal transmission delays. Also, given polysynaptic connections, effective delays between neuronal populations might be much higher than the aforementioned 50 ms. For ex- ample, P300 potential appears after 300 ms (Polich, 2007), and some other cortical potentials have even slower latencies. This lack of attention toward modeling neuronal delays might fa- vor nonlagged methods (i.e., PT or LiNGAM) and might be preferred in this analysis over the lagged methods. Altogether, there are reasons to believe that benchmark datasets derived by Smith et al. are, to some extent, not representative of the real fMRI datasets. For these reasons, results of validation on the benchmark datasets should be interpreted with care. There are also other, competitive generative models in the ﬁeld, for example, the model proposed by Seth et al. (2013). In this model, the authors used a simple VAR generative model in order to simulate neuronal dynamics in the testing network of ﬁve regions, based on work by Baccalá and Sameshima (2001). Subsequently, the VAR model output is convolved with ﬁve different HRF kernels generated with use of the difference-of-gamma approach as imple- mented in SPM8 (http://www.ﬁl.ion.ucl.ac.uk/spm/software/spm8/). Another possibility, would be to use local ﬁeld potentials (LFPs) instead of neuronal dynamics simulated as a system of differential equations with delay, and convolve LFPs with HRF (Deshpande, Sathian, & Hu, 2010). However, to date, Smith’s synthetic datasets derived from the DCM generative model remains the benchmark datasets. Furthermore, in this work, we performed the inference on the full BOLD response, without deconvolving the BOLD time series into the neuronal time series. It has been shown in synthetic and empirical data that incorporating a physiologically based model of spatiotemporal hemo- dynamic response function into the preprocessing pipeline leads to an improvement in the esti- mated neuronal activation (Aquino, Robinson, Schira, & Breakspear, 2014). It was also shown that it is generally difﬁcult to accurately recover true task-evoked changes in BOLD fMRI time series irrespectively of the method chosen for modeling HRF response (Lindquist et al., 2009). Hence, there is a long-lasting debate in the ﬁeld of connectomics on whether or not a (blind) hemodynamic deconvolution is necessary to perform the (effective) connectivity research in fMRI (Wu et al., 2013). For instance, structural equation models (Mclntosh & Gonzalez-Lima, 1994) are often applied without deconvolution to fMRI datasets (Schlösser et al., 2003; Zhuang, LaConte, Peltier, Zhang, & Hu, 2005). Our previous theoretical research in synthetic datasets generated from the DCM model suggests that deconvolution is not necessary in effective connectivity research in fMRI if the method used in the study is not lag dependent (Bielczyk et al., 2017). This is because under the assumption that the underlying signal on the neuronal level is in the low-frequency range, the hemodynamic response—as a low-pass ﬁlter—does not affect the signatures of different connectivity patterns present in the outcome BOLD re- sponse. Therefore, in this work, we did not perform the deconvolution step before assessing effective connectivity with any of the tested methods, including Granger causality. In fMRI Network Neuroscience 1028 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / t . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD literature, GC is applied both with (David et al., 2008; Ryali, Supekar, Chen, & Menon, 2011; Ryali et al., 2016; Hutcheson et al., 2015; Wheelock et al., 2014; Sathian, Deshpande, & Stilla, 2013; Goodyear et al., 2016) and without (Zhao et al., 2016; Regner et al., 2016; Chen et al., 2017) use of deconvolution. Recent research suggests that all connectivity methods (includ- ing functional connectivity) will improve their estimation accuracy post-HRF deconvolution (Rangaprakash, Wu, Marinazzo, Hu, & Deshpande, 2018). However, in this work, we chose for implementation without deconvolution, to be consistent with (Smith et al., 2011). Here, we would like to mention that in recent years a lot of progress has been made in the area of modeling local hemodynamics from fMRI datasets. For example, Havlicek et al. (2011) proposed a new approach to modeling hemodynamic response functions based on cubature Kalman ﬁltering. Furthermore, Bush et al. (2015) proposed and validated a meta-algorithm for performing semiblind deconvolution of the BOLD fMRI by using bootstrapping. This method allows for estimating the timing of the underlying neural events stimulating BOLD responses, together with conﬁdence levels. Sreenivasan et al. (2015) proposed a nonparametric blind BOLD deconvolution method based on homomorphic ﬁltering. Last, in our simulations we have set the connection strength to a ﬁxed value of a = 0.9. On this stage, the output of the classiﬁer (Equation 4 and Equation 5) is a binary response, that is, an indication for a connection, either X → Y or Y → X. This indication is based on a linear combination the binary signum maps Sr, Si (Figure 2C) with the values of the cumulants Cr, Ci computed for the given dataset X(t), Y(t), in either weighted or unweighted form. If the connection strength varies, the strength of the coupling between fractional moments in X(t) and Y(t) varies accordingly. Therefore, the absolute values of the associated fractional cumulants will also adjust. If cumulant values scale, then the RHS sum in Equations 4 and 5 will scale accordingly, but the signum of this sum should stay the same. Therefore, in a noiseless case, this classiﬁer should give the same response regardless of the connection strength a. Limitations of the Method It is also important to remember that there are always two independent aspects to a method for causal inference. First, the method should have assumptions grounded in a biologically plau- sible framework relevant for the given research problem. For instance, a method for causal inference in fMRI should respect (1) the confounding, region-, and subject-speciﬁc BOLD dy- namics (Handwerker, Ollinger, & D’Esposito, 2004) and (2) co-occurrence of cause and effect (since the time resolution of the data is low compared with the underlying neuronal dynamics; the causes and their effects most likely happen within the same frame in the fMRI data). The new methods for pairwise inference such as classifying on the basis of fractional cumulants ad- dress this issue by (1) breaking the time order, and performing causal inference on the basis of statistical properties of the distribution of the BOLD samples, and not from the timing of events; and (2) using correlation in order to detect connections. A good counterexample here is GC, which has been proven useful in multiple disciplines. However, there is an ongoing discus- sion on whether or not GC is suited for causal interpretations of fMRI data. On the one hand, theoretical work by Seth, Chorley, and Barnett (2013) and Roebroeck, Formisano, & Goebel (2005) suggest that despite the slow hemodynamics, GC can still be informative about the directionality of causal links in the brain. Second, an estimation procedure needs to be com- putationally stable. Even if the generative model faithfully describes the data, it still depends on the estimation algorithm as to whether the method will return correct results. However, the face validity of the algorithms can only be tested in particular paradigms, in which the ground truth is known. If in the given paradigm the ground truth is unknown, which is most often the case in fMRI experiments, only reliability can be tested. Network Neuroscience 1029 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD Our approach requires certain assumptions. For instance, we assume that on the neuronal level, effects of directed connectivity are linear. This is also an assumption underlying the orig- inal DCM model used in this study. However, it is known that this is not always the case in the neuronal dynamics. For instance, shunting inhibition (Alger & Nicoll, 1979) is a phenomenon in which excitatory potential is reduced by division rather than by subtraction. However, effects such as shunting inhibition typically happen in microscale and should not affect large-scale neuronal dynamics as measured by BOLD fMRI. Therefore, we do not consider effects such as shunting inhibition as plausible confounds to our approach. One crucial limitation of our approach (as well as previous methods such as pairwise like- lihood ratios) is that these techniques only retrieve the net connectivity. Namely, what these methods effectively pick up on is the difference between connectivity strengths, and in a sce- nario where the connectivity strengths X → Y and Y → X are equal, the outcome cumulant map for the system will have lower amplitudes than in case of a unidirectional connection X → Y with the same connection strength. The signiﬁcance of the cumulant values (whether or not the values are signiﬁcantly different from zero) can be established with use of permuta- tion testing. For more details, see Supporting Information 6, Figure 12B. However, since in the ﬁrst step of the inference we select only strong functional links for further classiﬁcation, we can interpret ambiguous output from the classiﬁer as a bidirectional connection. Since the quality of the classiﬁer depends on the ability to determine the directionality of a unidirectional con- nection, we used only unidirectional connections in the validation stage. One point to make here is that bidirectional connections are hard to disambiguate for many methods for effec- tive connectivity, as they represent cyclic nets. This is an issue, for example, in applications of Bayesian nets (Pearl, 2000), in which joint probability for a certain graph is not tractable when propagation of information in the network is cyclic. Also, the third- and fourth-cumulant method by Hyvärinen and Smith (2013) falls into this category, as the asymmetry between third cumulants in the case of bidirectional connections will be equal to zero. One interesting direction for the method development would be also classiﬁcation between excitatory and inhibitory connectivity. This aspect is missing in our study, as we focus on the connections of a positive sign only deriving a classiﬁer able to distinguish between excitatory and inhibitory connections would require deriving an additional set of benchmark Sr and Si maps built on the basis of repetitive simulations of an inhibitory connection, and creating a new set of benchmark synthetic datasets, as the original datasets by Smith et al. (2011) involve excitatory links only. The reason why inhibition is not implemented yet, is because the cu- mulant patterns for inhibitory connection are different from patterns given in Figure 2 (we did not include the pictures in this manuscript though). The classiﬁer in its current form gives a binary decision on whether the connection is going in the direction of X → Y or Y → X. The decision is based on whether the cumulant pattern obtained from the given dataset X(t), Y(t) is more similar to the signum maps derived from DCM for connection X → Y (Figure 2C) or to the inverse of these signum maps. In order to add inhibition to the picture, one would need to extend the number of possible classes by adding signum maps derived from DCM for inhibitory connection X → Y and introducing some metrics of distance to excitatory/inhibitory signum maps. This is the next step to take. One thing to note in addition is whether or not inhibitory effective connectivity is expected in large-scale networks investigated with fMRI; this is a mat- ter for debate. On one hand, it is known that long-distance projections in the brain are mostly excitatory as inhibition is typically local, presented by groups of tightly connected interneu- rons within single brain regions (Markram et al., 2004), which is also modeled in the DCM generative model with use of the self-inhibition term (Friston et al., 2003). On the other hand, several anatomical and physiological studies indicate that there are also long-range inhibitory Network Neuroscience 1030 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD Confounder: A factor latent in the experiment that can inﬂuence the causal inference. It can be a node that projects information to two other nodes in the network, causing a spurious causal association between them, or a background neuronal noise of magnitude varying across the brain. connections, for example, between hippocampus and entorhinal cortex in rats (Melzer et al., 2012). Furthermore, we consider local variability in brain physiology by varying hemodynamic responses between realizations of the simulation and by studying the effects of the scale-free background noise on the resulting effective connectivity measures. The DCM generative model summarizes the current state of knowledge about the mechanisms underlying generation of the BOLD response (Friston et al., 2017; Havlicek et al., 2015; Havlicek, Ivanov, Roebroeck, & Uluda˘g, 2017). Therefore, we do not have efﬁcient ways of incorporating human brain phys- iology into our consideration in any more depth than this model allows for. However, at the same time, we do not consider the inﬂuence of artifacts from the data ac- quisition, such as the effects of movement in the scanner. We believe that the inﬂuence of such confounders should be limited by a proper data preprocessing. For instance, motion artifacts can be reduced with use of new, data-driven protocols for motion artifacts removal such as ICA-AROMA (Pruim et al., 2015) or a censoring-based artifact removal strategy based on vol- ume censoring (Power et al., 2014). Therefore, developing efﬁcient strategies for controlling such confounders is beyond the scope of this paper. l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / t / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Future Research In the context of fMRI research, increasing the granularity of moments in order to better char- acterize the distribution is an especially useful application because the experimental fMRI datasets are short (a few thousand samples at most); therefore, the estimation error for the high-order integer moments of the distribution becomes high. However, the subsequent cu- mulants contain information redundant to a certain extent, as they are correlated for any given time series x(t). We chose the granularity that gives smooth patterns of discriminability (Figure 3), which is Δ k = 0.1. Choosing the optimal moment resolution is a subject to future research; although, we believe that increasing index resolution to substantially less than 0.1 would not be beneﬁcial, yet it would substantially increase the computational cost for the method. In this work, we validated our approach by using synthetic benchmark datasets derived from the DCM generative model. Using generative models is valuable in neuroimaging, in terms of validating new methods as mentioned in point 2, but also in applied research. Especially in the ﬁelds of applied computational psychiatry (Frässle et al., 2018) and network neuroscience in general (Betzel & Bassett, 2017), using generative models is valuable because these models enable inference on model parameters, which afford for some degree of mechanistic inter- pretability on the putative processes underlying the studied phenomena. Generative models, especially DCM, are acknowledged in multiple contexts in the ﬁeld of cognitive neuroimaging, from method validation to application in clinical datasets. However, a valuable method should also give predictions testable in clinics (e.g., lead to more reliable estimation of directed causal inﬂuences during cognitive tasks, lead to better stratiﬁcation of clinical populations in resting state, etc.), which we will also further investigate. In this work, we faithfully reproduced the pipeline after Smith et al. (2011). However, since the original version of DCM (Friston, Harrison, & Penny, 2003) based on the original Balloon- Windkessel model (Buxton, Wong, & Frank, 1998) was published, substantial advancements to the hemodynamic model have been proposed. First, Obata et al. (2004) reported an error in the expression for the outcome BOLD response (Equation 8, Supporting Information 1: DCM forward model). Stephan, Weiskopf, Drysdale, Robinson, & Friston (2007) proposed a more Network Neuroscience 1031 Improving pairwise inference in fMRI using fractional moments of BOLD accurate expression for one of the terms in this formula. Second, Heinzle, Koopmans, den Ouden, Raman, & Stephan (2016) proposed an updated formula for ﬁeld strengths higher than 1.5 T. In the following work, we will take these improvements into account. Furthermore, the approach needs to further be tested and compared against other methods with use of experimental fMRI datasets, such as, for example, the Human Connectome Project data (Van Essen et al., 2013; Barch et al., 2013). Since little is known about causal connections in large-scale brain networks, especially in the resting state, such a validation might be quan- titative (i.e., by means of reliability) rather than qualitative. Alternatively, one could perform such a validation in particular pathways in which one can make assumptions about the direc- tionality of information ﬂow, such as the dorsal and the ventral stream in the visual cortex. One possibility is testing using datasets coming from an interventional study in which neural activity was evoked and, therefore, the ground truth is known: a fused fMRI-optogenetic ex- periment (Ryali et al., 2016), in which intervention with use of optogenetics guarantees causal link between investigated neuronal populations. CONCLUSIONS The ﬁeld of effective connectivity research in fMRI is still growing. For instance, in recent years, multiple algorithms for the graph network search have been developed and applied to fMRI datasets, including independent multiple-sample greedy equivalence search (IMaGES; Ramsey et al., 2010), group iterative multiple model estimation (GIMME; Gates & Molenaar, 2012), or fast greedy equivalence search (FGES; Ramsey et al., 2016). It is material for debate whether or not bivariate or multivariate inference serves a better purpose for effective connectivity research in fMRI. In our work, we focused on the pairwise inference, and we achieved a signiﬁcant im- provement over previous approaches for pairwise estimation of functional causal interactions. Most importantly, the robustness against known sources of variability (possible differences be- tween up- and downstream noise magnitude and possible presence of non-Gaussian scale-free noise) signiﬁcantly increases due to the simultaneous incorporation of multiple aspects of the associated BOLD distributions. We believe that, as this approach based on fractional moments of a distribution increases resilience of the methods for pairwise connectivity to potential con- founds in the experimental data, it can become a generic method to increase the power of causal discovery studies, both in cognitive neuroimaging and beyond. SUPPORTING INFORMATION Supporting information for this article is available at https://doi.org/10.1162/netn_a_00099. ACKNOWLEDGMENTS We thank the whole SIN group at the Donders Institute for Brain, Cognition and Behavior for advice and engagement in the project. AUTHOR CONTRIBUTIONS Natalia Bielczyk: Conceptualization; Formal analysis; Methodology; Validation; Visualization; Writing – Original Draft; Writing – Review & Editing. Alberto Llera: Conceptualization; Formal analysis; Supervision; Writing – Review & Editing. Jeffrey Glennon: Funding acquisition. Jan Buitelaar: Supervision; Writing – Review & Editing. Christian Beckmann: Conceptualization; Formal analysis; Methodology; Supervision; Writing – Review & Editing; Funding acquisition. Network Neuroscience 1032 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . t / / e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD FUNDING INFORMATION Jeffrey Glennon, FP7 Ideas: European Research Council, Award ID: 305697 (OPTIMISTIC). Jeffrey Glennon, European Community’s Seventh Framework Programme (FP7/2007-2013), Award ID: 278948 (TACTICS). Jeffrey Glennon, European Union’s Seventh Framework Pro- gramme, Award ID: 603016 (MATRICS). Christian Beckmann, Wellcome Trust UK Strategic Award, Award ID: 098369/Z/12/Z. Christian Beckmann, Netherlands Organization for Interna- tional Cooperation in Higher Education (NL) Netherlands Organisation for Scientiﬁc Research (NWO-Vidi, Award ID: 864-12-003. REFERENCES Albantakis, L., Marshall, W., Hoel, E., & Tononi, G. (2017). What caused what? A quantitative account of actual causation using dynamical causal networks. ArXiv:1708.06716 [Cs, Math, Stat]. Retrieved from http://arxiv.org/abs/1708.06716 Aldrich, J. (1995). Correlations genuine and spurious in Pearson and Yule. Statistical Science, 10(4), 364–376. https://doi.org/10. 1214/ss/1177009870 Alger, B. E., & Nicoll, R. A. (1979). GABA-mediated biphasic inhibitory responses in hippocampus. Nature, 281, 315–317. https://doi.org/10.1038/281315a0 Altman, N., & Krzywinski, M. (2015). Points of signiﬁcance: Associ- ation, correlation and causation. Nature Methods, 12, 899–900. https://doi.org/10.1038/nmeth.3587 Aquino, K. M., Robinson, P. A., Schira, M. M., & Breakspear, M. (2014). Deconvolution of neural dynamics from fMRI data using a spatiotemporal hemodynamic response function. NeuroImage, 94, 203–215. https://doi.org/10.1016/j.neuroimage. 2014.03.001 Arichi, T., Fagiolo, G., Varela, M., Melendez-Calderon, A., Allievi, A., Merchant, N., . . . Edwards, A. D. (2012). Devel- opment of BOLD signal hemodynamic responses in the hu- man brain. NeuroImage, 63(2), 663–673. https://doi.org/10.1016/ j.neuroimage.2012.06.054 Arioli, M., Perani, D., Cappa, S., Proverbio, A. M., Zani, A., Falini, A., & Canessa, N. (2018). Affective and cooperative social inter- actions modulate effective connectivity within and between the mirror and mentalizing systems. Human Brain Mapping, 39(3), 1412–1427. https://doi.org/10.1002/hbm.23930 Baccalá, L. A., & Sameshima, K. (2001). Partial directed coherence: a new concept in neural structure determination. Biological Cy- bernetics, 84(6), 463–474. https://doi.org/10.1007/PL00007990 Barch, D. M., Burgess, G. C., Harms, M. P., Petersen, S. E., Schlaggar, B. L., Corbetta, M., . . . WU-Minn HCP Consor- tium. (2013). Function in the human connectome: Task-fMRI and individual differences in behavior. NeuroImage, 80, 169–189. https://doi.org/10.1016/j.neuroimage.2013.05.033 Barnett, L., & Seth, A. K. (2014). The MVGC multivariate Granger causality toolbox: A new approach to Granger-causal inference. Journal of Neuroscience Methods, 223, 50–68. https://doi.org/ 10.1016/j.jneumeth.2013.10.018 Barnett, L., & Seth, A. K. (2015). Granger causality for state-space models. Physical Review E, 91(4), 040101. https://doi.org/10. 1103/PhysRevE.91.040101 Bédard, C., Kröger, H., & Destexhe, A. (2006). Does the 1/f fre- quency scaling of brain signals reﬂect self-organized critical states? Physical Review Letters, 97(11), 118102. https://doi.org/ 10.1103/PhysRevLett.97.118102 Berkson, J. (1946). Limitations of the application of fourfold table analysis to hospital data. Biometrics Bulletin, 2(3), 47–53. https:// doi.org/10.2307/3002000 Betzel, R. F., & Bassett, D. S. (2017). Generative models for network neuroscience: Prospects and promise. Journal of The Royal So- ciety Interface, 14(136), 20170623. https://doi.org/10.1098/rsif. 2017.0623 Bielczyk, N. Z., Buitelaar, J. K., Glennon, J. C., & Tiesinga, P. H. E. (2015). Circuit to construct mapping: A mathematical tool for assisting the diagnosis and treatment in major depressive dis- order. Frontiers in Psychiatry, 6. https://doi.org/10.3389/fpsyt. 2015.00029 Bielczyk, N. Z., Llera, A., Buitelaar, J. C., & Beckmann, C. F. (2017). The impact of hemodynamic variability and signal mixing on the identiﬁability of effective connectivity structures in BOLD fMRI. Brain and Behavior, 7(8). https://doi. org/10.1002/brb3.777 J. K., Glennon, Bielczyk, N. Z., Walocha, F., Ebel, P. W., Haak, K. V., Llera, A., Buitelaar, J. K., . . . Beckmann, C. F.(2018). Thresholding functional connectomes by means of mixture modeling. Neuro- Image,171, 402–414. https://doi.org/10.1016/j.neuroimage.2018. 01.003 Bielczyk, N. Z., Uithol, S., van Mourik, T., Anderson, P., Glennon, J. C., & Buitelaar, J. K. (2019). Disentangling casaul webs in the brain using functional magnetic resonance imaging: A review of current approaches. Network Neuroscience, 237–273. https:// doi.org/10.1162/netn_a_00062 Bush, K., Cisler, J., Bian, J., Hazaroglu, G., Hazaroglu, O., & Kilts, C. (2015). Improving the precision of fMRI BOLD signal decon- volution with implications for connectivity analysis. Magnetic Resonance Imaging, 33(10), 1314–1323. https://doi.org/10.1016/ j.mri.2015.07.007 Buxton, R. B., Wong, E. C., & Frank, L. R., (1998). Dynamics of blood ﬂow and oxygenation changes during brain activation: the bal- loon model. Magnetic Resonance in Medicine, 39(6), 855–864. https://doi.org/10.1002/mrm.1910390602 Chen, Y. -C., Xia, W., Chen, H., Feng, Y., Xu, J.-J., Gu, J.-P., . . . Yin, X. (2017). Tinnitus distress is linked to enhanced resting-state func- tional connectivity from the limbic system to the auditory cor- tex. Human Brain Mapping, 38(5), 2384–2397. https://doi.org/ 10.1002/hbm.23525 Network Neuroscience 1033 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD Chicharro, D. (2011). On the spectral formulation of Granger causality. Biological Cybernetics, 105(5–6), 331–347. https://doi. org/10.1007/s00422-011-0469-z David, O., & Friston, K. for MEG/EEG: NeuroImage, 20(3), 1743–1755. https://doi.org/10. 1016/j.neuroimage.2003.07.015 (2003). A neural mass model J. David, O., Guillemain, I., Saillet, S., Reyt, S., Deransart, C., Segebarth, C., & Depaulis, A. (2008). Identifying neural drivers with functional MRI: An electrophysiological validation. PLoS Biology, 6(12). https://doi.org/10.1371/journal.pbio.0060315 Deco, G., Jirsa, V. K., & McIntosh, A. R. (2011). Emerging con- cepts for the dynamical organization of resting-state activity in the brain. Nature Reviews Neuroscience, 12(1), 43–56. https:// doi.org/10.1038/nrn2961 Dehghani, N., Bédard, C., Cash, S. S., Halgren, E., & Destexhe, A. (2010). Comparative power spectral analysis of simultaneous ele- croencephalographic and magnetoencephalographic recordings in humans suggests non-resistive extracellular media. Journal of Computational Neuroscience, 29(3), 405–421. https://doi.org/ 10.1007/s10827-010-0263-2 Deshpande, G., Sathian, K., & Hu, X. (2010). Effect of hemody- namic variability on Granger causality analysis of fMRI. Neuro- Image, 52(3), 884–896. https://doi.org/10.1016/j.neuroimage. 2009.11.060 Di, X., & Biswal, B. B. (2019). Toward task connectomics: Exam- ining whole-brain task modulated connectivity in different task domains. Cerebral Cortex, 29(4), 1572–1583. https://doi.org/10. 1093/cercor/bhy055 Devonshire, I. M., Papadakis, N. G., Port, M., Berwick, J., Kennerley, A. J., Mayhew, J. E. W., & Overton, P. G. (2012). Neurovascular cou- pling is brain region-dependent. NeuroImage, 59(3), 1997–2006. https://doi.org/10.1016/j.neuroimage.2011.09.050 Dremlin, I. M. (1994). Fractional moments of distributions. Journal of Experimental and Theoretical Physics Letters, 59(9), 585–588. Van Essen, D. C., Smith, S. M., Barch, D. M., Behrens, T. E. J., Yacoub, E., Ugurbil, K., . WU-Minn HCP Consortium. . (2013). The WU-minn Human Connectome Project: an overview. NeuroImage, 80, 62–79. https://doi.org/10.1016/j.neuroimage. 2013.05.041 . Faes, L., Erla, S., Porta, A., & Nollo, G. (2013). A framework for assessing frequency domain causality in physiological time se- ries with instantaneous effects. Philosophical, Transactions. Series A, Mathematical, Physical, and Engineering Sciences, 371(1997), 20110618. https://doi.org/10.1098/rsta.2011.0618 Frässle, S., Yao, Y., Schöbi, D., Aponte, E. A., Heinzle, J., & Stephan, K. E. (2018). Generative models for clinical applications in com- putational psychiatry. Wiley Interdisciplinary Reviews. Cognitive Science, 9(3), e1460. https://doi.org/10.1002/wcs.1460 Friedman, J., Hastie, T., & Tibshirani, R. (2008). Sparse inverse lasso. Biostatis- covariance estimation with the graphical tics (Oxford, England), 9(3), 432–441. https://doi.org/10.1093/ biostatistics/kxm045 Friston, K., Moran, R., & Seth, A. K., (2013). Analysing connectiv- ity with Granger causality and dynamic causal modelling. Cur- rent Opinion in Neurobiology, 23(2), 172–178. https://doi.org/ 10.1016/j.conb.2012.11.010 Friston, K. J., Harrison, L., & Penny, W. (2003). Dynamic causal modelling. NeuroImage, 19(4), 1273–1302. Friston, K. J. (2011). Functional and effective connectivity: A re- view. Brain Connectivity, 1(1), 13–36. https://doi.org/10.1089/ brain.2011.0008 Friston, K. J., Preller, K. H., Mathys, C., Cagnan, H., Heinzle, J., Razi, A., & Zeidman, P. (2017). Dynamic causal modelling re- visited. NeuroImage. https://doi.org/10.1016/j.neuroimage.2017. 02.045 Gates, K. M., & Molenaar, P. C. M. (2012). Group search algo- rithm recovers effective connectivity maps for individuals in homogeneous and heterogeneous samples. NeuroImage, 63(1), 310–319. https://doi.org/10.1016/j.neuroimage.2012.06.026 Geweke, J. (1982). Measurement of linear dependence and feed- back between multiple time series. Journal of the American Statistical Association, 77(378), 304–313. https://doi.org/10.1080/ 01621459.1982.10477803 Geweke, J. F. (1984). Measures of conditional linear dependence and feedback between time series. Journal of the American Sta- tistical Association, 79(388), 907–915. https://doi.org/10.1080/ 01621459.1984.10477110 Goodyear, K., Parasuraman, R., Chernyak, S., Madhavan, P., Deshpande, G., & Krueger, F. (2016). Advice taking from humans and machines: An fMRI and effective connectivity study. Fron- tiers in Human Neuroscience, 10, 542. https://doi.org/10.3389/ fnhum.2016.00542 Granger, C. W. J. (1969). Investigating causal relations by econo- metric models and cross-spectral methods. Econometrica, 37(3), 424–438. https://doi.org/10.2307/1912791 Hamilton, F., Berry, T., Peixoto, N., & Sauer, T. (2013). Real-time tracking of neuronal network structure using data assimilation. Physical Review E, 88(5). https://doi.org/10.1103/PhysRevE.88. 052715 Handwerker, D. A., Ollinger, J. M., & D’Esposito, M. (2004). Varia- tion of BOLD hemodynamic responses across subjects and brain regions and their effects on statistical analyses. NeuroImage, 21(4), 1639–1651. https://doi.org/10.1016/j.neuroimage.2003. 11.029 Havlicek, M., Friston, K. J., Jan, J., Brazdil, M., & Calhoun, V. D. (2011). Dynamic modeling of neuronal responses in fMRI us- ing Cubature Kalman ﬁltering. NeuroImage, 56(4), 2109–2128. https://doi.org/10.1016/j.neuroimage.2011.03.005 Havlicek, M., Roebroeck, A., Friston, K., Gardumi, A., Ivanov, D., & Uludag, K. (2015). Physiologically informed dynamic causal modeling of fMRI data. NeuroImage, 122, 355–372. https://doi. org/10.1016/j.neuroimage.2015.07.078 Havlicek, M., Ivanov, D., Roebroeck, A., & Uluda˘g, K. (2017). Determining excitatory and inhibitory neuronal activity from multimodal fMRI data using a generative hemodynamic model. Frontiers in Neuroscience, 11, 616. https://doi.org/10.3389/ fnins.2017.00616 He, B. J. (2014). Scale-free brain activity: Past, present, and future. Trends in Cognitive Sciences, 18(9), 480–487. https://doi.org/10. 1016/j.tics.2014.04.003 Heinzle, J., Koopmans, P. J., den Ouden, H. E. M., Raman, S., & Stephan, K. E. (2016). A hemodynamic model for layered BOLD Network Neuroscience 1034 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD signals. NeuroImage, 125, 556–570. https://doi.org/10.1016/j. neuroimage.2015.10.025 van den Heuvel, M. P., de Lange, S. C., Zalesky, A., Seguin, C., Yeo, B. T. T., & Schmidt, R. (2017). Proportional thresholding in resting-state fMRI functional connectivity networks and conse- quences for patient-control connectome studies: Issues and rec- ommendations. NeuroImage, 152, 437–449. https://doi.org/10. 1016/j.neuroimage.2017.02.005 Hindmarsh, J. L., & Rose, R. M. (1984). A model of neuronal bursting using three coupled ﬁrst order differential equations. Proceedings, of the Royal Society of London. Series B, Biolog- ical Sciences, 221(1222), 87–102. https://doi.org/10.1098/rspb. 1984.0024 Hinne, M., Janssen, R. J., Heskes, T., & van Gerven, M. A. J. (2015). Bayesian estimation of conditional independence graphs im- proves functional connectivity estimates. PLoS Computational Bi- ology, 11(11), e1004534. https://doi.org/10.1371/journal.pcbi. 1004534 Hodgkin, A. L., & Huxley, A. F. (1952). A quantitative description of membrane current and its application to conduction and ex- citation in nerve. The Journal of Physiology, 117(4), 500–544. Hume, D. (1772). Cause and effect. In An Enquiry Concerning Hu- man Understanding. Hutcheson, N. L., Sreenivasan, K. R., Deshpande, G., Reid, M. A., Hadley, J., White, D. M., . . . Lahti, A. C. (2015). Effective connectivity during episodic memory retrieval in schizophrenia participants before and after antipsychotic medication. Human Brain Mapping, 36(4), 1442–1457. https://doi.org/10.1002/hbm. 22714 Hyvärinen, A., Zhang, K., Shimizu, S., & Hoyer, P. O. (2010). Esti- mation of a structural vector autoregression model using non- Gaussianity. Journal of Machine Learning Research, 11(May), 1709–1731. Hyvärinen, A., & Smith, S. M. (2013). Pairwise likelihood ratios for estimation of non-Gaussian structural equation models. Journal of Machine Learning Research, 14(Jan), 111–152. J., Zscheischler, Janzing, D., Mooij, J., J., Zhang, K., Lemeire, Daniušis, P., . . . Schölkopf, B. (2012). Information-geometric approach to inferring causal directions. Artiﬁcial Intelligence, 182–183, 1–31. https://doi.org/10.1016/j.artint.2012.01.002 Ji, J., Liu, J., Liang, P., & Zhang, A. (2016). Learning effective con- nectivity network structure from fMRI data based on artiﬁcial im- mune algorithm. PloS One, 11(4), e0152600. https://doi.org/10. 1371/journal.pone.0152600 Kami ´nski, M. J., & Blinowska, K. J. (1991). A new method of the description of the information ﬂow in the brain structures. Bio- logical Cybernetics, 65(3), 203–210. Kennaway, R. (2015). When causation does not imply correlation: Robust violations of the faithfulness axiom. ArXiv:1505.03118 [Math, Stat]. Retrieved from http://arxiv.org/abs/1505.03118 Kiebel, S. J., Garrido, M. I., Moran, R., Chen, C.-C., & Friston, K. J. (2009). Dynamic causal modeling for EEG and MEG. Human Brain Mapping, 30(6), 1866–1876. https://doi.org/10.1002/hbm. 20775 Kötter, R., & Stephan, K. E. (2003). Network participation indices: Characterizing component roles for information processing in neural networks. Neural Networks, 16(9), 1261–1275. https:// doi.org/10.1016/j.neunet.2003.06.002 Kruger, P. (2018). Why it is not a ‘failure’ to leave academia. Nature, 560, 7716. https://doi.org/10.1038/d41586-018-05838-y Ledoit, O., & Wolf, M. (2004). A well-conditioned estimator for large-dimensional covariance matrices. J. Multivar. Anal., 88(2), 365–411. https://doi.org/10.1016/S0047-259X(03)00096-4 Lin, T. W., Das, A., Krishnan, G. P., Bazhenov, M., & Sejnowski, T. J. (2017). Differential covariance: A new class of methods to esti- mate sparse connectivity from neural recordings. Neural Com- putation, 29(10), 2581–2632. https://doi.org/10.1162/neco_a_ 01008 Lindquist, M. A., Loh, J. M., Atlas, L. Y., & Wager, T. D. (2009). Mod- eling the hemodynamic response function in fMRI: efﬁciency, bias and mis-modeling. Neuroimage, 45(1 Suppl), S187–S198. https://doi.org/10.1016/j.neuroimage.2008.10.065 Markram, H., Toledo-Rodriguez, M., Wang, Y., Gupta, A., Silberberg, G., & Wu, C. the neo- (2004). cortical inhibitory system. Nature Reviews Neuroscience, 5(10), 793–807. https://doi.org/10.1038/nrn1519 Interneurons of Marrelec, G., Krainik, A., Duffau, H., Pélégrini-Issac, M., Lehéricy, S., Doyon, J., & Benali, H. (2006). Partial correlation for func- tional brain interactivity investigation in functional MRI. Neuro- Image,32(1), 228–237.https://doi.org/10.1016/j.neuroimage.2005. 12.057 Matsui, M., & Pawlas, Z. (2013). Fractional absolute moments of heavy tailed distributions. ArXiv:1301.4804 [Math, Stat]. Re- trieved from http://arxiv.org/abs/1301.4804 Mclntosh, A. R., & Gonzalez-Lima, F. (1994). Structural equation modeling and its application to network analysis in functional brain imaging. Human Brain Mapping, 2(1–2), 2–22. https://doi. org/10.1002/hbm.460020104 Melzer, S., Michael, M., Caputi, A., Eliava, M., Fuchs, E. C., Whittington, M. A., & Monyer, H. (2012). Long-range–projecting GABAergic neurons modulate inhibition in hippocampus and entorhinal cortex. Science, 335(6075), 1506–1510. https://doi. org/10.1126/science.1217139 Moran, R. J., Pinotsis, D. A., & Friston, K. J. (2013). Neural masses and ﬁelds in dynamic causal modeling. Frontiers in Compu- tational Neuroscience, 7. https://doi.org/10.3389/fncom.2013. 00057 Nackaerts, E., Michely, J., Heremans, E., Swinnen, S. P., Smits- Engelsman, B. C. M., Vandenberghe, W., . . . Nieuwboer, A. (2018). Training for micrographia alters neural connectivity in Parkinson’s disease. Frontiers in Neuroscience, 12, 3. https://doi. org/10.3389/fnins.2018.00003 Nie, L., Yang, X., Matthews, P. M., Xu, Z., & Guo, Y. (2015). Minimum partial correlation: An accurate and parameter-free measure of functional connectivity in fMRI. In Y. Guo, K. Friston, F. Aldo, S. Hill, & H. Peng (Eds.), Brain Informatics and Health (Vol. 9250, 125–135). Cham: Springer International Publishing. Obata, T., Liu, T. T., Miller, K. L., Luh, W.-M., Wong, E. C., Frank, L. R., & Buxton, R. B. (2004). Discrepancies between BOLD and ﬂow dynamics in primary and supplementary motor areas: Appli- cation of the balloon model to the interpretation of BOLD tran- sients. NeuroImage, 21(1), 144–153. https://doi.org/10.1016/j. neuroimage.2003.08.040 Patel, R. S., Bowman, F. D., & Rilling, J. K. (2006). A Bayesian the human brain. approach to determining connectivity of Network Neuroscience 1035 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d / . t f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD Human Brain Mapping, 27(3), 267–276. https://doi.org/10.1002/ hbm.20182 J. (2000). Causality: Models, Reasoning, and Inference Pearl, Cambridge, UK: Cambridge University Press. Pinotsis, D. A., Geerts, J. P., Pinto, L., FitzGerald, T. H. B., Litvak, V., Auksztulewicz, R., & Friston, K. J. (2017). Linking canonical microcircuits and neuronal activity: Dynamic causal modelling of laminar recordings. NeuroImage, 146, 355–366. https://doi.org/ 10.1016/j.neuroimage.2016.11.041 Polich, J. (2007). Updating P300: An integrative theory of P3a and P3b. Clinical Neurophysiology, 118(10), 2128–2148. https://doi. org/10.1016/j.clinph.2007.04.019 Pool, E.-M., Leimbach, M., Binder, E., Nettekoven, C., Eickhoff, S. B., Fink, G. R., & Grefkes, C. (2018). Network dynamics en- gaged in the modulation of motor behavior in stroke patients. Human Brain Mapping, 39(3), 1078–1092.https://doi.org/10.1002/ hbm.23872 Power, J. D., Mitra, A., Laumann, T. O., Snyder, A. Z., Schlaggar, B. L., & Petersen, S. E. (2014). Methods to detect, characterize, and remove motion artifact in resting state fMRI. NeuroImage, 84. https://doi.org/10.1016/j.neuroimage.2013.08.048 Pruim, R. H. R., Mennes, M., van Rooij, D., Llera, A., Buitelaar, J. K., & Beckmann, C. F. (2015). ICA-AROMA: A robust ICA-based strategy for removing motion artifacts from fMRI data. Neuro- Image,112,267–277. https://doi.org/10.1016/j.neuroimage.2015. 02.064 Ramsey, J. D., Hanson, S. J., Hanson, C., Halchenko, Y. O., Poldrack, R. A., & Glymour, C. (2010). Six problems for causal inference from fMRI. NeuroImage, 49(2), 1545–1558. https://doi. org/10.1016/j.neuroimage.2009.08.065 Rangaprakash, D., Wu, G.-R., Marinazzo, D., Hu, X., & Deshpande, G. (2018). Hemodynamic response function (HRF) variability confounds resting-state fMRI functional connectivity. Magnetic Resonance in Medicine, 80(4), 1697–1713. https://doi. org/10.1002/mrm.27146 Regner, M. F., Saenz, N., Maharajh, K., Yamamoto, D. J., Mohl, B., Wylie, K., . . . Tanabe, J. (2016). Top-down network effec- tive connectivity in abstinent substance dependent individuals. PLoS One, 11(10), e0164818. https://doi.org/10.1371/journal. pone.0164818 Roebroeck, A., Formisano, E., & Goebel, R. (2005). Mapping di- rected inﬂuence over the brain using Granger causality and fMRI. NeuroImage, 25(1), 230–242. https://doi.org/10.1016/j. neuroimage.2004.11.017 Ryali, S., Supekar, K., Chen, T., & Menon, V. (2011). Multivari- ate dynamical systems models for estimating causal interactions in fMRI. NeuroImage, 54(2), 807–823. https://doi.org/10.1016/j. neuroimage.2010.09.052 Ryali, S., Shih, Y.-Y. I., Chen, T., Kochalka, J., Albaugh, D., Fang, Z., . . . Menon, V. (2016). Combining optogenetic stimulation and fMRI to validate a multivariate dynamical systems model for estimating causal brain interactions. NeuroImage, 132, 398–405. https://doi.org/10.1016/j.neuroimage.2016.02.067 Sathian, K., Deshpande, G., & Stilla, R. (2013). Neural changes with tactile learning reﬂect decision-level reweighting of perceptual readout. Journal of Neuroscience, 33(12), 5387–5398. https:// doi.org/10.1523/JNEUROSCI.3482-12.2013 Schlösser, R., Gesierich, T., Kaufmann, B., Vucurevic, G., Hunsche, S., Gawehn, J., & Stoeter, P. (2003). Altered effective connectivity during working memory performance in schizophrenia: A study with fMRI and structural equation modeling. NeuroImage, 19(3), 751–763. Schurger, A., & Uithol, S. (2015). Nowhere and everywhere: The causal origin of voluntary action. Review of Philosophy and Psy- chology, 6(4), 761–778. Seth, A. K., Chorley, P., & Barnett, L. C. (2013). Granger causal- ity analysis of fMRI BOLD signals is invariant to hemodynamic convolution but not downsampling. NeuroImage, 65, 540–555. https://doi.org/10.1016/j.neuroimage.2012.09.049 Seth, A. K., Barrett, A. B., & Barnett, L. (2015). Granger causal- ity analysis in neuroscience and neuroimaging. The Journal of Neuroscience, 35(8), 3293–3297. https://doi.org/10.1523/ JNEUROSCI.4399-14.2015 Sgouritsa, E., Janzing, D., Hennig, P., & Schölkopf, B. (2015). In- ference of cause and effect with unsupervised inverse regres- sion. In Proceedings of the UAI 2015 Conference on Advances in Causal Inference - Volume 1504. (pp. 89–89) Aachen, Germany, Germany: CEUR-WS.org. Retrieved from http://dl.acm.org/ citation.cfm?id=3020267.3020280 Shimizu, S., Hoyer, P. O., Hyvärinen, A., & Kerminen, A. (2006). A linear non-Gaussian acyclic model for causal discovery. Journal of Machine Learning Research, 7, 2003–2030. Silver, R. A. (2010). Neuronal arithmetic. Nature Reviews Neuro- science, 11(7), 474–489. https://doi.org/10.1038/nrn2864 Smith, S. M., Miller, K. L., Salimi-Khorshidi, G., Webster, M., Beckmann, C. F., Nichols, T. E., . . . Woolrich, M. W. (2011). Network modelling methods for FMRI. NeuroImage, 54(2), 875–891. https://doi.org/10.1016/j.neuroimage.2010.08.063 Sreenivasan, K. R., Havlicek, M., & Deshpande, G., (2015). Non- parametric hemodynamic deconvolution of FMRI using homo- morphic ﬁltering. IEEE Transactions on Medical Imaging, 34(5), 1155–1163. https://doi.org/10.1109/TMI.2014.2379914 Steen, F. V., Almgren, H. B. J., Razi, A., Friston, K. J., & Marinazzo, D. (2018). Dynamic causal modelling of ﬂuctuating connectivity in resting-state EEG. BioRxiv, 303933. https://doi.org/10.1101/ 303933 Stephan, K. E., Weiskopf, N., Drysdale, P. M., Robinson, P. A., & Friston, K. J. (2007). Comparing hemodynamic models with DCM. NeuroImage, 38(3), 387–401. https://doi.org/10.1016/j. neuroimage.2007.07.040 Stephan, K. E., Kasper, L., Harrison, L. M., Daunizeau, J., den Ouden, H. E. M., Breakspear, M., & Friston, K. J. (2008). Nonlinear dynamic causal models for fMRI. NeuroImage, 42(2), 649–662. https://doi.org/10.1016/j.neuroimage.2008.04.262 Stokes, P. A., & Purdon, P. L. (2017). A study of problems encoun- tered in Granger causality analysis from a neuroscience perspec- tive. Proceedings of the National Academy of Sciences of the United States of America, 114(34), E7063–E7072. https://doi.org/ 10.1073/pnas.1704663114 Valdes-Sosa, P. A., Roebroeck, A., Daunizeau, J., & Friston, K. (2011). Effective connectivity: Inﬂuence, causality and biophys- ical modeling. NeuroImage, 58(2), 339–361. https://doi.org/10. 1016/j.neuroimage.2011.03.058 Network Neuroscience 1036 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d . t / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Improving pairwise inference in fMRI using fractional moments of BOLD Varoquaux, G., & Craddock, R. C. (2013). Learning and compar- ing functional connectomes across subjects. NeuroImage, 80, 405–415. https://doi.org/10.1016/j.neuroimage.2013.04.007 Wang, H. E., Bénar, C. G., Quilichini, P. P., Friston, K. J., Jirsa, V. K., & Bernard, C. (2014). A systematic framework for functional con- nectivity measures. Frontiers in Neuroscience, 8. https://doi.org/ 10.3389/fnins.2014.00405 Wang, Y., David, O., Hu, X., & Deshpande, G. (2017). Can Patel’s τ accurately estimate directionality of connections in brain networks from fMRI? Magnetic Resonance in Medicine, 78(5), 2003–2010. https://doi.org/10.1002/mrm.26583 Wheelock, M. D., Sreenivasan, K. R., Wood, K. H., Ver Hoef, L. W., Deshpande, G., & Knight, D. C. (2014). Threat-related learning relies on distinct dorsal prefrontal cortex network connectivity. NeuroImage, 102 Pt 2, 904–912. https://doi.org/10. 1016/j.neuroimage.2014.08.005 Wong, K.-F., & Wang, X.-W. (2006). A recurrent network mech- Journal anism of of Neuroscience, 26(4), 1314–1328. https://doi.org/10.1523/ JNEUROSCI.3733-05.2006 time integration in perceptual decisions. Wu, G.-R., Liao, W., Stramaglia, S., Ding, J.-R., Chen, H., & Marinazzo, D. (2013). A blind deconvolution approach to re- cover effective connectivity brain networks from resting state fMRI data. Medical Image Analysis, 17(3), 365–374. https://doi. org/10.1016/j.media.2013.01.003 Zhang, L., Opmeer, E. M., van der Meer, L., Aleman, A., ´Curˇci´c- Blake, B., & Ruhé, H. G. (2018). Altered frontal-amygdala effec- tive connectivity during effortful emotion regulation in bipolar disorder. Bipolar Disorders, 20(4), 349–358. https://doi.org/10. 1111/bdi.12611 Zhao, Z., Wang, X., Fan, M., Yin, D., Sun, L., Jia, J., . . . Gong, J. (2016). Altered effective connectivity of the primary motor cor- tex in stroke: A resting-state fMRI study with Granger Causality analysis. PloS One, 11(11), e0166210. https://doi.org/10.1371/ journal.pone.0166210 Zhuang, J., LaConte, S., Peltier, S., Zhang, K., & Hu, X. (2005). Connectivity exploration with structural equation modeling: An fMRI study of bimanual motor coordination. NeuroImage, 25(2), 462–470. https://doi.org/10.1016/j.neuroimage.2004.11. 007 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / / t e d u n e n a r t i c e - p d l f / / / / 3 4 1 0 0 9 1 8 6 6 8 0 3 n e n _ a _ 0 0 0 9 9 p d t / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Network Neuroscience 1037 METHODS image

Télécharger le PDF