RESEARCH ARTICLE - Am MIT spezialisierte KI-Forschung

RESEARCH ARTICLE

Connected from the outside: The role of U.S.
regions in promoting the integration of the
European research system

Keine offenen Zugänge

Tagebuch

Emanuele Rabosio1

, Lorenzo Righetto1

, Alessandro Spelta2

, and Fabio Pammolli1,3

1Human Technopole, Center for Analysis, Decisions and Society, Milano, Italien
2Università di Pavia, Dipartimento di Scienze Economiche e Aziendali, Pavia, Italien
3Politecnico di Milano, Dipartimento di Ingegneria Gestionale, Milano, Italien

Zitat: Rabosio, E., Righetto, L.,
Spelta, A., & Pammolli, F. (2020).
Connected from the outside: Die Rolle
of U.S. regions in promoting the
integration of the European research
System. Quantitative Science Studies,
1(2), 894–917. https://doi.org/10.1162/
qss_a_00034

DOI:
https://doi.org/10.1162/qss_a_00034

Erhalten: 22 Februar 2019
Akzeptiert: 27 Januar 2020

Korrespondierender Autor:
Emanuele Rabosio
emanuele.rabosio@fht.org

Handling-Editor:
Vincent Larivière

Schlüsselwörter: EU R&D integration, patent coinventorship network, resistance distance

ABSTRAKT

Considerable efforts have been deployed by the European Union to create an integrated
Forschung & Development area. In diesem Papier, we focus on the structure and evolution of the
European collaboration network as reflected by patent data. We study patent networks
representing collaborations between inventors located in different geographic areas. Existing
studies seem to indicate an increasing integration of the European research system, but none of
them has investigated which regions contribute most to this integration. We analyze the patent
coinventorship network to measure network-based distances between regions through
multiple metrics, in order to evaluate the role of different areas for the integration of the EU R&D
System. We study changes of the average closeness between European regions belonging to
different countries. Insbesondere, we perform a counterfactual exercise, simulating the impact
on EU integration of the removal of countries and individual regions. Our findings reveal an
important contribution from U.S. regions in favoring EU integration. Insbesondere, the size
and the density of the U.S. System, together with the presence of a few regional hubs, play a key
role in reducing the distances between European regions.

EINFÜHRUNG

Achieving strong integration between member countries is a primary goal for the European
Union (EU). In research & Entwicklung (R&D), specific policies have been implemented
(Nedeva & Stampfer, 2012; Scherngell & Barber, 2011). The Framework Programs for
Research and Technological Development are an example of such policies.

The EU R&D system has been analyzed in depth in the literature, with contrasting results.
Hoekman, Frenken, and Tijssen (2010) and Miguelez and Moreno (2013) have found that the
bias to collaborate within the same EU country has diminished over time. Morescalchi,
Pammolli, et al. (2015) have underlined that this decrease has stopped since the mid-1990s.
Chessa, Morescalchi, et al. (2013), darüber hinaus, have highlighted that the EU integration growth
might have been driven by trends toward globalization of research more than by the aforemen-
tioned EU-specific efforts.

In this paper we study a related, though different, Problem. Tatsächlich, we aim to understand
which countries and regions contribute most to the integration of the European R&D system.

Urheberrechte ©: © 2020 Emanuele Rabosio,
Lorenzo Righetto, Alessandro Spelta,
and Fabio Pammolli. Published under a
Creative Commons Attribution 4.0
International (CC BY 4.0) Lizenz.

Die MIT-Presse

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

Our study includes both the EU and the U.S., to shed light on the role that relevant external
agents play in European integration. To the best of our knowledge, ours is the first attempt to
tackle this issue.

Networks of innovators (Orsenigo, Pammolli, & Riccaboni, 2001; Owen-Smith, Riccaboni,
et al., 2002; Powell & Grodal, 2005) can be analyzed to assess interregional connections. Wir
focus on the patent coinventorship network (Chessa et al., 2013; Morescalchi et al., 2015),
where nodes are regions, and edges are weighted by the number of coinventions occurring
between regions.

We employ the resistance distance (Klein & Randić, 1993) to measure distances within the
Netzwerk. Resistance distance takes into account the path(S) that must be covered on the net-
work to join two nodes. Auch, the resistance distance between two nodes of a network repre-
sents the expected time that a random walk needs to move from the first node to the second
eins (von Luxburg, Radl, & Hein, 2010). In unserem Fall, this measure can be considered as a proxy
of the velocity of the information flow (Stephenson & Zelen, 1989) along the network, welche
takes into account not only the shortest paths between the nodes (Goddard & Oellermann,
2011) but also longer ones, because information may flow indirectly on the network also
on these paths (Bozzo & Franceschet, 2013).

To evaluate the contribution of individual countries and regions to EU integration (d.h., their
integration capability), we first define an indicator of EU integration on the basis of the close-
ness centrality between EU regions belonging to different countries in the technological col-
laboration network. Dann, the integration capability of a country or region is quantified by
measuring the difference in the indicator value when that same country or region is removed
from the network. Our analyses are focused on patent data and therefore, as discussed in
Arora, Belenzon, and Patacconi (2018), Arora, Belenzon, et al. (2019), and Arora, Fosfuri,
and Gambardella (2004), are biased toward development activities rather than toward research
Aktivitäten. As a consequence, the knowledge flows that we are investigating are more related
to technological knowledge than to scientific knowledge.

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

1.1. Summary of the Results

Our main findings are the following:

(cid:129) The countries exhibiting the largest contribution to EU R&D integration are Germany
und die Vereinigten Staaten, with the latter being more relevant than most EU countries.
(cid:129) In diesem Kontext, we find that a considerable fraction of the regions that are most relevant
for EU R&D integration are located in the United States, rather than within EU borders.
(cid:129) The smallest EU countries turn out to be those benefiting most from the U.S. contribution

to establish an indirect connection to other EU countries.

1.2. Paper Structure

The paper is organized as follows. Abschnitt 2 summarizes the previous studies on the border
and distance effects on the intensity of collaborations. Abschnitt 3 describes the Regpat data set
that has been employed in this work, and introduces the indicator we use to measure the in-
tegration capability. Abschnitt 4 shows a set of analyses on the coinventor network. Erste, Wir
propose some descriptive statistics and pictures, to provide an initial understanding of the
structure of the network (section 4.1). Zweite, the integration capability of countries (section 4.2)
and individual regions (section 4.3) is analyzed. Dritte, the previous results are deepened to

Quantitative Science Studies

895

The role of U.S. regions in promoting the integration of the European research system

understand which EU countries rely most on the United States to connect to other EU
Länder (section 4.4). Endlich, section 5 concludes the paper.

2. BACKGROUND

In den vergangenen Jahren, several studies have analyzed the effects of geography on R&D collaborations.
Insbesondere, the intensity of R&D collaborations between regions (d.h., their “R&D closeness”)
has been studied based on geographical distance and on belonging to the same country. Der
intuition suggests that in a globalized world, where low transport costs, ICT facilities, Und
widespread knowledge of the English language are making communication between widely
separated people easier, geographical factors should play a marginal role in determining the
collaboration intensity between two regions (Frenken, Hoekman, et al., 2009; Singh & Marx,
2013). Jedoch, the analyses proposed so far in the literature, relying on different data and
Werkzeuge, have produced conflicting conclusions.

Among the papers supporting a decrease in the importance of geographical factors over
Zeit, Brun, Carrère, Guillaumont, et al. (2005) consider the trade scenario; the authors pro-
pose a gravity model generated from data of the United Nations Commodity Trade Statistics,
where the effect of physical distance on the trade volume between countries is shown to di-
minish over time. Waltman, Tijssen, and van Eck (2011), in contrast, consider Web of Science
(WoS) data on scientific publications, and compute for each paper the greatest distance be-
tween the addresses of the authors; they observe that, in spite of differences between scientific
sectors, there is a clear trend of increasing distance over time.

Other studies, Jedoch, claim alternative evidence. Ponds (2009) studies international col-
laborations employing a probit regression on copublication data involving Dutch institutions;
he finds that these collaborations grow, but at the same pace as the national ones. Maisonobe,
Eckert, et al. (2016) build a copublication network between cities using data from the Science
Citation Index Expanded, and find that in most countries domestic collaborations grow faster
than international ones.

In the EU, an increase in collaborations between countries might be favored not only by the
trend toward globalization of research, but also by the specific policies undertaken. Hoekman
et al. (2010) apply a gravity model to copublication data from WoS, finding that the bias to-
ward collaborating with partners from the same EU country decreases over time, while the bias
toward cooperators that are geographically close does not. Miguelez and Moreno (2013) em-
ploy a gravity model to study the patent regional coinventor network; similar to Hoekman
et al. (2010), they find that the importance of belonging to the same country diminishes over
Zeit, while the distance effect actually grows. Chessa et al. (2013) propose difference-in-
differences estimates on four regional networks, concluding that integration between EU coun-
tries is growing, but no more than one would expect due to research globalization trends.
Morescalchi et al. (2015) claim, through a gravity model on patent regional networks, that dis-
tance and country effects within the EU decreased only until the mid-1990s. Another gravity
model for patent data is introduced by Cappelli and Montobbio (2016), who share the view that
the effects of distance and national borders within the EU are decreasing over time. Endlich,
Doria Arrieta, Pammolli, and Petersen (2017), using publications data, show that the 2004/
2007 EU enlargement has had a negative impact on cross-border collaborations.

The above results, though conflicting in some respects, show signs of a pattern toward high
R&D integration within the EU, while the effectiveness of European policies has not been fully

Quantitative Science Studies

896

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

Tisch 1. Basic statistics on the Regpat data set

# patents

# Regionen

# Länder

# coinvented patents with inventors from different regions

that are also coassigned

3,175,990

5,520

1,171,993

112,915

demonstrated. In this work we study EU integration from a different point of view. We intro-
duce a different way to measure the R&D closeness between regions, not relying only on the
intensity of direct collaborations (z.B., number of coinventorships or coauthorships) but con-
sidering also indirect connections. The introduction of an indirect measure allows us to under-
stand which countries and regions provide the greatest contribution to EU integration, fostering
the connection of EU countries and regions.

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

3. DATA AND METHODS

3.1. Data

The data employed in this study are drawn from the OECD Regpat database (Marsch 2018 ver-
sion), containing all patent applications filed with the European Patent Office (EPO). Each pat-
ent is associated with its inventors, whose geographic location, in terms of NUTS3 regions, Ist
also known. Tisch 1 reports some basic statistics related to the Regpat data set. Note that only
9.6% of the patents that are coinvented by inventors coming from different regions are also
coassigned to multiple institutions. This happens because many coinventor relations are re-
ferred to inventors working in different subsidiaries of multinational firms. daher, Die
coinventorship network reflects for a relevant part the organization of work between firms
and their subsidiaries. Trotzdem, we are interested in patterns of knowledge flows between regions,
so we maintain that the embedding of new knowledge in the collaborating regions is relevant
irrespectively of institutional boundaries.

The data set globally contains 5,520 Regionen, but in our analyses we consider just those
belonging to the EU-15 countries1 (1,067 Regionen) und die Vereinigten Staaten (3,144 Regionen).
We restrict the analysis to the EU-15 countries because they have been part of the
European Union for the longest time, and thus have been more significantly involved in its
policies. The remaining EU countries have been in the Union only since 2004 oder später (d.h.,
no more than 17.1% of the total time span considered in this work), which to us appears to be
too little to include them in a study on EU R&D integration.2 In Figure 1 we plot the number of
patents by year from 1980 Zu 2014, considering patents including at least one U.S. inventor
and patents including at least one EU-15 inventor; in both cases the number is steadily
increasing.

1 Österreich (AT), Belgien (BE), Denmark (DK), Finland (FI), Frankreich (FR), Deutschland (DE), Greece (GR), Ireland
(IE), Italien (IT), Luxembourg (LU), Niederlande (NL), Portugal (PT), Spanien (ES), Schweden (SE), Großbritannien
(Vereinigtes Königreich).

2 We have also performed experiments considering the EU-28 instead of the EU-15, and the results do not

show remarkable differences with respect to those obtained using EU-15.

Quantitative Science Studies

897

The role of U.S. regions in promoting the integration of the European research system

Figur 1. Number of patents by year, aus 1980 Zu 2014.

These data are used to build the coinventor geographic network, referred to specific time
periods. In these networks nodes are constituted by NUTS3 regions, while the weight wij(T)
of the edge joining the nodes i and j in the network at time t is given by the number of
coinventions happened between the i and j regions in the time period t. In our work the
time period t will be represented by one of the intervals 1980–1989, 1990–1999, 2000–2009,
and 2010–2014, or by individual years. Tisch 2 shows the number of edges and the sum of
the weights in the networks related to 1980–1989, 1990–1999, and 2000–2009; we omit the
latest period (2010–2014) because it is shorter. Like the number of patents, the values of these
indicators are remarkably also growing with time.

3.2. Methoden

In this section we illustrate the main methods and techniques used to carry out the analyses.
Erste, the resistance distance is introduced (section 3.2.1). Dann, we describe our measures
of integration capability (section 3.2.2) and how we use null models to support our claims
(section 3.2.3). Endlich, changepoint detection is explained (section 3.2.4).

3.2.1. Resistance distance

The distance dij between two nodes i and j of the network, representing how difficult the in-
formation flow is between the corresponding regions, is measured as the resistance distance
(Klein & Randić, 1993), which is defined as the effective (electrical) resistance between the two
nodes when each edge is associated with a conductance equal to its weight. Let L be the
Laplacian matrix3 of the network and L+ its Moore–Penrose pseudoinverse. The resistance dis-
tance dij between nodes i and j is computed as follows (Bozzo & Franceschet, 2013):
þ
dij ¼ L
ii

þ
þ L
jj

− 2L

(1)

þ
ij

In der Praxis, to avoid infinite values for pairs of nodes belonging to disconnected components,
we work with the closeness cij, defined as the reciprocal of the resistance distance: cij = 1
dij

3 The Laplacian matrix of a network with adjacency matrix M is defined as D − M, where D is the diagonal

matrix whose (ich, ich) entry is the degree of the ith node (Goddard & Oellermann, 2011).

Quantitative Science Studies

898

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

Tisch 2. Number of edges and sum of the weights in the coinventor networks

Period
1980–1989

1990–1999

2000–2009

# edges
28,310

76,886

137,158

Sum of weights
236,904

848,988

2,013,473

We remark again the importance of evaluating the closeness between pairs of nodes with a
messen, like the inverse of the resistance distance, which takes into account multiple paths
joining the nodes, and not just first-order interactions. Erste, using a measure considering mul-
tiple paths joining the nodes recognizes that knowledge may also flow on the network in an
indirect, mediated way. Zweite, as we will explain in section 3.2.2, this allows us to measure
the contribution that nodes (d.h., Regionen) and sets of nodes (d.h., Länder) provide to the close-
ness of other nodes.

Note that a possible alternative closeness measure taking paths into account, as mentioned
in the introduction, is the inverse of the shortest-path distance, where the shortest-path dis-
tance between two nodes is the sum of the inverses of the weights of the edges lying on the
shortest path joining the two nodes. Jedoch, this measure is less suitable than the inverse of
the resistance distance in our scenario, because it considers only the shortest paths, neglecting
the fact that information may also flow on the network on other, longer paths.

3.2.2. Evaluation of the integration capability

The level of integration within the EU is assessed using the average cross border closeness (cid:1)C,
which is the average of the closenesses between all the pairs of regions belonging to different
EU countries:

(cid:1)c ¼

ich;jð

Þ:ich;j2 different EU countriescij
Þ : ich; J 2 different EU countries

ich; jð

(2)

Note that the value of (cid:1)c may be restricted to specific pairs of countries, thus measuring the
integration level between these pairs of countries.

To assess the contribution of a subset of the nodes of the network to the average cross-border
closeness (d.h., to the integration), we compute the percentage closeness loss that happens when
this subset is excluded from the network. We will exclude sets of nodes to evaluate the contri-
bution of the countries, and single nodes to evaluate the contribution of specific hubs (intended
as very relevant regions, characterized by many connections). The percentage closeness loss
associated with a region/country represents the integration capability of that region/country.

Let S be a subset of the nodes of the network. Typically, S may represent a country or a
single node. The quantity (cid:1)cS indicates the average cross-border closeness measured consider-
ing the cijs computed using paths involving the whole network, but averaged only on the re-
gions not included in S:

(cid:1)cS ¼

ich;jð

Þ:ich;j2= S∧i;j2different EU countriescij
Þ : ich; j 2= S∧i; J 2 different EU countries

ich; jð
J

(3)

Zum Beispiel, if the subset S contains the German nodes, then these nodes are not considered
in the averaging process, but the paths used to compute the cijs are allowed to pass through

Quantitative Science Studies

899

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

Deutschland; daher, we are evaluating the capability of Germany to connect regions belong-
ing to other countries.

The quantity (cid:1)cSj, in contrast, denotes the value obtained measuring (cid:1)c once the subset S has
been removed from the network, therefore excluding S from both the average and the paths
used in the computation of the cijs. (cid:1)cSj can be computed through Eq. (3), but determining the
cijs exploiting only the paths not transiting from the nodes in S. Zum Beispiel, if the subset S
again contains the German nodes, the German nodes are not considered in the average and
are not allowed to appear in the paths used in the computation of the closeness between the
pairs of nodes.

The percentage closeness loss pclS associated with subset S, representing the contribution
of subset S to European integration, is measured as the percentage of closeness that is lost
when S is removed from the network:

pclS ¼

(cid:1)cS − (cid:1)cSj
(cid:1)cS

(4)

Note that the numerator of Eq. (4) is always not negative.

Endlich, to analyze in greater depth the contribution of the United States to European inte-
gration also shortest paths are computed. The shortest path between two nodes s and t of a
network is the path between s and t such that the sum of the weights of the constituent edges is
minimized (Goddard & Oellermann, 2011). In more detail, we will consider the shortest paths
between EU nodes belonging to different countries, counting how many American nodes are
contained in these paths.

Notice that, in summary, the percentage closeness loss represents the ability of a set of
nodes to make other nodes of the network closer. daher, it is a measure for sets of nodes
that is related to two other traditional centrality measures defined instead for individual nodes:
betweenness and current-flow betweenness. The betweenness of a node is the number of
shortest paths crossing that node, while current-flow betweenness measures the extent to
which a node lies on paths between other nodes. Betweenness considers only the shortest
paths while current-flow betweenness takes into account all the paths, although longer paths
give a lesser contribution. Percentage closeness loss considers all the paths, not only the short-
est ones; daher, it is a measure referred to sets of nodes that is more similar to current-flow
betweenness.

3.2.3. Null models

To better appreciate the percentage closeness losses obtained on our networks we sometimes
compare them with those measured on null models, where the null model of a network is
another network obtained by keeping some elements constant and randomizing other ones.
In more detail, we will use three classes of null models:

(cid:129) Gravity-based null model (Expert, Evans, et al., 2011). This model is used to check whether
the detected patterns are simple effects of gravitylike forces, depending on spatial distance
and on a concept of mass. Let gij be the geographical distance between the regions asso-
ciated with nodes i and j, and Mi and Mj the masses of the nodes. The weight wNM
of the
edge (ich, J ) in the null model is defined as wNM
ich;jjgij ¼ g wij)/
P
ij
ich;jjgij ¼ g MiMj). The weight of the edge (ich, J ) grows with the mass of i and j, and with
(
the weights that in the real network are associated with the nodes geographically at the
same distance of i and j. This null model preserves neither the weights of the edges nor

= MiMjf( gij), with f( G) = (

Quantitative Science Studies

900

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

the strength of the nodes, but maintains the total weight of the network. In our framework
the spatial distances are continuous, so it is necessary to divide them into bins. We con-
sider the mass of a node as the total number of patents produced in the corresponding
region in the time frame of interest.

(cid:129) Null model where the edges within the United States are randomly reshuffled. We con-
sider two variants of this null model: one that does not preserve the strength of the
Knoten, and one that approximately preserves it (Rubinow & Spurns, 2011). Zum Beispiel,
if in the real network the edge between Boston and Los Angeles is 100, in the null model
this weight can be assigned to the edge between Portland and Memphis. In the first var-
iant all the U.S. nodes obtain approximately the same strength. Both the variants pre-
serve the weights of the edges, which are reshuffled. The two variants of this null model
are dubbed US-INT and US-INT-STR, jeweils.

(cid:129) Null model where the EU-US connections are randomly reshuffled. We consider two
variants of this null model: The first preserves the strength (of the EU-US connections)
just for the EU nodes, while the second (etwa) preserves it for both EU and U.S.
Knoten (Rubinow & Spurns, 2011). Zum Beispiel, if in the real network the edge between
Paris and Santa Clara has weight 200, in the null model this weight can be assigned to
the edge between Paris and Anchorage. Both variants preserve the weights of the EU-US
edges, which are reshuffled. The two variants of this null model are dubbed EU-US and
EU-US-STR, jeweils.

3.2.4. Changepoint detection

In order to better appreciate the yearly variations of the percentage closeness loss we use a
technique named changepoint detection (Killick, Fearnhead, & Eckley, 2012), which identifies
the time instants (changepoints) corresponding to abrupt changes in a function. Identifying the
changepoints splits the function in sections, and in particular we split the yearly percentage
closeness loss function where the regression line changes the most. This is achieved by finding
the sections of the function such that the sum of the residual errors of the regressions in each
section is minimized.

Let x1, … , xn be the points of the function that we are studying, and let SSx1,…,xi be the
residual error associated with the regression line approximating the function in the points
x1, … , xi. The changepoint detection procedure finds the time instants m1, … , mk minimizing
the following metric:

J ¼ SSx1;…;xm1−1

þ SSxm1

;…;xm2 −1

þ … þ SSxmk

;…;xn

(5)

Note that adding more changepoints keeps reducing the metric value. To cope with this prob-
lem, the procedure rejects further candidates when the decrease of the value of J provided by
the new candidate is lower than a given threshold. In this work the threshold has been set to
twice the variance of the function, meaning that we stop adding changepoints when the
subsequent new one would increase the R2 determination coefficient of the regression by
less than 2/n.

4. ERGEBNISSE

In this section we show the results of our analyses on the coinventorship network. We begin by
providing some preliminary statistics and pictures (section 4.1), and then we analyze the in-
tegration capability of countries (section 4.2) and of individual regions (section 4.3). Endlich,

Quantitative Science Studies

901

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

we study in more detail the impact of the United States on the closeness of the individual EU
countries to the other EU countries (section 4.4).

4.1. Descriptives

In this section we show some preliminary statistics and illustrations to provide an initial un-
derstanding of the structure of the coinventor network. Insbesondere, we want to highlight the
features of the national subnetworks and of the hubs.

Tisch 3 contains some statistics related to the national subnetworks in the 2000–2009 pe-
Riod. For each country we report the number of nodes, the number of nodes with 250 or more
patents, and the average closeness (computed as the inverse of the resistance distance) Sei-
tween the pairs of nodes associated with 250 or more patents. We have computed this average
considering only the nodes associated with a certain number of patents because the other ones
are not likely to appear in effective paths joining EU regions belonging to different countries.
The average closeness between U.S. nodes is 78% greater than that of the EU (see the last two
lines of the table).

We can also see the greater number of connections existing in the U.S. subnetwork with
respect to the EU ones pictorially in Figure 2, where the network contains a node for each

Tisch 3. Statistics related to the national subnetworks in the 2000–2009 period

Country
Österreich

Belgien

Denmark

Finland

Frankreich

Deutschland

Greece

Ireland

Italien

Luxembourg

Niederlande

Portugal

Spanien

Schweden

Vereinigtes Königreich

Vereinigte Staaten

# Knoten
35

# nodes ≥ 250 patents
13

Average closeness between
nodes ≥ 250 patents
328.29

101

402

110

139

3,144

1,067

161

171

367

967.63

1,063.58

1,152.65

1,049.89

1,611.08

341.95

418.95

878.54

249.71

525.56

597.95

1,953.46

1,100.22

Quantitative Science Studies

902

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

The role of U.S. regions in promoting the integration of the European research system

Figur 2.
weights of the edges in the national subnetworks, while the edge thickness is proportional to the number of coinventions.

EU-US coinventorship network. Nodes are EU countries (grau) und wir. Staaten (Blau). The node size is proportional to the sum of the

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

country, but we have decided to decompose the United States into the component states; nur
the edges representing at least 500 coinventions are included. The edge thickness is propor-
tional to the number of coinventions and the size of the nodes is proportional to the sum of the
weights in the national subnetwork. The figure allows us to appreciate the links between the
UNS. Staaten, whereas the EU countries are less connected.

Endlich, Figur 3 represents the shortest paths connecting EU nodes belonging to different
countries in the 2000–2009 period. The figure shows the edges that are part of at least one
shortest path joining two regions belonging to different EU countries. The edge thickness
grows with the edge weight, while the node size grows with the betweenness, computed con-
sidering only the shortest paths joining EU nodes belonging to different countries. The figure is
quite difficult to read due to the size of the network, but in any case allows us to understand
Das, even when considering only the paths joining different EU regions, there emerge a sig-
nificant share of relevant hubs that are within the United States and not in the EU itself.

Summarizing, this preliminary analysis suggests that the U.S. subnetwork contains many
nodes associated with a relevant number of patents, and that these nodes are more connected
between each other than happens in the European national subnetworks. So it seems plausible
that the U.S. subnetwork as a whole might provide a faster, though indirect, connection be-
tween EU regions. Darüber hinaus, we can note that there are several nodes from both the EU and
the United States that are crossed by many shortest paths joining EU nodes belonging to dif-
ferent countries. Wieder, it seems worthwhile to investigate the relative importance of U.S. Und
EU hubs in making the EU R&D system more integrated.

4.2. The Integration Capability of the Countries

In this section we will use the procedure described in section 3.2 to show which countries
(among the EU countries and the United States) have the greatest integration capability; Das
Ist, contribute the most to increasing the closeness between the EU regions belonging to differ-
ent countries.

To measure the integration capability of a country, the formulas of section 3.2 are applied
considering the subset S to be excluded as the set of the nodes belonging to that country. In

Quantitative Science Studies

903

The role of U.S. regions in promoting the integration of the European research system

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Figur 3. EU-US network representing the shortest paths between EU regions belonging to different countries. EU regions are grey, while U.S.
regions are blue. The node size is proportional to the betweenness computed considering only the shortest paths joining EU nodes belonging
to different countries, while the edge thickness is proportional to the edge weight.

this way the value resulting from Eq. (4) gives the percentage closeness loss due to the removal
of the country; the greater this percentage, the greater the integration capability of the country.

Tisch 4 shows the integration capability for each EU country and for the United States. Der
analyzed years are divided into four periods: 1980–1989, 1990–1999, 2000–2009, and 2010–2014.
The main feature that emerges from the table is that Germany and the United States are by far
the countries with the greatest capability to connect the EU countries. Notice that the United
States exhibits a large contribution to European R&D integration, greater than that shown by the
European countries themselves. On the one hand this is due to the larger population—the
population of the United States is, zum Beispiel, almost five times that of France—providing
more possibilities to establish collaborations, but on the other hand it indicates that the
United States plays a fundamental role in the European R&D system. In der Tat, a large pop-
ulation alone is not enough to develop joint R&D projects.

We want now to understand whether the measured integration capabilities are just the re-
sult of simple gravitylike forces (d.h., mass and distance effects), or are due to more complex

Quantitative Science Studies

904

The role of U.S. regions in promoting the integration of the European research system

Tisch 4. Percentage of the average cross-border closeness within the EU that is lost by excluding the
United States or the individual EU countries from the coinventor network

1980–1989
0.6452

1990–1999
0.4743

2000–2009
0.6208

2010–2014
0.7867

Country
Österreich

Belgien

Denmark

Finland

Frankreich

1.2581

0.4091

0.1573

2.8053

Deutschland

11.0070

Greece

Ireland

Italien

Luxembourg

Niederlande

Portugal

Spanien

Schweden

Vereinigtes Königreich

USA

0.0292

0.0808

0.7154

0.2478

1.726

0.0207

0.0896

1.0132

3.2244

7.4903

1.6850

0.4994

0.5247

3.1101

1.6606

0.5785

0.4984

3.3041

10.3100

10.5810

1.4251

0.4519

0.3668

2.8239

9.3136

0.0314

0.1452

0.8928

0.2378

1.5439

0.0304

0.3417

0.8602

2.7337

0.0882

0.2163

0.9290

0.2688

1.4793

0.0874

0.7870

0.9544

2.6222

10.4740

10.7590

0.0639

0.2317

1.068

0.2895

1.2793

0.1040

0.8800

1.0464

2.3039

9.9570

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Dynamik (z.B., different propensity to long-distance collaborations). To this aim, the gravity
null model, introduced in section 3.2, is employed. The integration capabilities observed using
the gravity model are in Table 5, reported as differences with respect to the values observed
in the real network, while Figure 4 provides a pictorial representation of the comparison be-
tween the real network and the gravity null model referred to the 2000–2009 period. We note
that when the gravity model is used, almost all the European countries show an integration
capability that is greater than that observed in the real network (d.h., they contribute to
European integration less than is expected due to simple mass-distance effects); zum Beispiel,
Germany would be 2.5 times more important if R&D collaborations only depended on mass
and distance. Im Gegensatz, the United States exhibits an integration capability that is greater than
the amount due to gravity.

In order to further investigate the role of the United States in the EU R&D system and try to
understand the nature of the connections linking Europe and the United States, we make use of
a more straightforward measure of distance along the network (d.h., shortest paths) and we
employ null models disrupting some portions of the network (internal U.S. network and EU-
US connections) to assess their relative importance.

In the first place, we have determined all the shortest paths joining EU regions belonging
to different countries, and classified them on the basis of the number of U.S. regions that are
enthalten. The second column of Table 6 reports the shortest paths statistics related to the real
network in the 2000–2009 period (for brevity, the other periods follow a similar pattern). UNS.

Quantitative Science Studies

905

The role of U.S. regions in promoting the integration of the European research system

Tisch 5. Percentage of the average cross-border closeness within the EU that is lost by excluding the
United States or the individual EU countries from the coinventor network on the gravity null model.
The results are reported as differences with respect to the real coinventor network

Country
Österreich

Belgien

Denmark

Finland

Frankreich

1980–1989
+0.4191

1990–1999
+0.6404

2000–2009
+0.5965

2010–2014
+0.8478

+0.1100

+0.0951

+0.4314

+5.4822

+0.2641

+0.3354

+0.7692

+4.9078

+0.2400

+0.4880

+0.9484

+3.9589

+0.4878

+0.5462

+0.8673

+5.1109

Deutschland

+16.7832

+17.8759

+16.2197

+15.9801

Greece

Ireland

Italien

Luxembourg

Niederlande

Portugal

Spanien

Schweden

Vereinigtes Königreich

USA

+0.0130

+0.0042

+2.0169

+0.0022

+4.3309

−0.0087

+0.1805

+1.4089

+3.0774

−3.4335

+0.0341

+0.0379

+2.4311

−0.0179

+4.5983

+0.0063

+0.1970

+1.4092

+2.2613

−3.1522

+0.0036

+0.1101

+2.3147

+0.1055

+5.9839

−0.0167

+0.0560

+1.5709

+1.1042

−2.4052

+0.0140

+0.1464

+1.8654

+0.2347

+5.1791

−0.0204

+0.1986

+1.5873

+0.9792

−2.0788

D
Ö
w
N
Ö
A
D
e
D

F
R
Ö
M
H

T
T

:
/
/

D
ich
R
e
C
T
.

ich
T
.

e
D
u
Q
S
S
/
A
R
T
ich
C
e
–
P
D

F
/

1
2
8
9
4
1
8
8
5
7
7
4
Q
S
S
_
A
_
0
0
0
3
4
P
D

B
j
G
u
e
S
T

Ö
N
0
7
S
e
P
e
M
B
e
R
2
0
2
3

Figur 4. Regular and thematic map of the countries under study (EU-15 + US). In the latter case,
the deformation and the color code represent the ratio between the percentage loss in the average
closeness within the EU when links are rewired according to a gravity-law null model (see text for
Einzelheiten) over observations in the 2000–2009 period. Zum Beispiel, countries in white have less im-
portance in the null model case with respect to the observed one.

Quantitative Science Studies

906

The role of U.S. regions in promoting the integration of the European research system

Tisch 6. Percentage of shortest paths between EU regions belonging to different countries, including U.S. Regionen, classified by number of U.S.
regions included in the 2000–2009 period, in the real network and null models. The data values related to the null models are expressed as
differences with respect to the real network. The double asterisks (**) indicate that all the differences, except one, are statistically significant
at p < 0.01 % sp ≥1 USA node % sp 1 USA node % sp 2 USA nodes % sp 3 USA nodes % sp >3 USA nodes

Real network
57.7635

5.0215

18.9355

9.7906

24.0159

US-INT
−24.3377**

+22.5199**

−13.3102**

−9.5757**

−23.9717**

US-INT-STR
+6.7447**

−2.4165**

−11.9037**

+11.9008**

+9.1641**

EU-US
−52.4168**

−2.9377**

−18.6610**

−9.4098**

−21.4084**

EU-US-STR
+2.0894**

+15.4171**

+1.7815**

+0.5475

−15.6567**

regions participate in more than half of the intra-EU cross-border shortest paths, thus confirming
the importance of the United States in the EU R&D system. Interessant, as we sensed from
Figur 2, many shortest paths include several U.S. Knoten (24% of the shortest paths include
more than three U.S. Knoten), and this confirms that since the U.S. subnetwork contains many
internal connections, the most convenient way to link two EU regions is often to move to the U.S.
subnetwork, cover “cheap” paths inside this subnetwork, and find the most appropriate node to exit.

Let us consider the two classes of nongravity null models introduced in section 3.2: Der erste
class randomizes the connections inside the United States, while the second one randomizes the
connections between EU and U.S. Knoten. Daher, the first class allows us to evaluate the relevance
to the EU integration of the connections internal to the United States, while the latter permits us to
assess the importance of the EU-U.S. connections. Columns 3 Zu 6 of Table 6 contain the results
of the shortest path analysis for the null models (2000–2009 period), while Table 7 zeigt die
percentage closeness loss for the null models. All the values reported for the null models are
obtained by repeating the random generation of the models 100 times and then averaging the
measurements. The tables indicate the differences with respect to the results obtained with the
real network. We have performed a t-test to evaluate the statistical significance of the differ-
zen; all the values in Tables 6 Und 7 are statistically significant with p-values << 0.01, except for one value in Table 6. Regarding the class of null models reshuffling the U.S. internal connections, the first variant preserves just the weights of the network, while the second one also maintains the strengths of the nodes (i.e., it preserves the hubs within the U.S. subnetwork). First, we note that the U.S. integration capability in terms of resistance distance remains almost constant in both variants. Table 7. Percentage of the average cross-border closeness within the EU that is lost by excluding the United States from the coinventor network on the null models, expressed as difference with respect to the values observed on the real network. The double asterisks (**) indicate that all the differences are statistically significant at p < 0.01 1980–1989 1990–1999 2000–2009 2010–2014 US-INT −0.4737** −0.4150** −0.1736** −0.2280** US-INT-STR −0.0065** −0.0034** −0.0201** −0.0219** EU-US −3.2818** −2.8326** −1.9164** −2.4482** EU-US-STR −0.3502** −0.7904** −0.8137** −0.8394** 907 Quantitative Science Studies l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system This happens because the U.S. subnetwork has many nodes and edges, and therefore reshuf- fling the connections leaves, in both cases, good paths between the pairs of U.S. nodes; this confirms that the great number of connections in the U.S. subnetwork helps join the EU regions. However, the second null model exhibits a performance that is more similar to that of the real case, thus suggesting that the presence of strong U.S. hubs facilitating the links is also important. When we analyze the effect of null models on shortest paths, which are more sensitive than the resistance distance to changes in the network, these considerations are reinforced: On the first null model, the number of shortest paths transiting from the United States falls with respect to the real situation, while on the second null model it even grows. This behavior seems to indi- cate again the importance of the hubs, confirming the intuitive evidence of Figure 3. The growth of the number of shortest paths with U.S. nodes in the second null model is probably due to the fact that once the hubs are preserved, a more balanced distribution of the weights to the edges helps find better paths. The relevance of the U.S. hubs is confirmed also by the last class of null models: those reshuf- fling the EU-US connections. The first null model of this class does the reshuffling by preserving the strength of the transatlantic connections just for the EU nodes, while the second one preserves this strength also for the U.S. nodes. The second null model behaves similarly to the real situa- tion, while in the first one the U.S. contribution to EU integration decreases; this suggests that it is not enough to connect to the U.S. network, it must be done through the right access points. Finally, we conduct a finer-grained temporal analysis to shed further light on the variations of the U.S. contribution that emerged in the four decades by using yearly networks. We want to assess whether the U.S. integration capability has evolved over time following a steady trend, or the tendency has changed through time. To do this, we employ changepoint detec- tion analysis. In brief, this method allows us to retrieve the optimal set of linear slope change- points to model the observed data (see section 3.2.4 for details), thus discovering the possible changes in the trend of the magnitude of the U.S. integration capability. We apply the method to the pattern of yearly percentage closeness loss in the EU network due to collaborations with the United States, with the aim of identifying the years in which the trend of growth or decrease of the U.S. contribution to the EU integration has changed significantly. We have considered the years from 1981 to 2014, omitting 1980, which is associated with few data. The resulting plot is in Figure 5(a). The changepoint associated with the greatest reduction of the residual error of the regression is detected in 1997 (highlighted in red in Figure 5); then, two more changepoints are identified in 1983 and 1987. Interestingly, before 1997 the U.S. contribution shows, globally, a positive trend, while after 1997 there is a long period with a clearly negative trend. Figure 5(b) shows the R2 and p-values of the discovered regressions; note that the last two regressions, which are those of greatest interest, are significant at p < 0.05. It can be seen that the latter result is consistent with the evidence shown by Chessa et al. (2013), who have highlighted that EU integration, in the same case of coinventorship, has ex- perienced growth starting in the years before 2000. Also, they find that the integration level has subsequently stabilized. Our changepoint detection analysis identifies a clear inversion of the tendency in the U.S. contribution to EU integration in the same period, which then started to decrease. Therefore, the growth of the EU integration level found by Chessa et al. (2013) seems to be reflected in a progressive emancipation of the EU from the U.S. R&D system. In this respect, we point out the possible role of EU policies, characterized by increasing financing of R&D programs, fostering intra-EU collaborations. Summarizing this section, we find that Germany and the United States provide the highest contribution to connect the EU countries. In particular, the United States has a more significant Quantitative Science Studies 908 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 5. (a) Yearly percentage closeness loss due to the United States, between 1981 and 2014. The dashed vertical lines indicate the identified changepoints, while the regression lines in the sections delimited by the changepoints are in solid black. (b) R2 and p-values of the discovered regressions. impact than most European countries. Our analyses indicate that two important factors that make the United States able to help connect the EU regions are represented by the high num- ber of links in the internal U.S. network and the presence of U.S. hubs: To connect two EU nodes it is enough that these two nodes are close to two distinct American nodes, which may then usually be easily linked through a path within the U.S. subnetwork especially, due to the help of effective internal hubs. Finally, we observe that the U.S. contribution to EU integration seems to have been decreasing since 1997. 4.3. The Integration Capability of the Hubs In this section we appraise the integration capability of individual hubs. Studying individual nodes is interesting, because they are much more similar in terms of population than the coun- tries, thus leading to less biased analysis results. Quantitative Science Studies 909 The role of U.S. regions in promoting the integration of the European research system Table 8. Percentage of the average cross-border closeness that is lost by excluding the 10 EU and 10 U.S. (italicized) main hubs from the coinventor network (closenesses between pairs of nodes including a node in the same country of the hub not considered in the computation) Region (a) 1980–1989 Integration capability Aachen Berlin Munich Biberach Wuppertal Houston, TX San Jose, CA Milan Stockholm Mettmann Vienna Paris Cambridge, MA Oakland, CA San Mateo, CA San Diego, CA Chicago, IL Elizabeth, NJ San Francisco, CA White Plains, NY (c) 2000–2009 Cambridge, MA Munich San Jose, CA Berlin Helsinki San Diego, CA Aachen 0.6701 0.5640 0.4052 0.3431 0.3106 0.3037 0.2956 0.2743 0.2620 0.2542 0.2295 0.2191 0.2152 0.1549 0.1545 0.1427 0.1337 0.1256 0.1245 0.1159 0.4612 0.4480 0.3727 0.3527 0.3179 0.3155 Region (b) 1990–1999 Rockville, MD Cambridge, MA Munich San Diego, CA Paris Cincinnati, OH San Jose, CA Berlin Rotterdam Bad Homburg Stockholm Hamilton, OH Helsinki Norristown, PA Houston, TX Milan Cambridge Nanterre Raleigh, NC San Mateo, CA (d) 2010–2014 Cambridge, MA San Jose, CA Munich Aachen Helsinki Stockholm l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Integration capability 0.3738 0.3360 0.3272 0.3133 0.3038 0.2872 0.2758 0.2753 0.2691 0.2436 0.2364 0.2287 0.2221 0.2211 0.2163 0.2095 0.2047 0.2036 0.1516 0.1424 0.4432 0.4220 0.3957 0.3648 0.3193 0.3090 0.2980 0.31316 Frankfurt Quantitative Science Studies 910 The role of U.S. regions in promoting the integration of the European research system Table 8. (continued ) Region Stockholm Madrid Houston, TX Paris Brussels Barcelona Lyon San Mateo, CA Oakland, CA Cincinnati, OH Norristown, PA Raleigh, NC Chicago, IL Integration capability 0.3023 Region Berlin Integration capability 0.2919 0.2828 0.2375 0.2332 0.2219 0.2187 0.2167 0.2159 0.1735 0.1597 0.1573 0.1572 0.1371 San Diego, CA Malmo Barcelona Lyon Houston, TX San Mateo, CA Paris Oakland, CA Raleigh, NC Midland, OH Cincinnati, OH Norristown, NJ 0.2805 0.2765 0.2658 0.2648 0.2636 0.2432 0.2326 0.1832 0.1713 0.1696 0.1528 0.1228 It must be noticed that the integration capability of a node may derive from two different factors: the ability to connect foreign regions, and the ability to connect regions of the same country with the outside. The American hubs can benefit only from the first factor, since we are considering just cross-border EU links. To evaluate the integration capability of an individual region, the procedures of section 3.2 are applied considering this region as the subset S to be excluded in Eqs. (3)–(4). Eqs. (3)–(4) actually result in evaluating both the ability to connect foreign regions and the ability to connect regions of the same country to the outside. We are also interested in evaluating the first factor alone, and to this end we consider, in the numerator and denominator of Eq. (3), only the pairs of regions not belonging to the same country of the hub. We begin by conducting a comparison between EU and U.S. hubs in terms of the ability to connect foreign regions, and then we an- alyze the European hubs considering also their ability to connect regions of the same country to the outside. We have considered for each analyzed time period an initial set of nodes with 30 EU regions and 30 U.S. regions chosen as those with the greatest current-flow betweenness, where the current-flow betweenness has been computed considering only the paths joining regions belonging to different EU countries. Table 8 shows for the four time periods mentioned above the percentage closeness loss for the EU and U.S. main hubs considering only the ability to connect foreign regions, while Table 9 repeats the evaluation only for the EU hubs appraising also the ability to connect nodes of the same country of the hub to the outside. The main evidence that arises from Table 8 is that the effect of the U.S. hubs is comparable to that of EU hubs, and even stronger in the period 1990–1999. This further supports our pre- vious considerations regarding the importance of the United States in the EU R&D system: The strong American hubs may act as entry and exit points in the U.S. subnetwork, and then also facilitate the connections inside the subnetwork. Quantitative Science Studies 911 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system Table 9. Percentage of the average cross-border closeness within EU that is lost by excluding the 10 EU main hubs from the coinventor network Region (a) 1980–1989 Milan Munich Paris Vienna Nanterre Outer London West Versailles Stockholm Lyon Kingston (c) 2000–2009 Milan Helsinki Berlin Munich Paris Stockholm Vienna Lyon Madrid Cambridge Integration capability Region Integration capability (b) 1990–1999 3.7978 1.7139 1.6977 1.3557 1.1936 1.1713 1.1591 1.0759 1.0603 0.9804 1.6807 1.3161 1.2280 1.2156 1.0897 1.0891 0.9037 0.8546 0.8383 0.8237 Milan Paris Munich Berlin Helsinki Stockholm Nanterre Lyon Versailles Vienna (d) 2010–2014 Helsinki Milan Stockholm Berlin Munich Paris Lyon Barcelona Grenoble Vienna 2.4952 1.2713 1.2369 1.1325 1.1146 1.0069 0.9880 0.9754 0.9564 0.8853 1.3885 1.3720 1.2719 1.2344 1.1829 1.0439 0.9194 0.9027 0.8606 0.7824 The most recurrent European hub is Munich, while other important regions are Berlin and Aachen. Regarding the United States, the main hub seems to be Cambridge, MA, with an im- portant role played by San Jose, CA; a very relevant integration capability is shown also by Houston, TX in 1980–1989 and Rockville, MD in 1990–1999. As a further insight about the U.S. hubs, we can analyze their main IPC patent classes. The most frequent class is Medical/ Veterinary for Cambridge and Rockville, Computing for San Jose and Drilling/ Mining for Houston. Therefore, with the exception of Houston in 1980–1989, it appears that the integration capability of the U.S. has been driven by regions focused on ICT and life science fields. Table 9, instead, takes into account also the ability to connect regions of the same country with the outside. In this table new regions emerge, for instance Milan and Vienna. These Quantitative Science Studies 912 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system Table 10. For each EU country, percentage of the average cross-border closeness toward the other EU countries that is lost by excluding the United States from the coinventor network Country Austria Belgium Denmark Finland France Germany Greece Ireland Italy Luxembourg Netherlands Portugal Spain Sweden UK 1980–1989 4.4958 1990–1999 5.7493 2000–2009 6.2671 2010–2014 6.0019 7.2856 10.7555 11.6032 6.6648 7.0774 26.3186 21.9114 6.0771 21.4734 6.2872 5.4839 15.2605 12.6027 8.1891 11.9208 16.5860 13.1692 8.4478 9.5578 19.8067 30.0243 9.4906 24.4928 10.5942 8.8667 16.9715 12.4948 11.7321 10.2429 15.8985 9.2353 8.9996 9.9859 10.3177 27.4674 8.5954 17.9527 11.1229 9.4384 10.3864 10.5493 15.0948 9.1578 13.2177 7.1949 8.0087 9.1361 14.1905 28.0969 7.9810 14.9668 11.3276 6.2752 9.4545 10.9270 14.4985 regions play an important role in their national subnetworks. It is interesting to note that the contribution of Milan has diminished over time, suggesting that the other Italian regions may have become more capable to connect autonomously with foreign EU countries. 4.4. Impact of the United States on the Closeness of the Individual EU Countries to the Other EU Countries The previous section has highlighted the very relevant role of the United States in strengthening the connections between European countries. In this section we look further into this issue by trying to understand which European countries most need the United States to become close to the other ones. In order to evaluate the impact of the United States on the closeness of a specific EU coun- try to the other EU countries we evaluate again the percentage closeness loss. In this case Eq. (3) must consider only the pairs of nodes involving a region of the country of interest, and the subset S of the network to be excluded is represented by the American nodes. In this way Eq. (4) results in the contribution of the United States to the closeness of the country under analysis to the other EU countries. Table 10 shows the numerical results in the usual four time periods, while Figure 6 gives a graphical intuition of the proportion of the U.S. contribution to the different countries. It is possible to notice that the countries benefiting the most from U.S. collaborations are the smal- lest ones: Actually, these countries, due to their size, need more external collaborations to Quantitative Science Studies 913 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . Figure 6. Graphical representation of the results in Table 10. The four columns over each country show the percentage closeness loss due to the United States for that country, in the four time periods: 1980–1989 (blue), 1990–1999 (green), 2000–2009 (yellow), 2010–2014 (purple). The number indicated above each country represents the highest value measured in the four periods. In the figure, ‘a’ indicates Belgium, ‘b’ indicates Luxembourg and ‘c’ indicates The Netherlands. / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 Figure 7. Heatmaps showing the closenesses between the EU regions with and without the United States, in the 2000–2009 period. Figure 7 (a) depicts the starting situation with the absolute closenesses measured on the network including only the EU; the absolute closeness values reported in the legend are the inverses of the resistance distance, as explained in section 3.2. Figure 7(b) represents the percentage closeness loss between pairs of EU regions due to the United States; the values in the legend in this case are therefore percentages. White indicates low values, while dark red indicates high values. Quantitative Science Studies 914 The role of U.S. regions in promoting the integration of the European research system carry out R&D projects. This confirms the findings of Waltman et al. (2011), according to which peripheral countries are more prone to start long-distance collaborations. An exception is represented by the UK, but such an exception was expected given the well-known strong relationship of this country with the United States. Figure 7 deepens the results shown in Table 10 and Figure 6, by proposing a heatmap of the percentage closeness loss due to the United States between pairs of EU regions. In more detail, the heatmap showing the percentage closeness loss is in Figure 7(b), while for the sake of completeness Figure 7(a) reports the starting situation, with the absolute closenesses computed on the network including only the EU. As in Table 10, it can be noticed that the most periph- eral countries benefit most from the U.S. contribution; in addition, Figure 7(b) underlines that the connections between pairs of peripheral countries are the ones helped most. Another pe- culiarity highlighted by Figure 7(b) is that the diagonal of the heatmap is very light, meaning that the connections between pairs of regions within the same EU country do not need the mediation of the U.S. to be established. Finally, note that Germany, which provides a very relevant contribution to EU integration (see the previous sections), does not seem to benefit that much from the help of the United States to become connected to the other EU countries. This may be due to the fact that Germany, as is clear from Figure 7(a), has a high closeness to the other countries also on the network containing only the EU countries, and so does not need the U.S. to create connections. 5. CONCLUSIONS In this work we have studied the patent coinventor network and used indirect distance mea- sures to investigate the contribution of individual countries and regions to European R&D in- tegration, that is, their integration capability. The analysis has been carried out on a network encompassing both the EU and United States, in order to ascertain also possible contributions from the United States to European integration. After having proposed some descriptive statistics and pictures for the coinventor network, we have analyzed the contribution to EU integration provided by countries and regions by computing the amount of network-based closeness between European regions belonging to different countries that is lost when specific subsets of the nodes of the network are removed. We can summarize the main conclusions of this work as follows: (cid:129) The countries that contribute most to connecting regions across EU countries are Germany and the United States. In particular, the United States proves to be more rel- evant in joining EU countries than most of the EU countries themselves. The integration capability of the United States is more than the country would have had if the collab- orations were driven exclusively by gravitylike effects, while the integration capability of almost all the EU countries is lower than that due to gravity. Moreover, our analyses indicate that an important factor that makes the United States able to foster the connec- tion of EU regions is represented by the high number of links in the U.S. subnetwork: To connect two EU nodes on the coinventor network it is enough that these two nodes are close to two distinct American hubs, which may then usually be easily linked through a “fast” path within the United States. Also, the connections within the U.S. subnetwork are facilitated by the presence of strong internal hubs. In addition, the U.S. contribution to EU integration seems to have been decreasing since 1997, in conjunction with re- newed efforts by the EU in support of European technological collaborations. Quantitative Science Studies 915 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d . / f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system (cid:129) There are strong regional hubs in terms of integration capability in both Europe and the United States. Some European hubs, especially German ones, are able to connect re- gions of foreign countries. Other European hubs, the most notable example being Milan, have a remarkable effect on the average cross-border closeness, but their contri- bution is especially in the connection of nodes of their same country with the outside. (cid:129) The role of the United States in promoting integration with other EU countries is stronger for the smallest EU countries, probably because these countries, due to their size, need more external collaborations to carry out R&D projects. A first natural development of our work consists in using the techniques introduced in this paper to analyze the role that other external countries besides the United States, such as Japan, play in EU integration. Moreover, in the introduction we claimed that patents are more related to development activities than to research activities, and thus our coinventorship network is biased toward the flow of technological knowledge. It would be interesting to delve into the more scientific part of knowledge flow, building a collaboration network using scientific publication data. It would be interesting also to repeat the study using the regional coassignment network, which shows the connections between the regions where the headquarters of companies and institutions are located. This alternative analysis might highlight different trends, and compar- ing these results with those obtained on the coinventorship network—which more naturally describes the relationships of knowledge exchange between regions—might lead to further intriguing insight. In addition, the coassignment network might also be used to investigate the contribution of different institutional types (e.g., companies vs. public research organiza- tions) to EU R&D integration. Finally, another possible extension to our study regards the evaluation of the integration capability of a region through a mathematical model with weights to be learned from data; such weights would allow us to understand the relevance of various effects (e.g., the size of the nodes), to the integration capability. AUTHOR CONTRIBUTIONS Emanuele Rabosio: Data curation, Formal analysis, Investigation, Methodology, Software, Visualization, Writing—original draft, Writing—review & editing. Lorenzo Righetto: Formal anal- ysis, Methodology, Visualization, Writing—review & Editing. Alessandro Spelta: Methodology, Writing—Review & editing. Fabio Pammolli: Conceptualization, Methodology, Supervision, Writing—review & editing. COMPETING INTERESTS The authors have no competing interests. FUNDING INFORMATION No funding has been received for this research. DATA AVAILABILITY This research work was carried out using the Regpat data set (March 2018 version), publicly available from OECD. Quantitative Science Studies 916 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 The role of U.S. regions in promoting the integration of the European research system REFERENCES Arora, A., Belenzon, S., & Patacconi, A. (2018). The decline of science in corporate R&D. Strategic Management Journal, 39(1), 3–32. https://doi.org/10.1002/smj.2693 Arora, A., Belenzon, S., Patacconi, A., & Suh, J. (2019). The chang- ing structure of American innovation: Some cautionary remarks for economic growth ( Working Paper No. 25893). National Bureau of Economic Research. https://doi.org/10.3386/w25893 Arora, A., Fosfuri, A., & Gambardella, A. (2004). Markets for tech- nology: The economics of innovation and corporate strategy. Cambridge, MA: MIT Press. Bozzo, E., & Franceschet, M. (2013). Resistance distance, closeness, and betweenness. Social Networks, 35(3), 460–469. https://doi. org/10.1016/j.socnet.2013.05.003 Brun, J.-F., Carrère, C., Guillaumont, P., & de Melo, J. (2005). Has distance died? Evidence from a panel gravity model. World Bank Economic Review, 19(1), 99–120. https://doi.org/10.1093/wber/lhi004 Cappelli, R., & Montobbio, F. (2016). European integration and knowledge flows across European regions. Regional Studies, 50(4), 709–727. https://doi.org/10.1080/00343404.2014.931572 Chessa, A., Morescalchi, A., Pammolli, F., Penner, O., Petersen, A. M., & Riccaboni, M. (2013). Is Europe evolving toward an in- tegrated research area? Science, 339(6120), 650–651. https://doi. org/10.1126/science.1227970 Doria Arrieta, O. A., Pammolli, F., & Petersen, A. M. (2017). Quan- tifying the negative impact of brain drain on the integration of European science. Science Advances, 3(4). https://doi.org/ 10.1126/sciadv.1602232 Expert, P., Evans, T. S., Blondel, V. D., & Lambiotte, R. (2011). Uncovering space-independent communities in spatial networks. Proceedings of the National Academy of Sciences of the United States of America, 108(19), 7663–7668. https://doi.org/10.1073/ pnas.1018962108 Frenken, K., Hoekman, J., Kok, S., Ponds, R., van Oort, F., & van Vliet, J. (2009). Death of distance in science? A gravity approach to re- search collaboration. In A. Pyka & A. Scharnhorst (Eds.), Inno- vation Networks: New Approaches in Modelling and Analyzing, pp. 43–57. Berlin, Heidelberg: Springer. https://doi.org/10.1007/ 978-3-540-92267-4_3 Goddard, W., & Oellermann, O. R. (2011). Distance in graphs. In Structural Analysis of Complex Networks, pp. 49–72. Boston, MA: Birkhäuser. https://doi.org/10.1007/978-0-8176-4789-6_3 Hoekman, J., Frenken, K., & Tijssen, R. J. (2010). Research collabo- ration at a distance: Changing spatial patterns of scientific collab- oration within Europe. Research Policy, 39(5), 662–673. https:// doi.org/10.1016/j.respol.2010.01.012 Killick, R., Fearnhead, P., & Eckley, I. A. (2012). Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 107(500), 1590–1598. https:// doi.org/10.1080/01621459.2012.737745 Klein, D. J., & Randić, M. (1993). Resistance distance. Journal of Mathematical Chemistry, 12(1), 81–95. https://doi.org/10.1007/ BF01164627 Maisonobe, M., Eckert, D., Grossetti, M., Jégou, L., & Milard, B. (2016). The world network of scientific collaborations between cities: Domestic or international dynamics? Journal of Informetrics, 10(4), 1025–1036. https://doi.org/10.1016/j. joi.2016.06.002 Miguelez, E., & Moreno, R. (2013). Do labour mobility and techno- logical collaborations foster geographical knowledge diffusion? The case of European regions. Growth and Change, 44(2), 321–354. https://doi.org/10.1111/grow.12008 Morescalchi, A., Pammolli, F., Penner, O., Petersen, A. M., & Riccaboni, M. (2015). The evolution of networks of innovators within and across borders: Evidence from patent data. Research Policy, 44(3), 651–658. https://doi.org/10.1016/j.respol. 2014.10.015 Nedeva, M., & Stampfer, M. (2012). From “science in Europe” to “European science”. Science, 336(6084), 982–983. https://doi. org/10.1126/science.1216878 Orsenigo, L., Pammolli, F., & Riccaboni, M. (2001). Technological change and network dynamics: Lessons from the pharmaceutical industry. Research Policy, 30(3), 485–508. https://doi.org/ 10.1016/S0048-7333(00)00094-9 Owen-Smith, J., Riccaboni, M., Pammolli, F., & Powell, W. W. (2002). A comparison of U.S. and European university-industry re- lations in the life sciences. Management Science, 48(1), 24–43. https://doi.org/10.1287/mnsc.48.1.24.14275 Ponds, R. (2009). The limits to internationalization of scientific research collaboration. Journal of Technology Transfer, 34(1), 76–94. https://doi.org/10.1007/s10961-008-9083-1 Powell, W. W., & Grodal, S. (2005). Networks of innovators. In The Oxford Handbook of Innovators, pp. 56–85. https://doi.org/ 10.1093/oxfordhb/9780199286805.003.0003 Rubinov, M., & Sporns, O. (2011). Weight-conserving characteriza- tion of complex functional brain networks. NeuroImage, 56(4), 2068–2079. https://doi.org/10.1016/j.neuroimage.2011.03.069 Scherngell, T., & Barber, M. J. (2011). Distinct spatial characteris- tics of industrial and public research collaborations: Evidence from the fifth EU framework programme. Annals of Regional Science, 46(2), 247–266. https://doi.org/10.1007/s00168-009- 0334-3 Singh, J., & Marx, M. (2013). Geographic constraints on knowledge spillovers: Political borders vs. spatial proximity. Management Science, 59(9), 2056–2078. https://doi.org/10.1287/mnsc.1120.1700 Stephenson, K., & Zelen, M. (1989). Rethinking centrality: Methods and examples. Social Networks, 11(1), 1–37. https://doi.org/ 10.1016/0378-8733(89)90016-6 von Luxburg, U., Radl, A., & Hein, M. (2010). Getting lost in space: Large sample analysis of the commute distance. In Proc. of NIPS 2010, 24th Annual Conference on Neural Information Processing Systems. Curran Associates, Inc. Waltman, L., Tijssen, R. J., & van Eck, N. J. (2011). Globalisation of science in kilometres. Journal of Informetrics, 5(4), 574–582. https://doi.org/10.1016/j.joi.2011.05.003 Quantitative Science Studies 917 l D o w n o a d e d f r o m h t t p : / / d i r e c t . m i t . / e d u q s s / a r t i c e - p d l f / / / / 1 2 8 9 4 1 8 8 5 7 7 4 q s s _ a _ 0 0 0 3 4 p d / . f b y g u e s t t o n 0 7 S e p e m b e r 2 0 2 3 RESEARCH ARTICLE image

PDF Herunterladen