Post on 08-May-2023
How Do Communication Costs Affect Scientific Collaboration? Exploring the Effect of Bitnet
August 2005
Ajay Agrawal and Avi Goldfarb * Abstract We examine the effect of connecting to Bitnet on research collaboration between US universities. Furthermore, we estimate the degree to which the quality of and distance between potential collaborators mediate any such effect. In other words, are benefits that accrue from Bitnet adoption spread uniformly across all adopters? We use publication data from seven engineering journals and Bitnet connection data to address these questions. Overall, we find that Bitnet adoption results in approximately an 85% increase in the likelihood of collaboration between pairs of institutions after they have both connected. In addition, medium-ranked universities seem to benefit most, largely by increasing their collaboration with top-ranked schools. Moreover, the effect is largest for co-located institutions. (JEL O33, R11, Z13) Keywords: collaboration, Internet, knowledge flows, communication technology, productivity
* University of Toronto (both authors). Corresponding author: ajay.agrawal@rotman.utoronto.ca. We thank seminar participants at the Canadian Economics Association meetings and the University of Toronto for useful comments. We also thank Raghav Misra, Swapnil Kotecha, Cara Saunders, and Alex Oettl, who provided excellent research assistance. Errors and omissions are our own. This research was funded by the Social Sciences and Humanities Research Council of Canada (Grant Nos. 410-2004-1770 and 538-02-1013). Their support is gratefully acknowledged.
1
1. Introduction
The objective of this research is to further our understanding of scientific collaboration, an important
component of the knowledge production function. Collaborations are important for a number of reasons.
First, under certain conditions, inter-organization collaboration is more efficient than single-organization
research since collaborators are able to draw upon a wider set of expertise, specialized equipment, and
other resources. Cockburn and Henderson (1998), who measure co-authorships across institutions, report
results suggesting that, at least in the pharmaceutical industry, firms that are more open to collaboration
with universities are more productive in terms of innovation.
Second, collaboration may facilitate greater knowledge flows, and knowledge flows, particularly
knowledge spillovers, are central to economic growth as characterized in endogenous growth models
(Romer, 1986 and 1990). For example, Singh (2005) provides evidence suggesting that network ties
between inventors, as inferred from their past collaborations, are important determinants of knowledge
diffusion patterns.
Third, there is a growing empirical literature on the relationship between geography and scientific
knowledge flows (Jaffe et al, 1993; Audrestch and Feldman, 1996; Zucker et al, 1998; Agrawal et al,
2003; Thompson and Fox-Kean, 2005). Many of these studies have challenged the notion of the “death of
distance” that some suggest might result from advances in communication technologies (Cairncross,
1997). Despite the significant work on this subject, the degree to which lower communication costs
change the nature of collaboration along the geographic dimension remains an open question.
Finally, as a result of various policy initiatives, significant public expenditures have been incurred
with precisely the objective of fostering inter-organization scientific collaboration. One such expenditure
was the creation of the Internet. However, the behavior of researchers in terms of collaboration patterns in
response to such initiatives remains largely unknown. We aim to address these issues here.
In particular, we address this question: To what extent does connecting to Bitnet increase the
propensity of researchers to collaborate with others who are also connected? We find that connecting to
Bitnet, and thus lowering the cost of communication, increases collaboration significantly. Overall, the
2
“Bitnet effect” results in approximately an 85% increase in the likelihood of collaboration between pairs
of institutions after they have both connected.
Next, we examine how the Bitnet effect is mediated by university quality, indicated by the
institution’s overall emphasis on research as measured by the Carnegie Foundation. Our results suggest
that medium-ranked universities benefit more than high-ranked and low-ranked universities from
connecting to Bitnet. We discuss the implications of this finding in our conclusion.
Finally, we turn our attention to the question of whether the Internet is a complement to or a
substitute for face-to-face interactions (Gaspar and Glaeser, 1998). With respect to agglomeration, the
former will encourage greater urbanization, the latter greater sprawl. Our findings suggest that the Bitnet
effect is greatest between nearby institutions, suggesting that connecting to the network is a complement
to face-to-face interactions. Again, we discuss the implications of this finding in the conclusion.
Although there is a significant literature concerning the economics of scientific collaboration
(Crane, 1969; Beaver and Rosen, 1978, 1979; Barnett, Ault, and Kaserman, 1988; Cockburn and
Henderson, 1998; Mairesse and Turner, 2005) and also concerning the social impact of lowering
communication costs via the Internet (Gaspar and Glaeser, 1998; Wellman and Gulia, 1997; Smith, 1999;
Olson and Olson, 2003; Van Alstyne and Brynjolfsson, 2005), there has been very little prior research
examining the effect of lowered communication costs on collaboration.
One notable exception is Hamermesh and Oster (1998), who explore the effect of the 1980s
“communications revolution” on collaborative research in economics. Using co-authored papers from
three major economics journals, they compare the citation counts of papers published in 1970-79 and in
1992-96. The authors find: 1) a significant increase in the fraction of distant co-authorships, 2) lower
quality of distant co-authored papers as compared to close co-authored papers, and 3) no decline in the
relative quality of distant co-authorships over time.
While this prior work is similar to our study in that both are interested in the effect of lowered
communication costs on research collaboration, our work differs in two fundamental ways. First, we
directly measure the relationship between institution-level Bitnet adoption and collaboration. They instead
3
show that general collaboration patterns changed from the 1970s to the 1990s without showing a direct
link to communications technology. Second, we focus on changes in the propensity to collaborate, using
institution-pairs as the unit of analysis. They focus on how citation counts of collaborative work changed,
using co-authored papers as the unit of analysis.
Another exception is Gaspar and Glaesar (1998). While collaboration is not the main focus of
their article, they show that there has been a rapid growth in local co-authorships in economics since the
1960s. They use this to support their argument that information technology is a complement to face-to-
face interaction. Using more rigorous econometric analysis, we confirm their result. Unlike Gaspar and
Glaesar, however, we examine the mediating role of university quality and show that adoption of the
technology leads to increased productivity among a particular set of schools because of their interaction
with more research-oriented local institutions. Furthermore, our focus is on how technology affects
knowledge flows rather than on how technology affects geographic agglomeration.
Our paper proceeds as follows. In Section 2, we provide a brief historical description of Bitnet
and how it may have facilitated research collaboration. In Section 3, we develop a simple framework of
research collaboration, which motivates our method and results. In Section 4, we describe the Bitnet
adoption data and the engineering publication data that form the basis of our key measures. In Section 5,
we present our empirical framework, and, in Section 6, we present the results, including the estimated
Bitnet effect and the estimated degrees to which the effect is mediated by the quality of and distance
between potential collaborators. Finally, in Section 7, we conclude by discussing the implications of these
findings.
2. A Brief History of Bitnet
Our research question is predicated on the assumption that Bitnet facilitates collaboration. In other words,
we assume a causal relationship between Bitnet adoption and increased collaboration. As such, it is
important to offer some context in terms of what Bitnet was used for, how it originated, what was
4
involved in the Bitnet adoption process, and how Bitnet evolved over time in order to provide some
intuition for our causality assumption.
Bitnet was an early leader in network communications for the research and education community.
It allowed communication via email, exchange of data through file transfer protocols, remote file
archives, Listserv, and compatibility with other operating systems such as UNIX.1 Archived emails of
Bitnet users in the 1980s show instances of individuals exchanging publications, sharing opinions,
conducting academic discussions, searching for research partners for collaboration, and facilitating
resource sharing.2
Our Bitnet data spans the period 1981-1990. The World Wide Web was invented near the end of
this period, in 1989; the first mass-market browser, Mosaic, was developed in 1993.3 Therefore, the
version of the network that we are examining in this paper predates the Internet as it is known today.
In the early 1970s, the Advanced Research Projects Agency (ARPA) created one of the foremost
networks of the existing Internet, ARPANET. This network facilitated exchange of computer data across
North America amongst ARPA-funded researchers for ARPA projects. ARPANET’s restriction to only
ARPA-funded researchers led to the development of several other networks, including CSNET,
USENET, and Bitnet. CSNET was founded in 1981 to serve the needs of computer scientists, while
USENET was created by computer science graduate students to help exchange information via
newsgroups.
Bitnet, on the other hand, was a network created to promote the tools of computer networking for
all scholars, not just select ARPA-funded researchers or computer scientists. Ira Fuchs, the Vice
Chancellor of University Systems at the City University of New York (CUNY), conceptualized Bitnet
(Because It's There NETwork) to take advantage of the existing supply of IBM mainframes on many
university campuses. It was intended as a low-cost and easy-to-setup computer network for
1 http://computing.dcu.ie/~humphrys/net.80s.html (Mark Humphrys, The Internet in the 1980s) 2 Copies of archived emails are available from the authors. 3 The invention of the Web is commonly credited to Tim Berners-Lee at CERN in Switzerland. The browser was developed at the University of Illinois National Center for Supercomputing Applications.
5
communication between university scholars. The Bitnet executive committee organized the effort and
established regulations and bylaws. The network was restricted to academia, and no commercial
communication was allowed.
In order to connect to Bitnet, each institution required a 9600 baud leased line between its
computer facility and another institution that was linked with the network as well as IBM networking
software.4 Bitnet setup costs were reasonably low because mainframes already existed on university
campuses, and communication relied on dial-up technology. Administratively, however, creating an
initial connection to the network seems to have been a reasonably involved process.5 The decision to
adopt was generally made by a university’s computing center managers and approved by higher-level
administrators.
The first Bitnet adopters were CUNY and Yale University in May 1981. Over the next three
years, the network expanded slowly to about 157 nodes and approximately 3000 users (Grier and
Campbell, 2000). It also established an executive committee to expand the network and oversee its
operations. In 1984, Bitnet received significant support through a monetary grant provided by IBM to
establish an operations and information center.
In 1986, NSFNET was founded for connecting supercomputer centers across North America. It
created the backbone of what would become the Internet, leading to the rapid growth of Bitnet over the
next few years. In 1987, the Bitnet Information Center introduced Listserv, an automatic mailing list
server for Bitnet. Listserv remains Bitnet’s lasting legacy and is still employed by many universities
around the world. By the end of the 1980s, Bitnet had become the largest academic network in the world
for computer-based communications; it was gradually replaced by the World Wide Web in the mid 1990s.
4 Bitnet was a "store-and-forward" network. Information originating at a given Bitnet-connected computer (node) was received by intermediate nodes and forwarded to its destination. 5 In a rich description of the National University of Singapore’s decision to connect, the Dean of Science at that institution describes the bureaucratic steps that he had to take, including obtaining the assistance and approval of a number of senior university administrators (who, in this case, were very supportive). (http://www.physics.nus.edu.sg/~phytanb/bitnet4.htm).
6
3. Framework
In this section, we develop a framework to explore the implications of Bitnet adoption on research output
and collaboration. First, our framework assumes that communications technology will increase total
research output and multi-institution collaborative output. Second, the framework allows better
communications technology to increase collaboration both among institutions that are close by and those
that are far away. The framework leaves ambiguous whether communications technology makes distance
relatively more or less important; we will address this open question empirically. Third, the framework
also leaves ambiguous which types of institutions benefit from improved communications technology.
High-quality institutions may collaborate more with each other but not with lower-quality institutions.
Alternatively, lower-quality institutions may benefit from increased access to researchers and equipment
in high-quality institutions. We estimate these relationships in Section 6.
We model the decision to collaborate in two stages. In Stage 1, a researcher receives an idea and a
potential collaboration partner. In Stage 2, the researcher decides whether to collaborate on the idea or to
pursue her own different research agenda. We first will describe Stage 2 and then detail Stage 1.
Stage 2
Researchers aim to maximize the probability of getting published in a top journal. Suppose researcher i
has an idea with potential collaborator j. Further suppose the researcher knows that the probability that
this collaborative effort will lead to a publication is:
(1) PrC=g(qi,qj)-C(D,θ)
We model g() as the benefit to collaboration. It is a function of the quality of institution i, qi and the
quality of institution j, qj. C() is the cost of collaboration. It is a function of the distance between
institutions i and j, D, and the current communications technology, θ.
Suppose the probability of publishing if researcher i does not collaborate on the project and
instead pursues her own agenda is:
7
(2) PrS=f(qi)
We model f() as the net benefit to not collaborating. It is a function of the researcher i's quality, qi. The
researcher will choose the collaborative project if PrC>PrS. Otherwise, the researcher will pursue her own
agenda. We impose the following assumptions:
A1) Costs increase with distance but decrease with communications technology:
0>∂∂DC
and 0<∂∂
θC
.
A2) The rate at which costs increase with distance is lower when the technology is
better: 02
<∂∂
∂θD
C.
A3) The probability of collaborative publication increases with the quality of both
institutions: 0>∂∂
iqg
and 0>∂∂
jqg
.
Assumption A1 implies that improved technology will lead to more collaboration. Assumptions
A1 and A2 combined imply that, in this stage, improved technology will lead to more long-distance
collaboration relative to short-distance collaboration.6 They do not, however, make firm predictions about
the impact of relative quality on collaboration probability. We estimate the relationship between
collaboration and distance in Section 6.3.
This model leads to ambiguous predictions about the impact of an improvement in
communications technology on collaboration across and within quality levels. It is possible that improved
technology will lead to more collaboration across quality levels but not within quality levels. In particular,
suppose there are two types of institutions: high-quality H and low-quality L. Furthermore, suppose a
researcher at a high-quality institution is deciding whether to collaborate on a given project with another
researcher.7 If the other researcher is also at a high-quality institution, then the first researcher will choose
to collaborate if
6 Gaspar and Glaesar (1998) suggest that the opposite effect is possible if communications technology is a complement to face-to-face communication. In this case, C12()>0. This idea is modeled in Stage 1. Overall, the full two-stage model suggests that the relationship between collaboration, technology, and distance is ambiguous, both with and without A2. 7 For ease of exposition, we focus on the decisions of researchers at high-quality institutions. The general implications on the effect of communication costs hold if we focus on only researchers from low-quality institutions.
8
(3) g(H,H)-C(D,θ)>f(H)
Similarly, if the other researcher is at a low-quality institution, then the first researcher will choose to
collaborate if
(4) g(H,L)-C(D,θ)>f(H)
By assumption A3, g(H,H)-C(D,θ)>g(H,L)-C(D,θ). Therefore, collaboration is more likely between the
high-quality institutions. However, the effect of an improvement in communications technology from θ to
θ’ will be ambiguous. If we assume that high-quality institutions almost always collaborate with other
high-quality institutions (i.e., g(H,H) is sufficiently large), then an improvement in technology will not
affect collaboration at the margin in equation (3). Alternatively, if we assume that the benefit to a high-
quality institution of collaborating with a low quality institution is sufficiently low (i.e., g(H,L) is
sufficiently large), then an improvement in communications technology will not affect collaboration at the
margin in equation (4). The effect of Bitnet adoption on collaboration depends on which potential
collaborations are on the margin and is therefore an empirical question that we examine in section 6.4.
Stage 1
Stage 1 models the arrival of ideas with potential collaborators. A given researcher at a high-quality
institution, i, receives a collaborative idea from a researcher at another high-quality institution with
probability π and an idea from a researcher at a low-quality institution with probability (1-π). Now
suppose that the collaborative offers arrive at a rate that depends on distance A(D).
A4) Offers arrive more frequently from nearby institutions than from faraway
institutions: 0<∂∂DA
.
Assumption A4 implies that people are more likely to collaborate with others who are nearby, perhaps
because they are more likely to know each other or perhaps because collaboration is much more effective
If f() is increasing in quality, then all collaborations accepted by high-quality institutions will be accepted by low-quality institutions.
9
if people can meet face-to-face. Therefore, the total collaboration probability for institution i at a given
point in time is:
(5) V=Α(D){π·1[g(H,H)-C(D,θ)>f(H)]+(1-π)·1[g(H,L)-C(D,θ)>f(H)]}
Where 1[] is the identity function. Improved communications technology has an ambiguous effect on
nearby collaboration relative to distant collaboration. The results from Stage 2 (resulting from A1 and A2)
still hold: Improved technology reduces the costs of distant collaboration more than nearby collaboration.
An opposing effect is now also relevant: The likelihood of collaborating at all decreases with distance.
This can be seen most cleanly by taking the derivative of equation (5) with respect to D and θ: Let
u(D,θ)={π·1[g(H,H)-C(D,θ)>f(H)]+(1-π)·1[g(H,L)-C(D,θ)>f(H)]}. Then
(6) θ
θθ
θθθθ ∂
∂=
∂∂
+∂
∂=
∂∂ ),()(),()(),()( DuDADuDADuDAV
(7) D
DuDADuDDA
DV
∂∂∂
+∂
∂∂
∂=
∂∂∂
θθ
θθ
θ),()(),()( 22
This term is ambiguous since 0)(<
∂∂
DDA
, 0),(>
∂∂
θθDu
, 0)( >DA , and 0),(2
>∂∂
∂D
Duθ
θ. Therefore,
equation (6) shows that improved communications technology (i.e., Bitnet) will increase the number of
collaborations. Equation (7) shows that whether distant or nearby potential collaborators are more
affected is an empirical question. Distant institutions may be more affected because their costs to
communicate fall more. Alternatively, nearby institutions may be more affected because they are more
likely to know each other and meet face-to-face. They thus may benefit more if connecting to Bitnet is
highly complementary to these knowledge flow mechanisms. Section 6.5 addresses this question.
4. Data
10
We use two units of analysis in our empirical work. First, we examine the effect of connecting to Bitnet
on individual universities and thus use the university as our unit of analysis.8 Then, we examine the Bitnet
effect on potentially collaborating pairs of universities and therefore use university-pairs as our unit of
analysis. We employ four types of data in our empirical work: 1) publication data, 2) Bitnet connection
data, 3) institution quality data, and 4) distance data. We describe these below.
4.1 Publication Data
Since we are interested in identifying the Bitnet effect on collaboration, we use publication data from
researchers in technical areas that are likely to be early adopters of this communication technology. We
do this because we exploit the variation in connection years for identification, and the publishing behavior
of these researchers is most likely to reflect these time differences. Therefore, we select a variety of
electrical engineering research topics that appeared in journals published by the Institute of Electrical &
Electronic Engineers (IEEE).
Specifically, we collect publication data from seven journals over the 15-year period 1977-1991:
1) IEEE Transactions on Aerospace and Electronic Systems, 2) IEEE Transactions on Nuclear Science, 3)
IEEE Transactions on Biomedical Engineering, 4) IEEE Journal of Quantum Electronics, 5) IEEE
Transactions on Electron Devices, 6) IEEE Transactions on Communications, and 7) IEEE Transactions
on Education. Each of these journals is considered among the top outlets for research in the specified
field. Since we focus only on these seven top journals, the total number of publications in our analysis
does not rise over time.
There are 28,312 papers published in these seven journals during the time period under
investigation.9 We parse out all unique author-affiliated institutions from each paper. An example of the
data structure is provided in Table 1. Papers are categorized as either single-institution or multi-institution
8 We only include US universities that have published at least one paper in our set of seven electrical engineering journals during the period 1977-1991. 9 The distribution of publications across journals is not uniform. In the order listed above, the number of publications per journal is 6174, 8505, 3418, 4976, 827, 3585, and 827.
11
(i.e., collaborative). Papers are classified as single-institution if all authors are from a single university.
So, in the example provided in Table 1, the left-hand-side paper is classified as single-authored since all
three authors are from Duke University. However, the right-hand-side paper is classified as multi-
institution since the authors have collaborated across institutional boundaries (Yale and Villanova).
After extracting the institutional information from the set of publications, we identify 739 unique
institutions, of which 289 are US universities - our institution-type of interest. These form the basis of our
unit of analysis. We focus on US universities because many of the international institutions and US non-
university research labs used other networks besides Bitnet. US universities are particularly likely to have
Bitnet as the first data communication network they adopted.10
These data allow us to construct two datasets, each of which is focused on the measurement of the
number of collaborative (multi-institution) papers. First, we construct a single institution dataset that
includes 15 years of publishing data (1977-1991) from the specified journals by the 289 institutions of
interest. This is therefore a balanced panel dataset that consists of 4335 observations. Then, we construct
an institution-pair dataset that includes the same 15 years of publishing data from the specified journals
by the 41,616 institution-pairs resulting in 624,240 observations.
4.2 Bitnet Connection Data
We use an online reference, Cyber Geography Research, for a record of Bitnet connections.11 This archive
lists the 1,054 institutions worldwide that connected to Bitnet by the end of 1990 and their connection
date.12 Of the 289 US universities that published at least once in the seven IEEE journals we examine, 225
connected to Bitnet during by this time. In other words, 64 US universities in our publishing sample had
not connected to Bitnet by the end of 1990. Figure 1 illustrates the connection rate. The variation in
10 http://computing.dcu.ie/~humphrys/net.80s.html (Mark Humphrys, The Internet in the 1980s) 11 http://www.cybergeography.org/atlas/bitnet_topology.txt 12 In our analysis, we use the year following adoption as the first year Bitnet is available at the university. Therefore, the effect of adoption is first measured in 1984 for a school that adopts in 1983. This is to account for the (relatively short) publication lag in engineering and any lag in intra-university adoption. Results are robust to using the year of adoption.
12
connection years is important for our econometric analysis since we exploit this to identify the effect of
Bitnet.
In addition, although the first three institutions connected were all on the east coast, by the second
year some universities further inland were connected, such as Ohio State, and two, UC Berkeley and UC
San Francisco, were on the west coast. By the third year, universities up and down the east coast were
connected, as well as institutions as far inland as the University of Missouri-Columbia (MO), in addition
to a third institution on the west coast (Stanford). Since we examine the Bitnet effect with respect to the
distance between collaborating institutions, a sense of the geographic distribution is also useful. As such,
the geographic distributions of connected universities over the first three years are illustrated in Figure 2.
4.3 Quality Data
We use the 1987 Carnegie Foundation classification system to classify the research “quality” of each
university in our dataset.13 We classify universities as Carnegie Type 1, 2, or 3 (CT1, CT2, CT3).
Carnegie Type 1 (CT1) is an aggregate of the Carnegie Foundation’s categories “Research University 1
and 2.” Thus, institutions with our CT1 classification offer a full range of baccalaureate programs, are
committed to graduate education through the doctorate degree, and give high priority to research. They
receive annually at least $12.5 million in federal support and award at least 50 Ph.D. degrees each year.14
Institutions with a CT2 classification are an aggregate of the Carnegie Foundation’s categories
“Doctorate-Granting Universities 1 and 2.” These institutions offer a full range of baccalaureate
programs, and their mission includes at least some commitment to graduate education through the
doctorate degree, such that they award annually 20 or more Ph.D. degrees in at least one discipline or 10
or more Ph.D. degrees in three or more disciplines, but do not meet the requirements for Carnegie Type 1.
All other universities are classified as CT3.
13 A Classification of Institutions of Higher Education (1987 Edition), A Carnegie Foundation Technical Report, Princeton University Press, Lawrenceville, NJ. 14 The years used in calculating average federal support were 1983, 1984, and 1985.
13
4.4 Distance Data
We generate distance data as follows. First, we search the Internet for the official website of each
university in our dataset to establish the primary location (city, state) of its research campus. Then, we
obtain latitude and longitude measures from the US Geological Survey based on the city-state data.15
Finally, we determine the distance between each university pair by employing the great circle method to
calculate the distance in kilometers between the two sets of geographic coordinates.16
5. Empirical Strategy
We present two types of results: institution-level and institution-pair-level. Our estimation strategy is
based on “difference-in-differences” identification. For the single institution data, we examine how total
publications and how multi-institution publications change after an institution adopts Bitnet relative to
other institutions. For the paired institution data, we examine how collaboration between institution-pairs
that both adopt Bitnet changes relative to other institution-pairs in which one or both have not adopted.
We observe the number of collaborative multi-institution publications at the institution and the institution-
pair levels.
5.1 Single Institution Estimation
We use the single institution data for two purposes. First, we explore whether Bitnet adoption is related to
an increase in total publications, and, second, we explore whether Bitnet adoption is related to an increase
in multi-institution publications specifically.
To estimate the relationship between Bitnet adoption and total publications, we estimate the
following fixed effects regression for institution i in year t:
(8) Total # publicationsit=βHas Bitnetit+µt+φi+εit
15 The US Geological Survey can be accessed at http://geonames.usgs.gov/ and a web query application exists at http://geonames.usgs.gov/pls/gnis/web_query.gnis_web_query_form. 16 The great circle formula used is: acos(cos(lat1)*cos(long1)*cos(lat2)*cos(long2) +cos(lat1)*sin(long1)*cos(lat2)*sin(long2)+sin(lat1)*sin(lat2)) * earthRadius
14
Where β measures the effect of Bitnet adoption, µt is a year fixed effect, φi is an institution fixed effect,
and εit is an idiosyncratic error term.
When estimating the effect of Bitnet adoption on multi-institution publications, we estimate a
similar equation:
(9) #Multi-instit. publicationsit=αXit+βHas Bitnetit+µt+φi+εit
Where Xit contains observed institution-year characteristics. In particular, Xit is a count of single-
institution publications, an observable measure of how institution quality may vary over time.
We use equations (8) and (9) to explore whether Bitnet adoption is related to an increase in both
total publications and in academic collaboration. Since the dependent variable is a count variable, our
main models estimate these using fixed-effects negative binomial regressions. We also show results using
OLS regressions.
For equations (8) and (9) to estimate the Bitnet effect on total publications and multi-institution
publications respectively, we assume that unobserved institution quality can be decomposed into an
additively separable fixed component and a time varying component. The time component is constant
across institutions (Athey and Stern, 2002). This assumption is slightly weaker in equation (9) than in
equation (8) because we have an observable measure of institution quality.
More generally, this assumption is questionable if Bitnet adoption is a signal of an unobserved
quality improvement. While the decisions of the administrators on adoption are likely removed from
changes in engineering faculties’ propensity to collaborate, this alternative hypothesis cannot be rejected
in the single-institution data. Therefore, the results based on equations (8) and (9) are merely suggestive.
They show that Bitnet may have increased productivity and collaboration.
The institution-pair results on Bitnet and collaboration are much less susceptible to criticisms of
this assumption. In the next section, we present an estimation strategy based on institution-pairs that is
less susceptible to criticisms of spurious correlation between adoption choice and collaboration.
15
Furthermore, it allows us to better understand the reasons Bitnet adoption may be correlated with
increased collaboration.
5.2 Institution-Pair Estimation
We are interested in Bitnet as the communications technology. Since Bitnet is only effective if both
institutions have the technology, we focus on the effect of both institutions adopting Bitnet on
collaboration. We interpret evidence that just one institution adopting Bitnet does not lead to significantly
more collaboration as support for our main findings. We label the first institution in the pair i, the second
j, and the year t.
We run linear regressions on the data to get the following equation:
(10) Collaborationijt=αXijt+βBoth Have Bitnetijt+µt+φij+εijt
Where φij now measures institution-pair fixed effects and Xijt is a vector of observable institution-pair-year
characteristics. This vector contains our proxy for observed pair quality in year t: the total number of
single-institution papers published by both universities in that year.17 In some specifications, it also
contains the distance between the institutions and whether any one of the pair has adopted Bitnet.
In our main models, we treat Collaborationijt as a dummy variable for whether institution i and j
have any collaborations in year t. We estimate equation (10) using a fixed effects linear probability (OLS)
regression. We treat collaboration as a dummy variable because 85% of all institution-pair-years with at
least one collaboration have only one collaboration. We also show results for a fixed effects OLS
regression on total number of collaborations, a fixed effects probit regression on only those pairs with at
least one publication, and a fixed effects negative binomial regression on total number of publications on
only those pairs with at least one publication.18
17 We also show that results do not change if the product of the single-institution publications at the two universities is used instead of the sum. 18 We limit the fixed effects probit and negative binomial regressions to only those pairs with at least one publication in order to overcome computational difficulties in estimating thousands of dummy variables.
16
For this linear equation to identify the average effect of Bitnet adoption on collaboration between
two given institutions, we again implicitly assume that unobserved institution-pair quality can be
decomposed into an additively separable fixed component and a time varying component. The time
component is constant across institution-pairs (Athey and Stern 2002). This is a weaker assumption than
that used for the single-institution data. In this case, unobserved quality is at the institution-pair rather
than the institution level. Furthermore, we present evidence that both institutions need to adopt for
adoption to be correlated with increased collaboration. If only one institution adopts, they are no more
likely to collaborate. The next section presents the estimates of these models.
6. Results
In this section, we examine whether Bitnet adoption influences research collaboration among university
scientists. In section 6.1, we begin with descriptive statistics that suggest that schools that adopt Bitnet do
publish more and collaborate more. Using a difference-in-difference type identification, Section 6.2
shows that after Bitnet adoption universities produce more papers in the top journals than they do before
Bitnet adoption relative to universities that do not adopt. Section 6.3 shows that inter-institution
collaboration in particular increased considerably. In Sections 6.4 and 6.5, we show that medium-ranked
universities seem to benefit most and that Bitnet especially facilitates collaboration between nearby
institutions.
6.1 Descriptive Statistics
We begin our analysis by examining the basic distributional properties of our key measures: 1) research
publications, 2) Bitnet adoption, 3) institution quality, and 4) distance between potential collaborators.
Table 2a presents descriptive statistics with institution-years as the unit of analysis, and Table 2b presents
descriptive statistics with institution-pair-years as the unit of analysis.19
19 As discussed in Sections 4 and 5, our chosen empirical methodology requires us to examine our data from two perspectives using two distinct units of analysis. First, we examine the effect of Bitnet adoption on the research
17
The average university in our dataset publishes 2.51 papers per year in our specified set of
journals over the particular time period under investigation. Of these, 1.36 are multi-institutional and 1.15
are single-institutional. In other words, on average these institutions publish approximately 18% more
multi-institutional papers per year.20 Overall, 37% of the institution-year observations have at least one
multi-institutional paper.
With respect to Bitnet adoption, while the first three institutions were connected to Bitnet in
1981, the average university in the dataset, conditional on being connected before or during 1990, is not
connected until halfway through 1985.21 Figure 1 illustrates the significant variation in adoption years
across universities, which is central to our identification strategy.
The first hint of a Bitnet effect is reflected in the comparison of multi-institutional papers per year
by institutions with and without Bitnet: 2.47 and 0.92 respectively. However, this difference could be
caused by other factors such as time (multi-institution papers are more likely in later years, perhaps for
reasons that are not related to Bitnet adoption, and universities are more likely to have adopted Bitnet in
later years) and quality (higher-quality institutions may be more likely to publish a greater number of
multi-institutional papers and may also be more likely to adopt Bitnet earlier). Also, it is interesting to
note that, at least at first glance, Bitnet adoption is correlated with overall output, including single-
institution paper production. As Table 2a illustrates, institution-years with Bitnet average 2.30 more total
papers per year, of which only 1.55 are multi-institutional. Thus, institution-years with Bitnet have 0.75
more single-institution papers.
Next, we compare average paper production by institutions in the three Carnegie research quality
categories. Overall, the research output from these institutions corresponds with what one would expect.
behavior of single institutions. For this we use “institution-year” as our unit of analysis. Then, we examine the effect of Bitnet adoption on the research behavior of pairs of institutions and use “institution-pair-year” as our unit of analysis. 20 It is important to note that since our dataset is constructed from a constant set of top journals in electrical engineering, the total number of publications in our sample does not increase steadily over time. In fact, there are a total of 1,702 papers published in the first year of observation (1977) and 1,646 papers published in the last year (1991). The total number of publications fluctuates from year to year due to the publication of special issues and occasional conference proceedings. 21 Of the 289 universities in our dataset, 64 were not connected by 1990.
18
Type 1 institutions produce more than six times as many multi-institutional papers as Type 2, and Type 2
institutions produce almost 1.5 times as many multi-institutional papers as Type 3. The ratios for single-
institutional papers are similar.
Finally, we turn to Table 2b to examine the basic properties of our institution-pairs. The average
institution-pair is separated by a distance of 1,742 kilometers (1,082 miles) and produces 0.00132 multi-
institution papers per year. Again, we see suggestive evidence of a Bitnet effect since university-pairs
generate, on average, more than four times as many multi-institution papers if both are connected to
Bitnet. However, for the same reasons as described above, other factors such as time and quality could be
confounding this relationship.
We now move on to a more rigorous examination of the effect of Bitnet. That is, following up on
the suggestive evidence presented in the descriptive statistics, we seek to estimate the extent to which
Bitnet adoption increases the propensity of universities to publish and to collaborate, after controlling for
likely confounding effects.
6.2 The Effect of Bitnet on Total Publications
We begin our examination of the Bitnet effect by examining whether Bitnet adoption is associated with an
increase in total publications. We simply regress our count of total papers per institution-year on whether
an institution has adopted Bitnet (Has Bitnet), including year and institution fixed effects. Column (1) of
Table 3 shows our main negative binomial specification with year and institution fixed effects. This
specification estimates a positive and significant coefficient on Has Bitnet, consistent with the descriptive
data presented above.22 Columns (3), (4), and (5) suggest that middle-level institutions especially benefit
from Bitnet adoption. We explore this idea further in Section 5.4.
Since total publications rise, this suggests that Bitnet adoption may increase the productivity of
the relevant university research group. While this result may be a function of spurious correlation between
22 The OLS specification, which is highly inefficient here, also estimates a positive coefficient as expected, but it is not statistically significant.
19
adoption and research productivity, it is at least suggestive of a correlation between adoption and research
output. The rest of the paper explores how Bitnet adoption might have increased productivity by focusing
on its role in increasing collaboration across institutions.
6.3 The Effect of Bitnet on Collaboration across Institutions
We conduct our estimation of the Bitnet effect on collaboration in two parts. First, we provide suggestive
evidence that Bitnet adoption increases collaboration using single-institution data. Then we use
institution-pair data to overcome the alternative possibility of spurious correlation between Bitnet
adoption and collaboration. We present each of these analyses in turn.
6.3.1 Single Institution Analysis
For this analysis, our dependent variable is a count of multi-institutional papers (Table 4 columns (1) and
(2)). We regress this dependent variable on Has Bitnet, again including year and institution fixed effects,
and now also include a count of single-institution papers to control for within-institution quality changes
over time. Both negative binomial and OLS specifications generate positive and significant coefficients
on Has Bitnet. The magnitude of this effect is also economically significant. Our results suggest that even
after controlling for year, institutional characteristics, and within-institution quality changes over time,
connecting to Bitnet results in almost a 30% increase in multi-institutional papers per year.23 This
provides preliminary evidence that Bitnet adoption increased academic collaboration.
6.3.2 Institution-Pairs Analysis
We now turn our attention to our second unit of analysis, the institution-pair. With the single institution
analysis above, even though we have controlled for year and institution effects as well as within-
university quality changes as measured by single-institution paper counts, one might worry about an
23 For robustness, we also have run Poisson, zero-inflated Poisson, and negative binomial regressions and find similar results.
20
unobserved variable bias. For example, perhaps young scholars have a taste for collaboration and also for
Bitnet (and are able to influence their universities to connect earlier than they otherwise would). In such a
case, we would observe universities that connect earlier also collaborating more, even though Bitnet
connection is not facilitating collaboration. Ideally, we would have an instrument that is correlated with
Bitnet adoption but not with publications. In the absence of such an instrument, we rely on evidence from
institution-pairs. It is much less likely that researchers in our sample who, at a stretch, might be able to
influence their universities to connect, also are able to influence other university administrations to do so.
As reported in Table 5 column (1), we find a significant effect on collaborative publications from
both universities in the pair being connected to Bitnet (Both Have Bitnet). In fact, the likelihood of a
collaboration between institutions increases by approximately 85% if both institutions have connected,
controlling for year and institution-pair effects as well as for within-pair quality changes over time
(measured by single-institution publications).
We also observe that this effect remains virtually unchanged when we control for whether only
one of the members of the pair has connected (Table 5 column (2)). We find this a particularly interesting
result. There is no increase in the likelihood of collaboration when only one joins the network (the
coefficient on One or more has adopted Bitnet is not significant), but there is a significant effect when
both connect. We interpret this result as suggesting that it is both universities in the pair being connected
that drives the increase in collaboration. This provides strong evidence that it is the connection between
universities that leads to increased collaboration. It does not appear to be a result of spurious correlation
between adoption and collaboration.
We conduct a variety of robustness checks on our institution-pair results. First, we employ a
count measure rather than a dummy for our dependent variable and generate results that suggest that both
institutions having Bitnet results in a 75% increase in the number of collaborations (Table 5 column (3)).
Next, we run probit and negative binomial specifications and find similar results. Finally, we drop the
fixed effects and conduct OLS with clustered standard errors by institution-pairs in column (6), include a
21
variation in the quality control measure in column (7), and omit the control measure in column (8). Still,
the main result persists.24
Now that we have established the general effect of Bitnet, we turn our attention to how it is
mediated by institution quality and distance. Are the benefits from Bitnet adoption spread uniformly
across all adopters, or do they vary with quality and distance?
6.4 Does the Bitnet Effect Vary with Institution Quality?
In Section 6.2, we show that medium-ranked schools seem to benefit most from Bitnet in terms of
increased collaboration. In particular, when we split our sample into the aggregated Carnegie research
strength categories, the coefficient on Has Bitnet is only statistically significant for universities in the
middle category. This result persists when we examine the relationship between multi-institutional papers
and Bitnet adoption in Table 4, although the coefficient is positive for all three categories in both tables.
We examine the relationship between university research quality and the Bitnet effect more
closely in Table 6. Here, using our institution-pair data, we divide our sample into six groups, reflecting
all possible combinations of quality types. While all coefficients on Both Have Bitnet are positive, except
for the low-low pairs, only the coefficient on high-middle pairs is statistically significant. For this sub-
sample, both universities in the pair being connected increases the likelihood of collaboration by over
150%.
We find these quality-related results striking. It appears that the benefits of Bitnet adoption,
measured by an increase in publications, accrue primarily to medium-ranked schools (Tables 3 and 4).
Furthermore, it seems as though these medium-ranked schools benefit by collaborating with top-ranked
schools (Table 6). Top-ranked schools seem neither helped nor harmed by Bitnet adoption.
We have two possible explanations for this result. First, it may be that top-ranked schools
collaborate with other top-ranked schools anyway, such that these collaborations are not on the margin.
24 In Table 5 column (7), we employ the product of the count of single-institution papers from each university in the pair rather than the sum as our control for within-pair quality changes over time.
22
However, collaboration between top-ranked schools and medium-ranked schools may be marginal such
that a decrease in the costs of collaboration leads to significantly more collaboration. This could be
professors at top schools collaborating with former students, which was not previously worthwhile, or a
number of other similar possibilities.
Second, top-ranked schools may have equipment that medium-ranked schools do not. When top-
ranked schools collaborate with each other, their communication needs are simple. Each researcher can
work on their own equipment, talk on the phone, and overnight drafts when necessary. Medium-ranked
schools, however, may not have the specialized equipment necessary to run certain experiments, even if
they have the idea. Therefore, there is a considerable amount of data sent back and forth in this type of
collaboration. Under this scenario, Bitnet adoption is more valuable in top-middle collaborations that
require “thick” data communications as compared to top-top collaborations that may require only “thin”
communications.
6.5 Does the Bitnet Effect Vary with Distance?
Here we examine whether the Bitnet effect varies with distance. As discussed above, if Bitnet adoption is
a substitute for face-to-face interaction, we would expect the benefits of adoption to be greatest for
universities that are furthest apart. On the other hand, if Bitnet adoption is a complement to face-to-face
interaction, we would expect the benefits of adoption to be greatest for universities that are co-located.
To address this issue, we employ a spline regression, grouping together universities that are: 1)
within 100km, 2) between 100 and 1000km apart, 3) between 1000 and 3000km apart, and 4) further than
3000km apart. Our results, which again include year and institution-pair fixed effects as well as a control
for within-pair productivity changes over time, suggest that Bitnet adoption is likely a complement for
face-to-face interactions. Bitnet adoption has the greatest effect on university-pairs that are co-located
(within 100km). We also find that collaboration increases regardless of distance, suggesting that Bitnet
does reduce the cost of collaboration even for distant institutions. Overall, the Bitnet effect is more than
twice as large for co-located universities as it is for those that are in the next category (100-1000km
23
apart). For robustness, we report various groupings using different distance ranges, but the main result
persists: Co-located universities benefit most from Bitnet adoption.
Finally, we examine the interaction effect of quality and distance on the Bitnet effect (Table 8).
Overall, our prior results persist: The greatest effect on multi-institutional paper production occurs for
medium-ranked universities that collaborate with co-located top-ranked universities. Indeed, medium-
ranked universities also increase their collaboration with non-co-located top-ranked universities, but the
effect of Bitnet is almost 10 times greater for those that are co-located. When the data are split this way,
we see that low-ranked universities also benefit from the Bitnet effect through collaboration with co-
located top-ranked institutions. Interestingly, pairs of top-ranked, co-located universities seem to reduce
their level of collaboration upon adopting Bitnet. We speculate that this may be due to substitution for
collaboration with medium-ranked schools but leave further investigation of this phenomenon for future
research.
7. Conclusions
Does connecting to Bitnet increase research productivity? If so, how? Are benefits evenly distributed
across all adopters? We offer evidence that connecting to Bitnet is related to an increase in overall
research productivity, particularly for medium-ranked schools. Moreover, our findings suggest that this
increase is mainly related to an increase in collaboration by medium-ranked universities with co-located
top-ranked schools.
Bitnet adoption increases productivity by lowering the cost of communication and thereby
increasing collaboration. Indeed, we find that both institutions being connected to Bitnet increases their
likelihood of collaboration by approximately 85%, after controlling for year and institution-pair effects as
well as for within-pair quality changes over time. As discussed in the introduction, collaboration allows
the sharing of knowledge as well as specialized equipment and other resources that may lead to a more
efficiently functioning market for ideas. Bitnet adoption may lead to collaboration simply by reducing the
costs of exchanging ideas. Alternatively, Bitnet adoption may make it easier for a researcher in one
24
location with an idea to collaborate with a researcher in another location with specialized equipment for
testing the idea.
Universities that are co-located seem to experience the greatest boost in collaboration as a result
of Bitnet adoption. We interpret this finding as indicating that low-cost electronic communication, while
certainly a substitute for face-to-face interactions under certain conditions, is also a complement.
Researchers communicate with people they know and are more likely to know those who work nearby.
The policy implications of this finding are clear. For communication infrastructure to increase
collaboration, it must connect people who want to communicate; presumably these are more likely to be
people who already know each other and have an existing social relationship.
Medium-ranked universities seem to benefit most. One might expect top-ranked universities to
benefit most from Bitnet adoption. On average, these institutions publish over six times as often as their
medium-ranked counterparts. However, for all permutations of the data, including total publications and
multi-institutional publications as well as for single institutions and institution-pairs, the statistically and
economically significant coefficient on Has Bitnet is primarily associated with medium-ranked
universities. Why?
The results presented in Table 6 offer some insight. They suggest that the main source of
increased collaboration is between top- and medium-ranked universities. We interpret this result as
indicating that researchers at medium-ranked universities offer something valuable to their colleagues at
top-ranked schools and that Bitnet adoption facilitates this exchange. Perhaps they have good ideas that
need testing on specialized equipment only available at top-ranked schools, or perhaps they are willing to
work harder as collaborators. In either case, from the perspective of the top-ranked institution
considering collaboration with either another top-ranked institution or with a middle-ranked institution,
the marginal benefit of Bitnet adoption may be greater for the latter.
At the same time, there is surprisingly little increase in collaboration within same quality groups;
there is no significant increase in collaboration amongst the prolific set of top-ranked institutions. We
speculate that this is because top-ranked universities have access to their own specialized equipment, and
25
therefore the data-sharing capability of Bitnet is not as important. Of course, that doesn’t mean these
universities don’t collaborate. They do. However, connecting to Bitnet does not result in an increased
propensity to collaborate. Low-ranked institutions do not benefit significantly from being connected to
top- and medium-ranked universities. Even though they have to be publishing institutions to be included
in the dataset, the research efforts of these universities may be too limited to attract collaborative partners.
Overall, these results suggest that welfare returns on public investments in scientific
communications infrastructure are highly sensitive to the demand, or latent demand, to communicate.
Bitnet seems to have achieved its main purpose of facilitating academic collaboration. In doing so,
middle-ranked universities are the main beneficiaries. Bitnet facilitates access to resources at co-located
top-ranked universities and consequently increases their productivity.
26
References Agrawal, Ajay, Iain Cockburn, and John McHale (2003) “Gone But Not Forgotten: Labor Flows,
Knowledge Spillovers, and Enduring Social Capital,” National Bureau of Economic Research Working Paper #9950.
Audrestch, David B., and Maryann P. Feldman (1996) “R&D Flows and the Geography of Innovation and
Production,” American Economic Review Vol. 86(3), 630-640. Barnett, Andy H., Richard W.Ault, and David L.Kaserman (1988) “The Rising Incidence of Co-
authorship in Economics: Further Evidence,” Review of Economics and Statistics 70(3), 539-543. Beaver, D. and R. Rosen (1978) “Studies in Scientific Collaboration. Part 1. The Professional Origins of
Scientific Co-authorship,” Scientometrics, 1, 65-84. Beaver, D. and R. Rosen (1979) “Studies in Scientific Collaboration. Part 2. Scientific Co-authorship,
Research Productivity and Visibility in the French Scientific Elite,” Scientometrics, 1, 133-49. Cairncross, F. (1997) The Death of Distance: How the Communications Revolution Will Change Our
Lives, Harvard Business School Press, Cambridge, MA. Cockburn, I., and R. Henderson (1998) "Absorptive Capacity, Coauthoring Behavior, and the
Organization of Research in Drug Discovery," Journal of Industrial Economics, Vol. XLVI, No. 2, pp. 157-182.
Crane, D. (1969) “Social Structure in a Group of Scientists: A Test of the ‘Invisible College’
Hypothesis,” American Sociological Review, XXXIV, 335-352. Gaspar, J. and E. Glaeser (1998) “Information Technology and the Future of Cities,” Journal of Urban
Economics, 48(1), 136-156. Grier, D.A., and M. Campbell (2000). “A Social History of Bitnet and Listserv, 1985-1991,” IEEE Annals
of the History of Computing, April-June, 42-41. Gurbaxani, V., 1990, Diffusion in Computing Networks: The Case of BITNET, Communications of the
ACM 33, 65-75. Hamermesh, D.S., and S.M. Oster (1998) "Tools or Toys? The Impact of High Technology on Scholarly
Productivity," National Bureau of Economic Research Working Paper #6761. Jaffe, A.B., M. Trajtenberg, and R. Henderson (1993) “Geographic Localization of Knowledge Flows as
Evidenced by Patent Citations,” Quarterly Journal of Economics 108, 577-598. Mairesse J. and L. Turner (2005) “Measurement and Explanation of the Intensity of Co-publication in
Scientific Research: An Analysis at the Laboratory Level,” in New Frontiers in the Economics of Innovation and New Technology: Essays in Honor of Paul David, eds. C. Antonelli, D. Foray, B. Hall, and E. Steinmueller, Edward Elgar Publishing, forthcoming.
Olson, G.M., and J.S. Olson (2003) “Mitigating the Effects of Distance on Collaborative Intellectual
Work,” Economics of Innovation and New Technology, Vol. 12(1), pp. 27-42.
27
Romer, P. M. (1986) "Increasing Returns and Long-Run Growth," Journal of Political Economy, Vol. 94, pp. 1001-37.
Romer, P. M. (1990) "Endogenous Technological Change," Journal of Political Economy, Vol. 98,
supplement to No. 5, pp. 71-102. Singh, J. (2005) “Collaborative Networks as Determinants of Knowledge Diffusion Patterns,”
Management Science, 51(5): 756-770. Smith, M.A. (1999) “Invisible Crowds in Cyberspace: Mapping the Social Structure of the Usenet” in
Communities in Cyberspace: Perspectives on New Forms of Social Organization, Routledge Press, London, UK.
Thompson, P. and M. Fox-Kean (2005) “Patent Citations and the Geography of Knowledge Spillovers: A
Reassessment”, American Economic Review, 95(1): 450-460. Van Alstyne, M. and E. Brynjolfsson (2005) “Global Village or Cyber-Balkans? Modeling and
Measuring the Integration of Electronic Communities,” Management Science 51, 851-868. Wellman, B. and M. Gulia (1997) “Net Surfers Don’t Ride Alone: Virtual Communities as Communities”
P. Kollock and M. Smith eds. Communities in Cyberspace, University of California Press, Berkeley, CA.
Zucker, L., M. Darby, and M.B. Brewer (1998) “Intellectual Capital and the Birth of U.S. Biotechnology
Enterprises,” American Economic Review 88, 290-306.
28
Table 1 – Example of Publication Data Single-Institution Multi-Institution
Title A VOLTAGE-TRIGGERED SYSTEM FOR ADAPTIVE
SAMPLING IN BODY-SURFACE MAPPING PARAMETRIC MODELING OF SOMATOSENSORY EVOKED-POTENTIALS
Author(s) BLANCHARD SM, BARR RC, SPACH MS JACOBS MH, RAO SS, JOSE GV Source IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING
29 (11): 726-730 NOV 1982 IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING 36 (3): 392-403 MAR 1989
Addresses DUKE UNIV, DEPT BIOMED ENGN, DURHAM, NC 27706 USA<br> DUKE UNIV, DEPT PEDIAT, DURHAM, NC 27706 USA<br> DUKE UNIV, DEPT PHYSIOL, DURHAM, NC 27706 USA
YALE UNIV, DEPT ELECT ENGN, NEW HAVEN, CT 06520 USA<br> VILLANOVA UNIV, DEPT ELECT ENGN, VILLANOVA, PA 19085 USA
29
Table 2a: Descriptive Statistics—Single Institution Data Variable (by year) Mean Standard
deviation Minimum Maximum # of
observations Total papers 2.508 6.439 0 131 4,335 Multi-institution papers 1.360 3.147 0 39 4,335 Single institution papers 1.149 3.739 0 92 4,335 Any Multi-institution papers (dummy) 0.374 0.484 0 1 4,335 Average year online* 1985.573 2.054 1981 1990 225 Has Bitnet 0.282 0.450 0 1 4,335 Total papers if do not have Bitnet 1.862 5.724 0 106 3,114 Total papers if have Bitnet 4.16 7.738 0 131 1,221 Multi-institution papers if do not have Bitnet 0.923 2.521 0 31 3,114 Multi-institution papers if have Bitnet 2.473 4.153 0 39 1,221 CMU Type 1 Multi-institution papers 3.062 4.537 0 39 1,575 Single institution papers 2.696 5.697 0 92 1,575 CMU Type 2 Multi-institution papers 0.494 1.046 0 9 945 Single institution papers 0.368 0.837 0 9 945 CMU Type 3 Multi-institution papers 0.333 1.131 0 17 1,815 Single institution papers 0.212 1.263 0 9 1,815 *Conditional on being online by the end of 1990
30
Table 2b: Descriptive Statistics—Institution-Pair Data Mean Standard
deviation Minimum Maximum # of
observations # collaborative papers between the pair 0.00132 0.0466 0 6 624,240 Dummy if any collaborative papers that year 0.00107 0.0328 0 1 624,240 # collaborative papers if at least one has not adopted Bitnet 0.000852 0.0379 0 6 516,635 # collaborative papers if both have adopted Bitnet 0.00360 0.0756 0 5 107,605 Distance 1,742.732 1,281.485 0 8,293.748 624,240 Sum of # of single institution papers produced by the pair 2.297 5.295 0 122 624,240 Product of # of single institution papers produced by the pair 1.362 15.250 0 3,496 624,240 Dummy if at least one of the pair has adopted Bitnet 0.391 0.488 0 1 624,240 Dummy if both institutions have adopted Bitnet 0.172 0.378 0 1 624,240
31
Table 3: Bitnet Adoption and Total Publications in the Single Institution Data (1) (2) (3) (4) (5) Main Specification:
Negative Binomial Regression
Linear Regression Main Specification on Institutions in
CT1
Main Specification on Institutions in
CT2
Main Specification on Institutions in
CT3 0.196 0.178 0.0248 0.700 0.260 Has Bitnet
(0.0597)** (0.177) (0.0712) (0.193)** (0.166) # of Observations 4,335 4,335 1,575 945 1,815 # of Groups 289 289 105 63 121 Log Likelihood -5,346.82 N/A -3,169.20 -947.11 -1,179.61 Regressions include year and institution fixed effects
32
Table 4: Bitnet Adoption and Multi-Institution Publications in the Single Institution Data (1) (2) (3) (4) (5) Main Specification:
Negative Binomial Regression
Linear Regression
Main Specification on Institutions in
CT1
Main Specification on Institutions in
CT2
Main Specification on Institutions in
CT3 0.163 0.394 0.0497 0.586 0.107 Has Bitnet
(0.0654)* (0.0799)** (0.0779) (0.235)* (0.180) 0.0193 0.221 0.0159 0.159 0.227 Single institution papers
(0.00280)** (0.00975)** (0.00260)** (0.0500)** (0.0477)** # of Observations 4,035 4,335 1,560 915 1,560 # of Groups 269 289 104 61 104 Log Likelihood -4,105.85 N/A -2,527.05 -672.92 -857.99
Regressions include year and institution fixed effects
33
Table 5: Bitnet Adoption and Collaboration Using Institution-Pairs (1) (2) (3) (4) (5) (6) (7) (8) Main specification:
Linear regression with a dummy for any collaboration as the dependent
variable
Includes variable if just one institution
has adopted
Dependent variable is the
total # of collaborations
Probit regression with only
pairs with at least one
collaboration
Negative binomial regression on total #
of collaborations using only pairs with at least one
collaboration
No fixed effects linear regression;
errors clustered by
institution-pair
Alternative control for pair
productivity
No control for pair
productivity
0.000917 0.000918 0.00102 0.270 0.409 0.00247 0.000921 0.000907 Both have Bitnet (0.000156)** (0.000158)** (0.000221)** (0.113)* (0.219)+ (0.000335)** (0.000156)** (0.000156)**
-0.00000500 One or more has adopted Bitnet (0.0001645)
0.0000544 0.0000544 0.0000399 0.00777 0.0190 0.000430 Sum of # of single institution papers (0.0000128)** (0.0000128)** (0.0000181)* (0.00425)+ (0.00829)** (0.0000542)**
0.0000154 Product of # of single institution papers (0.00000340)** # of Observations 624,240 624,240 624,240 6,930 6,930 624,240 624,240 624,240 # of Groups 41,616 41,616 41,616 462 462 N/A 41,616 41,616 Log Likelihood N/A N/A N/A -1,251.76 -1,373.50 N/A N/A N/A
Unless otherwise specified, regressions include year and institution-pair fixed effects
34
Table 6: Bitnet Adoption, Collaboration, and Institution-Pair Quality (1) (2) (3) (4) (5) (6) CT1 and CT1 CT1 and CT2 CT1 and CT3 CT2 and CT2 CT2 and CT3 CT3 and CT3
0.000809 0.00149 0.000368 0.000620 0.000171 -0.000463 Both have Bitnet (0.00102) (0.000380)** (0.000241) (0.000391) (0.000155) (0.000213)* 0.000214 -0.0000617 0.00000380 0.000214 0.00000630 -0.0000335 Sum of # of single
institution papers (0.0000486)** (0.0000270)* (0.0000163) (0.0000990)* (0.0000282) (0.0000273) # of Observations 81,900 99,225 190,575 29,295 114,345 108,900 # of Groups 5,460 6,615 12,705 1,953 7,623 7,260
Regressions include year and institution-pair fixed effects. CT1, CT2, and CT3 define the Carnegie Foundation’s rankings of research focus.
35
Table 7: Bitnet Adoption, Collaboration, and Institution-Pair Distance (1) (2) (3) (4) Main
specification Alternative distance (1)
Alternative distance (2)
Main specification with total # of
collaborations as the dependent variable
0.00203 0.00203 0.00203 0.00236 Distance is under 100 km and Both Adopted Bitnet (0.000867)* (0.000867)* (0.000867)* (0.00123)+
0.000742 0.000743 0.000533 Distance is between 100 km and 1000 km and Both Adopted Bitnet (0.000225)** (0.000225)** (0.000319)+
0.000754 0.00112 Distance is between 1000 km and 3000 km and Both Adopted Bitnet (0.000201)** (0.000284)**
0.00155 0.00154 Distance is over 3000 km and Both Adopted Bitnet (0.000293)** (0.000415)**
0.000998 Distance is between 100 km and 500 km and Both Adopted Bitnet (0.000327)**
0.000559 Distance is between 500 km and 1000 km and Both Adopted Bitnet (0.000282)*
0.000977 0.000977 Distance is over 1000 km and Both Adopted Bitnet (0.000179)** (0.000179)**
0.0000555 0.0000547 0.0000546 0.0000409 Sum of # of single institution papers (0.0000128)** (0.0000128)** (0.0000128)** (0.0000181)* # of Observations 624,240 624,240 624,240 624,240 # of Groups 41,616 41,616 41,616 41,616 Regressions include year and institution-pair fixed effects
36
Table 8: Bitnet Adoption, Collaboration, Institution-Pair Quality, and Distance (1) (2) (3) (4) (5) (6) CT1 and CT1 CT1 and CT2 CT1 and CT3 CT2 and CT2 CT2 and CT3 CT3 and CT3
-0.0185585 0.0192371 0.0047773 -0.0001092 0.0017877 -0.0019598 Distance is under 100 km and Both Adopted Bitnet (0.0040994)** (0.0017607)** (0.0012771)** (0.0020295) (0.0009123)+ (0.0012405)
-0.0002927 0.0017980 0.0004862 0.0011414 -0.0002585 -0.0003163 Distance is between 100 km and 1000 km and Both Adopted Bitnet (0.0012254) (0.0004926)** (0.0003517) (0.0005452)* (0.0002358) (0.0003274)
0.0017114 0.0001902 0.0001236 0.0005301 0.0003123 -0.0005181 Distance is between 1000 km and 3000 km and Both Adopted Bitnet (0.0011632) (0.0004408) (0.0003262) (0.0004502) (0.0002021) (0.0003200)
0.0027329 0.0030993 0.0001980 -0.0001052 0.0004064 -0.0004536 Distance is over 3000 km and Both Adopted Bitnet (0.0014988)+ (0.0006476)** (0.0004367) (0.0007454) (0.0003035) (0.0003971)
0.0002157 -0.0000576 0.0000042 0.0002141 0.0000058 -0.0000334 Sum of # of single institution papers (0.0000487)** (0.0000270)* (0.0000163) (0.0000991)* (0.0000282) (0.0000273) # of Observations 81,900 99,225 190,575 29,295 114,345 108,900 # of Groups 5,460 6,615 12,705 1,953 7,623 7,260
Regressions include year and institution-pair fixed effects. CT1, CT2, and CT3 define the Carnegie Foundation’s rankings of research focus.
Figure 1 Cumulative Bitnet Adoption over Time
0
50
100
150
200
250
300
350
1981 1982 1983 1984 1985 1986 1987 1988 1989 1990 after1990
Num
ber o
f Uni
vers
ities
Con
nect
ed
CT1CT2
CT3
TOTAL