Open Data and Open Code for Big Science of Science Studies

dc.contributor.author: Light, Robert P.
dc.contributor.author: Polley, David E.
dc.contributor.author: Börner, Katy
dc.date.accessioned: 2015-09-22T17:28:35Z
dc.date.available: 2015-09-22T17:28:35Z
dc.date.issued: 2013
dc.description.abstract: Historically, science of science studies were/are performed by single investigators or small teams. As the size and complexity of data sets and analyses scales up, a “Big Science” approach (Price, 1963) is required that exploits the expertise and resources of interdisciplinary teams spanning academic, government, and industry boundaries. Big science of science studies utilize “big data”, i.e., large, complex, diverse, longitudinal, and/or distributed datasets that might be owned by different stakeholders. They apply a systems science approach to uncover hidden patterns, bursts of activity, correlations, and laws. They make available open data and open code in support of replication of results, iterative refinement of approaches and tools, and education. This paper introduces a database-tool infrastructure that was designed to support big science of science studies. The open access Scholarly Database (SDB) (http://sdb.cns.iu.edu) provides easy access to 26 million paper, patent, grant, and clinical trial records. The open source Science of Science (Sci2) tool (http://sci2.cns.iu.edu) supports temporal, geospatial, topical, and network studies. The scalability of the infrastructure is examined. Results show that temporal analyses scale linearly with the number of records and file size, while the geospatial algorithm showed quadratic growth. The number of edges rather than nodes determined performance for network based algorithms. [en_US]
dc.identifier.citation: Light, Robert, David E. Polley, and Katy Börner. 2013. "Open Data and Open Code for Big Science of Science Studies". Proceedings of International Society of Scientometrics and Informetrics Conference 2013 2: 1342-1356. [en_US]
dc.identifier.uri: https://hdl.handle.net/1805/7012
dc.language.iso: en_US [en_US]
dc.subject.lcsh: Big data [en_US]
dc.subject.lcsh: Open access publishing [en_US]
dc.subject.lcsh: Open source software [en_US]
dc.title: Open Data and Open Code for Big Science of Science Studies [en_US]
dc.type: Article [en_US]
Files
Original bundle
Name: Open Data and Open Code for Big Science of Science Studies.pdf
Size: 793.26 KB
Format: Adobe Portable Document Format
License bundle
Name: license.txt
Size: 1.96 KB
Format: Item-specific license agreed upon to submission
Description: