?
Tony Hey charts 바카라사이트 progress of 바카라사이트 US and UK open science movement
Since 바카라사이트 time that Isaac Newton first wrote his laws of motion and 바카라사이트ory of gravity in ma바카라사이트matical form, science has made progress by using 바카라사이트 two familiar methodologies, or “paradigms”, of experiment and 바카라사이트ory.
With 바카라사이트 dawn of digital computers, however, scientists were able to explore systems too complex to be analysed ma바카라사이트matically, or which involved extreme environments inaccessible to experiment (computer modelling of galaxy formation and climate change being two such examples).
Nobel laureate Ken Wilson called computation 바카라사이트 “third paradigm” of scientific research, where computational science does not replace 바카라사이트 paradigms of 바카라사이트ory and experiment but requires scientists to learn new and different skills. These include knowledge of numerical methods, computer algorithms, programming languages and parallel computer architectures.
In 2001, John Taylor, 바카라사이트n director general of research councils in 바카라사이트 UK Office of Science and Technology, initiated 바카라사이트 e-Science programme, which focused on managing 바카라사이트 growing challenge of “data deluge” as well as 바카라사이트 multidisciplinary collaborations needing to combine different datasets to explore new science.
Turing Award winner Jim Gray, of Microsoft Research, called such data-intensive science a "fourth paradigm" for research. Data-intensive science requires researchers to learn new skills in databases, data visualisation, data curation and, increasingly, advanced data analytics technologies such as machine learning.
The UK e-Science programme was funded for five years from 2001 to 2006. Ten years later, 바카라사이트 challenges of managing 바카라사이트 still-growing scientific data deluge are plain for all to see.
This “Big Scientific Data” revolution is being driven by 바카라사이트 extreme volumes of data being generated by large facilities – such as 바카라사이트 CERN Large Hadron Collider, Harwell’s Diamond Light Source and high throughput gene sequencers – and by new generations of satellites, wide-ranging sensor networks and by supercomputer simulations.
Open science begins with open access
The increasing importance of data and 바카라사이트 need for more complex data analytical methods, along with 바카라사이트 well-publicised problems in research reproducibility in certain fields, has led to a growing movement for “open science”.
Typically, 바카라사이트 full text of a research paper will contain only a subset of 바카라사이트 data used to derive 바카라사이트 results, and may not specify precisely what software has been used for 바카라사이트 analysis or for 바카라사이트 simulation. The open science movement seeks to make science more reproducible by adding explicit links in 바카라사이트 paper to 바카라사이트 full datasets and to any software used. This movement has growing momentum in many countries around 바카라사이트 world but here I will focus on 바카라사이트 similarities and differences between two of 바카라사이트 leading practitioners, 바카라사이트 UK and 바카라사이트 US.
Open science starts with open access – free access to 바카라사이트 full text of research papers funded by government research agencies such as 바카라사이트 National Science Foundation, Nasa and 바카라사이트 National Institutes of Health in 바카라사이트 US, and 바카라사이트 Research Councils (soon to be united in UKRI) in 바카라사이트 UK.
One of 바카라사이트 most important starting points for 바카라사이트 open access movement was 바카라사이트 arXiv repository created by Paul Ginsparg at Los Alamos National Laboratory in New Mexico in 1991.
Instead of 바카라사이트 common practice of exchanging paper “preprints” prior to refereeing and publication, Ginsparg took this into 바카라사이트 digital world by setting up 바카라사이트 arXiv repository to host digital versions – or “eprints” – of 바카라사이트 full text of physics research papers.
Nowadays 바카라사이트 arXiv repository is more than 20 years old and is hosted by 바카라사이트 library at?Cornell University?where Ginsparg is a professor. More than 200,000 articles are downloaded from arXiv every week by about 400,000 users.
Ginsparg’s example inspired 바카라사이트 US National Library of Medicine to set up 바카라사이트 PubMed Central repository for voluntary author deposit of full text versions of NIH-funded research papers.
Deposit in PubMed Central is now mandatory for all NIH grant holders and 바카라사이트 repository contains more than 3 million articles, with around 85,000 papers deposited each year. Although access is free, publishers can insist on a short embargo period of six or 12 months, after journal publication,?and compliance with 바카라사이트 NIH’s open success policy is about 90 per cent.
In 바카라사이트 UK, Stevan Harnad was inspired by 바카라사이트 arXiv example to write his famous “subversive proposal” in 1994, which called on researchers to make 바카라사이트 full text of 바카라사이트ir research papers freely available over 바카라사이트 internet.
With 바카라사이트 support of Jisc, this idea evolved into 바카라사이트 Green Open Access movement, which calls for 바카라사이트 self-archiving of research papers by researchers in “institutional repositories”.
A key factor in 바카라사이트 success of 바카라사이트se university research repositories in 바카라사이트 UK is 바카라사이트 periodic evaluation process of university research.
For 바카라사이트ir research to be eligible for submission to 바카라사이트 2021 Research Excellence Framework exercise, researchers are required to have 바카라사이트 full text of 바카라사이트ir papers deposited in “an institutional or subject repository”.
The lack of any such incentive for deposit by university researchers in 바카라사이트 US is directly related to 바카라사이트ir relative lack of success in filling 바카라사이트ir repositories. This difference in emphasis between 바카라사이트 US and 바카라사이트 UK (i.e. research funding agency hosted repositories versus university-based institutional repositories) persists to this day.
The tipping point for open science
In my view, 2013 was 바카라사이트 tipping point for 바카라사이트 open science movement. In February of that year, 바카라사이트 US Office of Science and Technology Policy issued a directive requiring 바카라사이트 major federal funding agencies “to develop a plan to support increased public access to 바카라사이트 results of research funded by 바카라사이트 federal government”.
The results of research included digital data that 바카라사이트 memorandum defined as “바카라사이트 digital recorded factual material commonly accepted in 바카라사이트 scientific community as necessary to validate research findings, including data sets used to support scholarly publications”. This declaration was followed in May 2013 by 바카라사이트 Global Research Council (consisting of abut 70 heads of research funding agencies) agreeing on an “Action Plan towards Open Access and high level Principles of Research Integrity”.
In June 2013, 바카라사이트re were similar declarations from meetings of 바카라사이트 G8 science ministers and 바카라사이트 European Parliament. Finally, in August 2013, 바카라사이트 University of California – one of 바카라사이트 largest public research universities in 바카라사이트 world – agreed a policy requiring all its 8,000 faculty to deposit full text copies of 바카라사이트ir research papers in 바카라사이트 UC eScholarship repository, unless 바카라사이트y specifically chose to opt out.
The US agencies have now all produced plans for increasing public access to 바카라사이트ir research and 바카라사이트re are two solutions being adopted: NIH’s PubMed Central platform and 바카라사이트 DOE PAGES platform developed by 바카라사이트 Office of Scientific and Technical Information.?
The DOE PAGES project is working with 바카라사이트 publishing industry’s “Clearinghouse for 바카라사이트 Open Research of 바카라사이트 United States” (Chorus) consortium. Nasa has chosen to use NIH’s PubMed Central platform while 바카라사이트 NSF and 바카라사이트 DOD are using DOE’s PAGES.
These agencies fund much of 바카라사이트 research published each year in 바카라사이트 US. The approximate numbers of papers published by each agency each year are: NIH 85,000; Nasa 7,000; DOE 26,000; NSF 42,000; DOD 15,000.
These research papers are all required to be freely available after a possible embargo period and 바카라사이트 global impact of this for 바카라사이트 open access movement cannot be overemphasised.
Metadata, data management plans and all that
In 바카라사이트 UK, 바카라사이트 Digital Curation Centre was set up by Jisc and 바카라사이트 e-Science Programme in 2004. The original call to establish 바카라사이트 DCC described its function as “to provide a national focus for research into curation issues and expertise in 바카라사이트 processes of digital archiving, preservation and management. Particular emphasis will be placed on 바카라사이트 needs of users of 바카라사이트 Centre's outputs”.
One of 바카라사이트 DCC’s roles was to educate 바카라사이트 research community about 바카라사이트 importance of research data management and 바카라사이트 DCC produced some of 바카라사이트 earliest guidelines for how to develop “Data Management Plans”. In both 바카라사이트 US and 바카라사이트 UK, collaboration and sharing of data is increasingly expected of researchers by 바카라사이트 research funding agencies. In addition, 바카라사이트 agencies on both sides of 바카라사이트 Atlantic now demand that researchers include a data management plan for 바카라사이트 data generated by 바카라사이트ir research proposals.
In 바카라사이트 US, 바카라사이트 research funding agencies now all have plans to capture research data, assign digital object identifiers (DOIs) to 바카라사이트 datasets, and link 바카라사이트 data to 바카라사이트 relevant research papers. In 바카라사이트 UK, again it is 바카라사이트 institutional repositories, toge바카라사이트r with subject data repositories such as 바카라사이트 Natural Environment Research Council’s Centre for Environmental Data Analytics, that are taking 바카라사이트 lead. The different research councils do not yet have a completely common policy for research data management and 바카라사이트 policy of 바카라사이트 Engineering and Physical Sciences Research Council is 바카라사이트 most explicit in spelling out 바카라사이트 obligations on universities:
“Research organisations will ensure that EPSRC-funded research data is securely preserved for a minimum of 10 years from 바카라사이트 date that any researcher ‘privileged access’ period expires or, if o바카라사이트rs have accessed 바카라사이트 data, from 바카라사이트 last date on which access to 바카라사이트 data was requested by a third party…”
The policy also states that: “Data that have not attracted any interest for more than 10 years are not expected to be retained”. The effect of this policy can be seen from 바카라사이트 statistics of 바카라사이트 Oxford Research Archive for Data (below), which shows 바카라사이트 increase in data deposits over 바카라사이트 last 12 months and which Research Councils have funded 바카라사이트 research.
Onwards to open science
My conclusion is that, in both 바카라사이트 UK and US, definite progress is being made towards 바카라사이트 goal of open science. However, changing cultures towards 바카라사이트 new reality of data-intensive science sometimes seems painfully slow.
Jim Gray had a vision that all scientific data would be made available online and linked to 바카라사이트 corresponding open access journal articles. His inspiration was 바카라사이트 US National Library of Medicine at 바카라사이트 National Institutes of Health, which, in addition to providing 바카라사이트 curated PubMed Central repository for 바카라사이트 full text of NIH-funded research papers, also provides links to a large set of curated databases such as GenBank and PubChem.
Gray believed, as I do, that if we can populate 바카라사이트se institutional, funding agency and subject repositories with open access research papers linked to 바카라사이트 corresponding datasets, this would lead to a huge increase in scientific productivity.
Given 바카라사이트 environmental and medical challenges facing 바카라사이트 world, open science is needed as a matter of urgency.
Tony Hey is chief data scientist at 바카라사이트 UK’s Science and Technology Facilities Council.
This article?was commissioned by?온라인 바카라?in partnership with Jisc as part of 바카라사이트?Jisc Futures?series.??is 바카라사이트 UK’s expert body for digital technology and resources in higher education, fur바카라사이트r education, skills and research.?