Abstract
This paper is based upon a lightening talk that was given on February 14, 2023 at the 2023 NISO Plus conference. The presentation described the research for the creation of a white paper on the use of blockchain technology along the scientific research workflow that will be released in early 2024 by the International Union of Pure and Applied Chemistry (IUPAC).
What the three-year study found is that the technology is indeed being used in almost all the steps in the scientific research workflow - from hypothesis development through to publication - by commercial organizations as well as by non-profits and across all market sectors, even governments. For example, the U.S Department of Health and Human Services (HHS) uses blockchain technology in a pilot program, the Grant-recipient Digital Dossier (GDD), to manage their grant program more efficiently. As of July 2021, GDD had reduced the time required to complete grant assessment tasks from four-plus-hours to a fifteen-minute process.
This paper briefly summarizes the findings, discusses the pros/cons of the technology, and provides a glimpse of how the technology is impacting the future of scientific research.
Keywords
Introduction
The development of the white paper was initiated because during the 2019 IUPAC World Chemistry Congress a representative from one of IUPAC’s Member Countries raised the question “What is Blockchain Technology?” They felt that it had become a prominent buzzword, not unlike Artificial Intelligence and asked if IUPAC could provide information on how it was impacting science in general and chemistry in particular. At that time, I was already personally fascinated by the technology and had written a paper on it the prior year for IUPAC’s news journal,
What is blockchain technology???
According to a report [1] from the National Institute of Standards and Technology (NIST), “Blockchains are immutable distributed digital ledger systems (i.e., without a central repository) and usually without a central authority. At its most basic level, they enable a community of users to record transactions in a ledger public to that community such that no transaction can be fraudulently changed once published". Indeed, an alternate name for blockchain is Distributed Ledger Technology (DLT), not unlike the double-entry book-keeping method that dates back to the fourteenth century. Basically, it is a technology platform that can be used to support many diverse applications. While this new technology gained popularity because of its use by those interested in cryptocurrencies, specifically Bitcoin, the technology is simply the engine “under the hood” of Bitcoin - an engine that can be used for other purposes as well. Indeed, “Blockchain is to Bitcoin, what the Internet is to email.
In fact, blockchain technology actually
Cc8be069bda4863a638d900f827235ef (hash for “I love green tea”)
The blocks record and confirm the time and sequence of transactions and each block contains a digital fingerprint or unique identifier called a “hash”, timestamped batches of recent valid transactions, and the hash of the previous block. The hash of the previous block links the blocks together and prevents any block from being altered or a block being inserted between two existing blocks. Each subsequent block strengthens the verification of the previous block and hence the entire blockchain. The method renders the blockchain tamper-evident, lending to the key attribute of immutability. The technology is important for many reasons, but the primary one is that it provides incontrovertible proof-of-creation of an idea, research data, etc. Once hashed to a blockchain and time-stamped, the data can be changed, but such changes are captured and time-stamped, making fraudulent tampering visible. This makes digital goods immutable, transparent, externally-provable, decentralized, and distributed.
Haber and Stornetta left Bellcore in 1994 to co-found a spin-off company, Surety, which provided time-stamping services based upon their algorithms and it was the first company to provide commercial blockchain-based services. Things remained quiet on the blockchain front until 2008 when Satoshi Nakamoto released a White Paper entitled, “Bitcoin: a peer-to-peer electronic cash system” [2] which proposed a system of electronic transactions that did not require a reliance upon trust - the middleman, in this case a bank, was removed. Haber and Stornetta’s seminal paper, “How to Time-Stamp a Digital Document” [3], is referenced in the Bitcoin White Paper.
Over the decade since Nakamoto built upon Haber and Stornetta’s work there have been a series of enhancements to the technology, primarily driven by the realization that it did
The Scientific research workflow
We started our journey by defining our view of the scientific research workflow - what areas did we want to examine to see how blockchain was being used and how well it was performing. We looked at five steps in the flow: Step #1: Develop Hypothesis/Define an idea; Step #2: Seek Funding; Step #3: Perform the Experiment/make observations; Step #4: Perform analysis/make insights; Step #5: Publish/share results. During our research over the past three years, we held about a dozen in-depth interviews with major global players across diverse disciplines who are successfully using blockchain technology for a variety of purposes in the scientific workflow and the white paper will provide details on these uses.
Step #1: Develop hypothesis/define an idea
The major use of blockchain in the first step is for the time-stamping of ideas - the purpose for which the technology was originally developed. It provides researchers with their proof-of-concept and intellectual property ownership and many organizations around the globe have been offering this service for years. All the providers of this service, just a few of which are listed below, work the same way.
ARTiFACTS https://artifacts.ai (basic system free to researchers) Bernstein https://www.Bernstein.io (can try for free) Bloxberg https://bloxberg.org (member-based global Consortium) Stampery https://www.stampery.com
You upload your information so that it can be “hashed” in order to create a tamper-proof timestamp for it. A hash is a concatenation of letters and numbers that uniquely identifies your asset - similar to a human fingerprint. The hash is uploaded to the blockchain, where it cannot be changed or deleted afterward. The time when the hash is added to the blockchain is your tamper-proof timestamp. The players listed above are based in Europe with the exception of ARTiFACTS - a U.S. company based in San Diego, CA, U.S.A. It was launched in 2018 as the first blockchain-based service for securing provenance (via time stamping) of unpublished research, making
Step #2: Seek funding
From what we found, blockchain is not widely-used as a funding mechanism for scientific research…
One of the new funding organizations is Molecule [5] - a collaborative platform that connects stakeholders primarily in drug development. Their main focus is allowing researchers to license their Intellectual Property to interested investors using IP-Non-fungible tokens. In August 2022 they announced that, for the first time, they would be funding drug discovery research based in the United States at the University of Washington in Seattle.
It should be noted that uses of blockchain technology on the
Step #3: Perform the experiment/make observations
It is in this area of the scientific research workflow that the inherent features of blockchain technology are invaluable. The following information can be captured in this step and if posted to a blockchain can make the research transparent and reproducible. (1)
(2)
(3)
(4)
(5)
As of this writing there are several organizations who offer services in support of the experimental phase of the scientific workflow. ARTiFACTS, which I already mentioned is one of them. Another is the Open Science Chain. Launched in 2018, this is a consortium blockchain that is funded by the U.S National Science Foundation (Award: 1840218 [7]) and is based at the San Diego Supercomputer Center at the University of California San Diego. Its main objective is to support data sharing to enable independent verification of scientific data and to foster reuse for the advancement of science [8]. Users can search, view, and validate scientific datasets, including lineage information and create “research workflows” linking one or more entries in the OSC blockchain and other repositories such as GitHub, thereby documenting an auditable record of the data workflow process behind the research hypothesis. The project has been very successful and, in the summer of 2021, received an additional half million dollars from the NSF for expansion. The service is free to members of the academic and research community.
Step #4: Perform analysis/make insights
The next step in the scientific workflow process involves taking the results from the experiment performed in Step #3 and performing one or more calculations, analyses, and visualizations on that data in order to generate conclusions, make insights, and to guide the next stage of the experimental, discovery cycle. In many respects step #4 is very similar to step #3. It executes an operation or an algorithm on experimental data that was generated
In step #4, we have to all intents and purposes, the same “actors and artefacts” as we had in step #3, namely: (i) the person who ran the analysis (the “who”); (ii) the methods they used (the “how”), which in this case comprises the software, algorithms, and mathematics they used; and (iii) the results, observations, and conclusions that were generated. There is a new Electronic Laboratory Notebook, Labii [9], that uses blockchain to secure research data, and in 2021 the National Institute of Standards and Technology (NIST) funded a research project for the development of a prototype Digital Research Notebook system called KnowLedger. The concept is to build a system where research data is captured semantically immediately at birth and stored on the blockchain, incrementally as knowledge is added. This is the brainchild of one of the authors of the blockchain white paper, Stuart Chalk, Professor of Chemical Informatics, University of North Florida, USA.
Step #5: Publish/share
The final step of the research workflow is that of sharing and or publishing research results. ARTiFACTS is a mainstream player in this space, providing services for publishers as well as authors. As an example, the
Other applications
Blockchain technology is also being used for related activities in research and science education, among them being those listed below:
Data sharing within closed networks/communities. Near real-time research tracking (e.g., Covid-19 tracking is done via blockchain by the U.S. Department of Health and Human Services; the same can be applied to any research having global priorities). Authentication of data at the source for auditing, compliance, and regulatory purposes. Equipment management (maintenance records, training, usage history, ownership history, etc.) - can even ensure compliance with the Good Manufacturing Practices (GMP) required by regulatory agencies. Supply-chain tracking for both tangible/physical and digital/data assets. Decentralized publishing/peer review. Research Funding/tracking. Crowd-sourcing/collaborative research (both public and within companies) where blockchain helps to confirm who owns what because it can demonstrate who did what. Degree and qualification certification. Identity certification of people and objects. Research data re-use (including sales of such data). Retrieval of chemicals/pharmaceuticals at the end of their life cycles or during their life cycle for re-purposing.
While the focus of our research was blockchain technology usage in science, our journey exposed us to its usage across many industries - finance, law, environment, global trade and commerce, insurance, real estate, media and entertainment, supply chain management, etc. A recent report from the Davos 2023 World Economic Forum states that “blockchain technology offers more promises than problems and that as a technology it will continue to grow exponentially, and its use cases expand. The real-world applications of blockchain, many already in use by organizations focused on international development, offer greater utility and cost savings” [11].
Bloxberg Consortium
One of the major examples of how far blockchain has become utilized in the global scientific community is the Bloxberg Consortium [12] that is under the umbrella of the Germany-based Max Planck Society. In response to requests from researchers working at its member institutes for time stamping of their research, the Max Planck Digital library built their own blockchain and launched the consortium in February 2019. They began with eleven members and there are now more than fifty members worldwide despite the pandemic - it is emerging as the global blockchain for science and aims to foster collaboration across the global scientific community. Organizations who do not want to build their own platform can utilize Bloxberg’s if they sign the Manifesto and abide by the governance rules. Commercial entities can also join the consortium, but they cannot serve as authority nodes. In May of 2022 Bloxberg members voted to approve what is called the “Bloxberg Association for the Advancement of Blockchain in Science” that will be founded under German legislation this year. They also agreed to introduce tokenomics to ensure sustainable funding of the infrastructure [13]. Keep your eyes on them!
Failed projects
During the course of the many interviews that we held with active players in this space, we found that blockchain technology is not always the perfect solution for a specific problem. Two notable examples are the Blockchain for Peer Review Project initiated by Digital Science that failed supposedly because a solution to the problem already existed in another form. The other was a project that was being conducted by a group within Elsevier’s Healthcare Education Group that failed because users of the product that they were designing wanted to use it for clinical registry and one of the key attributes of blockchain - its immutability - ran afoul of the laws around “the right to forget and the right to be forgotten” which vary from country-to-country. The white paper will cover these in detail.
Lessons learned
Indeed, the people we interviewed discussed many of the lessons that they learned in their use of blockchain technology. Key are the following:
Blockchain Is not the perfect solution for every problem. Potential users need to understand what they want to accomplish. Administrative use cases can really benefit from blockchain technology. Its use requires learning and thinking differently so that you can see how it fits in – it needs a champion! Blockchain applications need to be seamless and embedded in the systems that researchers already are using. Use of blockchain requires some learning and a new mindset
When we interviewed Dr. Naseem Naqvi, Editor-in-Chief for
The future
What we learned from our interviews is that usage of the technology is growing in general and is accelerating in the life and health sciences. We found that all parts of the research cycle
While the scope of blockchain technology’s impact is hard to predict since it’s in the early stages of broad adoption, we believe that its future looks bright. It was selected as one of the Top Ten Emerging Technologies in Chemistry in 2021by IUPAC and a recent report from the European Chemical Industry Council, “Artificial Intelligence and Blockchain: Insights and Actions for the Chemical Industry”, boldly states that blockchain technology holds the potential for disruption across the chemical enterprise [15]. Only the future will tell. Our goal is to update the white paper at regular intervals as the technology is enhanced and new use cases and new players emerge.
White paper authors
In closing, I want to recognize my fellow co-authors of the white paper who have worked tirelessly over the past three years to make the white4 paper a reality. Because of them our three year journey has been a pleasurable adventure:
