Embedding data provenance into the Learning Health System to facilitate reproducible research
Data provenance supporting reproducibility in the LHS release_ym2kisnjdfgzpfzon6s4256o44

by Vasa Curcin

Published in Learning Health Systems by Wiley.

2016   Volume 1, Issue 2, e10019

Abstract

The learning health system (LHS) community has taken up the challenge of bringing the complex relationship between clinical research and practice into this brave new world. At the heart of the LHS vision is the notion of routine capture, transformation, and dissemination of data and knowledge, with various use cases, such as clinical studies, quality improvement initiatives, and decision support, constructed on top of specific routes that the data is taking through the system. In order to stop this increased data volume and analytical complexity from obfuscating the research process, it is essential to establish trust in the system through implementing reproducibility and auditability throughout the workflow. Data provenance technologies can automatically capture the trace of the research task and resulting data, thereby facilitating reproducible research. While some computational domains, such as bioinformatics, have embraced the technology through provenance-enabled execution middlewares, disciplines based on distributed, heterogeneous software, such as medical research, are only starting on the road to adoption, motivated by the institutional pressures to improve transparency and reproducibility. Guided by the experiences of the TRANSFoRm project, we present the opportunities that data provenance offers to the LHS community. We illustrate how provenance can facilitate documenting 21 CFR Part 11 compliance for Food and Drug Administration submissions and provide auditability for decisions made by the decision support tools and discuss the transformational effect of routine provenance capture on data privacy, study reporting, and publishing medical research. If the scaling up of the LHS is to succeed, we have to embed mechanisms to verify trust in the system inside our research instruments. In the research world increasingly reliant on electronic tools, provenance gives us a lingua franca to achieve traceability, which we have shown to be essential to building these mechanisms. To realize the vision of making computable provenance a feasible approach to implementing reproducibility in the LHS, we have to provide viable mechanisms for adoption. These include defining meaningful provenance models for problem domains and also introducing provenance support to existing tools in a minimally invasive manner.
In text/plain format

Archived Files and Locations

application/pdf  1.0 MB
file_anfabusmnnc6zhb3msx7y6fc6u
kclpure.kcl.ac.uk (web)
web.archive.org (webarchive)
application/pdf  1.0 MB
file_3ny57k6jfzgjjg6lye2mnverjy
kclpure.kcl.ac.uk (web)
web.archive.org (webarchive)
Read Archived PDF
Preserved and Accessible
Type  article-journal
Stage   published
Date   2016-12-27
Language   en ?
DOI  10.1002/lrh2.10019
PubMed  31245557
PMC  PMC6516719
Container Metadata
Open Access Publication
In DOAJ
In ISSN ROAD
In Keepers Registry
ISSN-L:  2379-6146
Work Entity
access all versions, variants, and formats of this works (eg, pre-prints)
Catalog Record
Revision: 9822ddc1-35f7-4ac5-866a-2414e12db6ba
API URL: JSON