Research Papers
Data Conservancy Provenance, Context, and Lineage Services: Key Components for Data Preservation and Curation
Authors:
Matthew S Mayernik,
National Center for Atmospheric Research, University Corporation for Atmospheric Research, Boulder, CO
Tim DiLauro,
Johns Hopkins University Sheridan Libraries, Baltimore, MD
Ruth Duerr,
National Snow and Ice Data Center, Boulder, CO
Elliot Metsger,
Johns Hopkins University Sheridan Libraries, Baltimore, MD
Anne E Thessen,
School of Life Sciences, Arizona State University, Phoenix, AZ
Abstract
Among the key services that institutional data management infrastructures must provide are provenance and lineage tracking and the ability to associate data with contextual information needed for understanding and use. These functionalities are critical for addressing a number of key issues faced by data collectors and users, including trust in data, results traceability, data transparency, and data citation support. In this paper, we describe the support for these services within the Data Conservancy Service (DCS) software. The DCS provenance, context, and lineage services cross the four layers in the DCS data curation stack model: storage, archiving, preservation, and curation.
How to Cite:
Mayernik, M.S., DiLauro, T., Duerr, R., Metsger, E., Thessen, A.E. and Choudhury, G.S., 2013. Data Conservancy Provenance, Context, and Lineage Services: Key Components for Data Preservation and Curation. Data Science Journal, 12, pp.158–171. DOI: http://doi.org/10.2481/dsj.12-039
Published on 17 Nov 2013.
Peer Reviewed