A trustworthy repository provides assurance in the form of management documents, event logs, and audit trails that digital objects are being managed correctly. The assurance includes plans for the sustainability of the repository, the accession of digital records, the management of techlogy evolution, and the mitigation of the risk of data loss. A detailed assessment is provided by the ISO-16363:2012 standard, Space data and information transfer systems-Audit and certification of trustworthy digital repositories. This book examines whether the ISO specification for trustworthiness can be enforced by computer actionable policies. An implementation of the policies is provided and the policies are sorted into categories for procedures to manage externally generated documents, specify repository parameters, specify preservation metadata attributes, specify audit mechanisms for all preservation actions, specify control of preservation operations, and control preservation properties as techlogy evolves. An application of the resulting procedures is made to enforce trustworthiness within National Science Foundation data management plans.
Dr. Reagan W. Moore is a professor in the School of Information and Library Science at the University of North Carolina, Chapel Hill, chief scientist for Data Intensive Cyber Environments at the Renaissance Computing Institute, and director of the Data Intensive Cyber Environments Center at University of North Carolina (UNC-CH). He coordinates research efforts in the development of data grids, digital libraries, and preservation environments. Developed software systems include the Storage Resource Broker data grid and the integrated Rule-Oriented Data System. Supported projects include the National Archives and Records Administration Transcontinental Persistent Archive Prototype and the science data grids for seismology, oceanography, climate, high-energy physics, astronomy, and bioinformatics. An ongoing research interest is the use of data-grid technology to automate the execution of management policies and validate the trustworthiness of repositories. Dr. Moore's previous roles include the following: director of the DICE group at the San Diego Supercomputer Center; and Manager of production services at SDSC. He previously worked as a computational plasma physicist, at General Atomics on equilibrium and stability of toroidal fusion devices. He has a Ph.D. in plasma physics from the University of California, San Diego (1978), and a B.S. in physics from the California Institute of Technology (1967). Dr. Hao Xu is a research scientist at the Data Intensive CyberEnvironment Center, University of North Carolina at Chapel Hill (UNC-CH). He has been working on improving the rule engine and the rule language, and the metadata catalog of the integrated Rule-Oriented Data System (iRODS) since 2010. He developed the pluggable rule engine architecture that allows interoperability between different programming languages and the iRODS data management systems. He also developed QueryArrow, a semantically unified query engine that allows bidirectional integration of metadata from multiple heterogeneous metadata sources. His research interests include the theory of data management, automatic theorem proving, programming languages, distributed data systems, and formal methods in software development. He has a B.E. in Computer Science and Engineering and a B.S. minor in Applied Mathematics from Beihang University, and a Ph.D in Compuer Science from UNC-CH. Mike Conway is a software developer with over 20 years experience in distributed systems development. Mike is currently finishing up his master's degree in Information Science at the School of Information and Library Science at the University of North Carolina at Chapel Hill (UNC-CH). Most recently, Mike has been an architect and developer on the iRODS data grid, as well as the DataNet Federation Consortium, developing interfaces and protocols to support distributed, policy-managed environments in support of scientific research and digital preservation. Mike is also a developer with the iRODS Consoritum, based at the Renaissance Computing Institute (RENCI) at UNC-CH, contributing to the continued development of the iRODS open source data grid. Mike has also contributed to research in the areas of metadata management, applied cyberinfrastructure, and trusted digitial preservation environments. Dr. Arcot Rajasekar is a professor in the School of Library and Information Science at the University of North Carolina, Chapel Hill (UNC-CH), and a chief scientist at the Renaissance Computing Institute (RENCI). Previously, he was at the San Diego Supercomputer Center at the University of California, San Diego, leading the Data Grids Technology Group. He has been involved in research and development of data grid middleware systems for over a decade and is a lead originator behind the concepts in the Storage Resource Broker (SRB) and the integrated Rule-Oriented Data Systems (iRODS), two premier data grid middleware systems developed by the Data Intensive Cyber Environments Group. Dr. Rajasekar has a
Arcot Rajasekar, Hao Xu, Helen Tibbo, Jon Crabtree, Mike Conway, Reagan W. Moore
Morgan & Claypool Publishers
Date of Publication
Computing: Textbooks & Study Guides
Synthesis Lectures on Information Concepts, Retrieval, and Services