Digital Preservation - The Planets Way
Return to the main page for this event
Abstracts Day 1
Introduction to Digital Preservation: Why preserve?
Ross King, Austrian Research Centers
We are experiencing an explosion of digital information. An estimated 1800 Exabytes of digital information will be created, captured, and replicated worldwide by the end of 2011. Unfortunately, digital data is as transient as it is ubiquitous. There are two well-known problems associated with the long-term storage and access of digital data; the bit-stream preservation problem, and the logical preservation problem. Bit-stream preservation addresses the problem of storage media obsolescence and degradation over time. Logical preservation addresses the problem of accessing bitstreams, whose interpretation may depend on obsolete operating systems, applications, or formats. The concept of "Digital Preservation" includes the standards, best-practices, and technologies utilised in order to ensure access to digital information over time.
The potential market for digital preservation is enormous - from government, to industry, to private individuals. There are also numerous legal mandates or incentives for implementing digital preservation, from legal deposit laws to protection of intellectual property. The primary barrier to the adoption of digital preservation principles is the short-term planning that characterises today's market. In order to overcome this, we should approach digital preservation as a risk-management methodology that avoids future liabilities, rather than a product with an expected return on investment. If we can reasonably estimate the losses that can be avoided through proper risk management, we can justify the investment in long-term digital preservation practices.
The Preservation Action Cycle: Introduction to Planets
Clive Billenness, British Library
This session will provide an executive overview of the activities are required in order to effectively preserve digital materials. It will relate it to ISO and British Standards on the archival of information and will consider how it fits into an organisation's wider Business Risk context. The presenter will then show in overview how these standard approaches are reflected in Planets. This will enable delegates to put into context the more detailed examination of Planets tools and services in the following sessions, and also enable them to enter into a dialogue with their own Business Risk Managers about Digital Preservation.
Preserving Digital Content
Volker Heydegger, University at Cologne
While there are parallels between preservation of analogue and digital objects, the nature of each is at the same time fundamentally different. What is digital content in general? What is digital content in the context of specific preservation tasks? What is the relationship between information and its digital representation?
This session discusses such questions and explains the elements that make up digital preservation, especially with respect to preservation of digital content. It explains how the main preservation tasks deal with digital content in general and the technical problems which can occur during individual stages of the preservation process.
Digital Preservation: How to Preserve
Sara van Bussel, The National Library of The Netherlands
Planets Preservation Actions provides solutions for making digital objects available. Both migration and emulation tools are being developed and/or adapted to be used in the Planets environment. At the same time gaps in the tool provision are being tracked. Preservation tools are described in the tool registry as part of the enhanced Pronom format registry. This gives users of Planets registry the capability to identify, compare, deploy and invoke the most appropriate tools or services. The future of preservation action is steered by ongoing research and investigation of emerging technologies.
Tools: How to Understand Files
Jan Schnasse, University at Cologne
Understanding file content depends on the tools used to interpret the bit-stream. In the best case, it is possible to completely render an environment and to view the file as it was originally intended. One degree of complexity below there are tools which interpret only certain parts of the bit-stream to allow a human user or a machine to characterise the files. Within the currently discussed preservation scenarios of migration or emulation, characterisation tools can play an enormous role. Most characterisation tools focus on the extraction of technical metadata and format-specific properties. The Planets project has developed a high-end characterisation approach that goes far beyond that. The Extensible Characterisation Language (XCL) provides a file format description language as well as a general container format for file characterisation.
After an overview of the state of the art of characterisation tools this lecture will give a general introduction into the XCL approach.
Digital Preservation: How to Verify
Petra Helwig, The National Archives of the Netherlands
This session explains how the Planets Testbed, Corpora and workflow can be used to support digital preservation activities.
Why do we have to plan preservation solutions? and Digital preservation: How to Plan
Christoph Becker, Vienna University of Technology
The rapid technological changes in today's information landscape have considerably turned the preservation of digital information into a pressing challenge. A lot of different strategies, i.e. preservation actions, have been proposed to tackle this challenge. However, which strategy to choose, and subsequently which tools to select to implement it, poses significant challenges. The creation of a concrete plan for preserving an institution's collection of digital objects requires the evaluation of possible preservation solutions against clearly defined and measurable criteria. Preservation planning aids in this decision-making process to find the best preservation strategy considering the institution's requirements, the planning context and possible actions applicable to the objects contained in the repository. Performed manually, even evaluating a rather small number of possible solutions against requirements takes a good deal of time. Plato, a web-based, interactive software tool, supports and partly automates this process.
This series of presentations and exercises will
- Discuss the needs of preservation planning,
- Review the preservation planning methodology and workflow,
- Show how to quantify and measure requirements,
- Discuss examples coming from case studies,
- Demonstrate how Planets tools and services aid in the requirements definition and evaluation process,
- Utilise the range of services and tools Planets is delivering, and
- Engage participants in group discussions on requirements for selected digital objects.
Tools: How to Integrate the Components of Digital Preservation
Ross King, Austrian Research Centers
The Planets approach to digital preservation is driven by the requirements of memory institutions, primarily national libraries and archives. These institutions generally already have archiving systems in place, which are often custom solutions or based on commercial tools. Replacing such systems is neither feasible nor desirable. Therefore the Planets preservation suite was designed to run in parallel with existing archive systems; it is neither meant to replace these, nor to provide archiving functionality.
We will describe both conceptually and technically how a number of processes or workflows within an OAIS-compliant archive can be supported by Planets software: the ingest process, which can be customised with various tools for the identification, validation, characterisation, and normalisation of incoming digital objects; the access process, which can include dynamic transformations to target formats (the migration approach) or the invocation of viewers (the emulation approach) for delivering data information packages; and complex preservation plans, the result of the preservation planning process, which can be carried out on a selection of archive objects.
Case Study
Barbara Sierman, The National Library of The Netherlands
Since 2003 the KB, National Library of the Netherlands, stores digital publications into the e-Depot, the digital archive for long term preservation. Participating in the Planets project offered an opportunity to help creating practical tools and services that will support our long term preservation. This presentation will give an overview of our approach to make use of the Planets products and to integrate them in the current and future environment of the KB.
Abstracts Day 2
Introduction to the Digital Preservation Scenario and to a "real collection".
Vittore Casarosa, HATII at the University of Glasgow
The day will begin with an introduction to the practical scenario to demonstrate how to preserve a sample collection.
Preservation Planning with Planets
Hannes Kulovits and Christoph Becker, Vienna University of Technology
Practical exercise: a guided walk-through of the first three steps of the preservation planning workflow using Plato and leading to the definition of an objective tree.
Characterisation of Digital Documents
Volker Heydegger and Jan Schnasse, University at Cologne
The first part of this session explains how the content of file formats can be described using the XCL approach. This includes a general discussion of the most important characteristics of file formats and their different representations in file formats. The XCDL way of content representation is finally explained drawing on examples, especially in relation to the main purpose of XCL, evaluation of file format migration.
Practical exercise: a demonstration of an application of the XCL approach with a specific scenario. The Extensible Characterisation Extraction Language (XCEL) will be explained using examples.
Preservation Actions
Sara van Bussel, The National Library of The Netherlands
This session explains the available strategies (migration and emulation), the tools needed and the environments where Planets will be most useful.
Practical exercise: how to use Planets preservation actions with documents extracted from the sample collection. Evaluate tools with a special focus on their fitness for long-term preservation.
Benchmarking Preservation Tools: the Testbed Environment
Petra Helwig, The National Archives of the Netherlands
and
Brian Aitken, HATII at the University of Glasgow
This session provides a detailed presentation of the functionality and components of the Testbed, and the use of the Corpus and the Registry will be provided, together with an explanation of the Testbed Workflow and the use of the Testbed in a real organisation.
Practical Exercise: how to use some of the tools seen in the previous sessions in the Testbed environment.
Abstracts Day 3
Finalising a Preservation Plan
Hannes Kulovits, Vienna University of Technology
Practical exercise: Preparation of a complete preservation plan for the test collection, using the objective tree defined on the previous day to perform preservation actions on the selected objects; evaluate the results using characterisation tools presented on the previous day. Discussion and evaluation of the completed plan with the audience.
Validating the preservation plan with the Testbed tool
Brian Aitken, HATII at the University of Glasgow
Practical exercise: part of the preservation plan developed in the previous session is tested, in order to evaluate the practical aspects of the plan and, if needed, to test possible alternatives.
Experiments are performed using the selected tools and criteria elaborated in the previous slot. The experiment is executed in the Testbed.
Pulling it all together: Implementing Digital Preservation using the Planets Interoperability Framework
Clive Billenness, British Library
This session provides an overview of the Planets Installation Package, the Workflow Design Tool and the Security aspects (authentication and authorisation).