◀ 1.2. Semantic Desktop for ForgetIT

In the following, we want to point out the contribution of the Semantic Desktop approach to the goals of ForgetIT. This section is an excerpt from the ForgetIT deliverable D9.1 (Section 2). Please refer there for full coverage and references).

1.2.1 Current State of Personal Preservation and Main Obstacles

When considering preservation for personal, non-professional, or home usage, we have to contend with a vast increase in ways to create digital artefacts (computers, smartphones, tablets, digital cameras, ...) as well as an ever increasing amount of storage for this digital material. These days, users’ personal information space consists of a substantial number of information objects connected to the person’s life such as wedding videos, travel pictures, or graduation keepsakes. It requires serious dedication and cognitive effort to organize all this data and keep it accessible as time passes.

Moreover, these digital artefacts often represent past moments but are not associated with a physical memento. Therefore, they form a valuable resource for the user and future generations. If the material is lost or corrupted due to improper conservation, it will be useless. Most users still use backups as their main form of preservation.

In addition, many people keep everything. Five main reasons are pointed out (see [Marshall, 2011]):

This recommendation leaves all steps to the user, i.e., what to save, how to organize, where to store (hard disk and online storage), and when to migrate. All this is a lot of effort for users, various decisions need to be made and it requires discipline in, e.g., maintaining and updating the archive. Creating a structure for preservation in particular is one of the major problems. Every person who adds material has to follow this structure every time further material is added, and people who want to search for files need to be aware of this structure. After a long period of time, someone else such as descendants need to be able to interpret the structure.

This cognitive up-front effort is one of the reasons why the cloud storage offered by Drop- Box, Microsoft SkyDrive, or Google Drive is not a preservation system in itself, but only a tool in a larger preservation strategy. Started as syncing, file sharing, and backup solutions, those services offer organisation methods such as file folders or (keyword-) tags, but do not comply with the OAIS standard. Other services, such as Amazon Cloud, comply with OAIS, but do not support users before ingesting data into the store. Either way, users are left to their own devices for large parts of the preservation process.

Considering the current state of personal preservation, the main obstacles we see so far are:

1.2.2 How does ForgetIT address this?

Within the ForgetIT framework, we will address these challenges as follows.

1.2.2 Motivation for Personal Information Management using the Semantic Desktop Approach in ForgetIT

While some users are concerned about preservation, it is not part of most users’ regular practice. Preservation requires manual effort and the users need to think about it to actually do it, it poses a cognitive burden on the users.

Therefore, the approach envisioned in ForgetIT for Personal Preservation is to embed it in the user’s activities in the personal information space in order to collect material to be preserved, evidence for preservation values, and triggers for preservation while keeping user involvement minimal.

But how can this be achieved? By concentrating on the Personal Information Management (PIM) of users, we can cover various life events together with associated digital material, usage of the digital material, and evidence for preservation values. For example, we can detect whether a file is only relevant for a certain time frame (such as time tables) or has emotional relevance (such as a picture showing the user’s daughter). Furthermore, it is a chance to derive the user’s mental model on the contents of the material, and thus, get a means to describe the preserved material from a user’s point of view with less effort.

By providing an ecosystem for PIM we can show that collecting material and deriving evidence for preservation is possible. Motivated by the research done in the Semantic Desktop field, by using the Semantic Desktop paradigm in ForgetIT we can

1.2.4 How will the Semantic Desktop approach contribute to ForgetIT?

By using a Semantic Desktop approach in ForgetIT, we can support Preservation, Forgetting, and Remembering as follows:

Preservation: The Semantic Desktop ecosystem (applications, plug-ins, mobile apps) allows us to connect the PIMO to the user’s information objects through annotating photos and web pages, organizing documents and emails, and managing tasks and reminders. Information objects are connected by reusing concepts such as contacts, which are part of the PIMO, for annotating pictures and writing emails. The resulting personal information space tightly links resources and concepts. Evidence for preservation values and context for preserving an information object can be derived from this information and formalised using the PIMO knowledge representation. Importantly, the continuously evolving PIMO not only covers information objects in current use but also objects which have already been stored in the archive for later use and are therefore no longer directly accessible for the user.

Forgetting: The data about information objects in the Semantic Desktop ecosystem that is held together by the PIMO provides evidence for preservation value, topical and long-term relevance. Observations in the PIMO are similar to files on the computer. For example, while topics of previous projects might still be relevant to the user, most of the associated resources, such as meetings, notes, and presentations, might no longer be of interest. The PIMO and its ecosystem therefore provide crucial input for the managed forgetting system.

Remembering: Just like the human brain, the PIMO is still capable to retrieve things which seem to be forgotten. Similar to humans, who can remember things or situations by starting with a cue and then follow associations, PIMO can provide paths through the semantic graph that start from a particular node. For example starting from a project (ForgetIT), we can follow a path to an associated event (the Kick-off Meeting in Hanover) to a photo (the group in front of the town hall) to a person (the professor from UEDIN). At each node along the path, the links from the node to other concepts provide the context required to remember. Thus, the PIMO contributes to contextualized remembering.

1.2.5 Semantic Desktop as Active System in the Preserve-or-Forget (PoF) Framework

The Personal Preservation Pilot is an implementation of the Preserve-or- Forget (PoF) Framework (see Figure below; the technical details are explained in deliverable D8.6) with the Semantic Desktop as Active System and is built in accordance with the PoF Reference Model as explained in deliverable D8.5.

ForgetIT Preserve-or-Forget (PoF) Framework architecture diagram
Architecture diagram of the Preserve-or-Forget (PoF) Framework (taken from D8.5 PoF Reference Model) with the Semantic Desktop as Active System.

The pilot was made possible by the close cooperation of WP9 with all ForgetIT work packages. This resulted in successfully deploying and running of the pilot connected to ForgetIT’s Preserve-or-Forget (PoF) Middleware components and the Digital Preservation System which resembles the architecture depicted above. Furthermore, several components from other ForgetIT work packages are used in the pilot. This allows for preservation of content on the computer with connection to the Semantic Desktop infrastructure as well as restoring from the archive.

In 10.4. Overcoming Obstacles of Personal Preservation, we present how the Semantic Desktop approach overcomes the obstacles for Personal Preservation.