Documents and deliverables
Content aggregation: tools & guidelines
Share on facebookShare on printShare on twitterShare on googleMore Sharing Services
Home > DOCUMENTS AND DELIVERABLES
Documents and deliverables
May 2011. DC-NET/Linked Heritage Position Paper (7 points) (PDF, 174 kb )
Deliverable number and title
D1.1.1 Working Groups Terms of Reference (PDF, 989 kb)
These Terms of Reference give a short description of the Linked Heritage best practice network and outline the model and procedures for establishing and managing working groups (WGs), each of which focuses on a specific aspect of Linked Heritage. This document is targeted to (a) Linked Heritage stakeholders who will find guidelines on how to set up a working group and (b) to working group members who will be informed about the running of a working group. The goal of these Terms of Reference is to ensure that all WGs have common procedures and adhere to a common model, in order that their results can be integrated across the project and to facilitate project management.
D2.1 Best practice report on cultural heritage linked data and metadata standards (PDF, 992 kb)
This deliverable has three roles in the project: 1) Educate the partners, and the wider cultural heritage community, about linked data. This includes linked data‟s associated technical standards; 2) Give advice based on the use of linked data in the cultural heritage community; 3) Inform the subsequent work of WP 2 in the rest of the project.
D2.2 State of the art report on persistent identifier standards and management tools (PDF, 1016 kb)
This deliverable is first of two deliverables which are the outcomes of Task 2.2 – Resource identification. Ithas three roles in the project: 1) Educate the partners, and the wider cultural heritage community, about persistent identifiers; 2) Give best practice advice based on the use of persistent identifiers in the cultural heritage ommunity, and in particular their use in the context of linked data; 3) Inform the subsequent work of WP 2 in the rest of the project.
D3.1 Best practice report – Terminology (PDF, 3451 kb)
This report is the first deliverable of Work Package 3 (WP3). This work package is dedicated to terminology management and multilingualism within the Linked Heritage project. In this document you will find general and specific information about the objectives and work done by WP3 within the Linked Heritage project. This first deliverable is a best practice report on terminology and provides a state of the art of the terminology resources in use by cultural heritage institutions and the private publishing sector providing digital content through Linked Heritage to Europeana, as well as a reference to some important terminology resources developed or in use by other European projects. A set of recommendations will be provided on the basis of this state of the art and key issues will be raised.
D3.3 Terminology management & Terminology Registry
D4.1 Best practice report – Public Private Partnership (PDF, 3162 kb )
This report provides an introduction to the existing commercial sector best practice in content identification, description, supply chain and rights management metadata across four sectors within the European Union; thus presenting the essential results of Task 4.1, “Private Sector Business Exploration”. The focus is on the description of the media industry and sectors within it, the technical specifications of standard identifiers and metadata schemas used within each sector, and the authoritatively recommended working practices in each. These descriptive and explanatory considerations will form the evidential, methodological and theoretical basis for the remaining Linked Heritage Work Package 4 (Public-Private Partnership) deliverables which will make recommendations for setting up partnerships between Europeana and the commercial sector.
D4.2 Specification of the technologies chosen (PDF, 4022 kb)
The current report builds on the findings of D4.1 with the results of Tasks T4.3, an empirical estimate and evaluation of potential commercial data contributors, and T4.4, an evidence-based technical specification for aggregating such data at scale. The current state of theory and practice in data integration is reviewed, focussing on efforts to achieve cultural-commercial sector interoperability, and potential solutions for the problem at hand are considered for feasibility given the limited resources. An experiment was undertaken to assess the feasibility of applying Linked Heritage’s existing data integration format, LIDO, and aggregation platform, MINT, to the previously identified industry standard metadata formats. The experiment’s focus was the ONIX for Books 3.0 mapping described explicitly in Linked Heritage’s Description of Work, although DDex for recorded music, EIDR for audiovisual materials and IPTC for photos were also investigated. This was done with the knowledge and cooperation of the relevant standards bodies, to achieve a reliable and standardised result in accordance with accepted industry best practice.
The report finds that, although other technical solutions exist – and some have been applied successfully to integration of commercial and heritage data – Linked Heritage’s existing pragmatic solution is adequate to this task in the case of ONIX for Books 3.0 and shows promising signs for the other three schemas. There is basic semantic compatibility in practice, confirming the theoretical assumptions of Linked Heritage D4.1. The existing ONIX mapping can be further tested and refined in support of discussions with potential commercial sector data contributors, and experiments on their test and prototype data, as work towards D4.3.
In order to progress from semantic schema mappings to a full-scale aggregation of data, significant technical questions remain to be answered in the cases of IPTC and EIDR data, and several enhancements to the LIDO schema and to the MINT aggregation software are proposed to bring Linked Heritage’s aggregation model in line with current commercial metadata best practice, as exemplified in the schema mappings considered.
Sources of data for each standard are described according to the likely amounts and quality of data available, and the costs, legal framework and technical requirements for accessing them. These themes will be expanded upon in the remaining deliverable from this Work Package.
Finally, the report recommends specific work to assist these enhancements, both within Linked Heritage and also the wider cultural heritage community, including Europeana itself and the international cultural heritage documentation committee, CIDOC.
D4.3 Specification of legal/licensing environment
D5.1 Linked Heritage Technology Platform (PDF , 1980 kb)
This document reports on deliverable D5.1 (Prototype, Public) “Linked Heritage Technology Platform”, that is made available online for validation and for the large-scale contribution of content to Europeana (in WP6) and for dissemination & training (WP7). The technology team led by NTUA integrated all the necessary components into a common technology platform, starting from the basis of the ATHENA ingestion server. The Linked Heritage Technology Platform provides content holders with the ability to perform in an efficient way the required mapping of their own metadata schemas to the project’s reference metadata schema, as well as their publication to Europeana. It is based on NTUA’s metadata interoperability platform MINT, that follows a typical web-based architecture offering an expanding set of services for metadata aggregation and remediation. It addresses the ingestion of metadata from multiple sources, the mapping of the imported records to a well-defined machine-understandable reference model, the transformation and storage of the metadata in a repository, and the provision of services that consume, process and remediate these metadata. Although its deployment is also guided by expediency, the system has been developed using established tools and standards, embodying best practices in order to animate familiar content provider procedures in an intuitive and transparent way.
D5.2 Documented APIs (PDF , 478 kb)
This report documents a set of application programming interfaces offering a simple and structured way to access the aggregation functionalities and repository capabilities of the Linked Heritage technology platform. A set of read access interfaces allows external systems to programmatically search and retrieve user and organization profiles, mapping files, reports and other data from the system. Write capabilities enable external systems to dynamically add new content as well as perform operations on existing datasets. The architecture design is based on predominant models such as REST and, standard wire-level protocols such as the AMPQ, in an effort to adopt existing tools and technologies that facilitate interoperability with external systems. Data exchange is following open standards such as XML, JSON, RSS and the Atom Publishing Protocol to allow developers to get up to speed quickly and reuse existing solutions. The APIs have been used in Task 5.2 to setup the authentication of the Terminology Management Platform (TMP) developed in WP3, allowing its integration with the technology platform, to establish the publication through the current Europeana harvesting mechanism and, in an experimental integration of the whole MINT platform with the newly introduced Europeana United Ingestion Manager (UIM).
D5.3 Metadata gateway (PDF , 726 kb)
This report documents the technological solution for the establishment of the Linked Heritage Metadata Gateway (Task 5.3). The service complements the project’s technology platform (Task 5.1, Deliverable D5.1) offering the technical components necessary for metadata remediation and in particular for the delivery of metadata to Europeana (ESE/EDM compliant). Following the modeling requirements of WP2, WP3 and WP4 and, close collaboration with the Europeana Office ensures interoperability of the Linked Heritage Technology Platform with the Europeana Ingestion infrastructure and the Europeana portal. The Linked Heritage Metadata Gateway handles and remediates XML metadata for the execution of the ingestion plan and RDF resources for prototyping of semantic web-enabled metadata services.
D7.1 Project public website (PDF, 679 kb)
This deliverable describes the website developed for the project, in particular the aims it intends to reach, the users for which it is targeted, the software used, the structure of the public and the reserved areas, the implementation work, the services, the editorial board, the tools for monitoring the website.
D7.2 Dissemination materials (PDF, 1705 kb)
This document, targeted to all components of the Linked Heritage best practice network intends to be an easy-to-use internal guide describing dissemination material that have been completed or will be available in the near future, as well as guidelines on how, where and when to distribute the material, and last but not least, on the dissemination methodologies activities carried out by partners. The goal of this document is to inform, improve, streamline, and standardize the procedures concerning the project’s dissemination activities. The standardization of procedures will also help the project management in monitoring and reporting activities and outputs.
D7.3 Training materials
D7.4 Virtual Learning Environment
D7.5 Dissemination Plan (PDF, 506 kb)
This Dissemination Plan provides an overview of the dissemination strategy, activities, and materials the Linked Heritage Project intends to use over the lifetime of the project with the goal of distributing the Project’s results as widely as possible.
The Linked-Heritage project results are targeted at: content providers, aggregators, policy groups, agencies and governments, sister projects, the cultural heritage research community, national research and competence centres in the areas of cultural heritage, and e-Infrastructures, the private sector, the Europeana Foundation (including Network, and Labs).
The Dissemination Plan identifies the different messages to be addressed to each of these target groups; the methods employed, and the expected outcome of those activities.
Central to the dissemination of information is the project website, whose pivotal role is described in this document, and whose design is discussed in more detail in Deliverable 7.1.
In addition, a series of conferences and seminars will also play an equally critical role in the dissemination process, and together with the website anchor will insure that the results will reach a broad range of pertinent audiences across Europe and beyond.
The final chapter describes how this process will be monitored.
This document, targeted to all sectors of the Linked Heritage best practice network, serves as an easy-to-use internal guide that describes how, where and when the guidelines were created, or will be made available in the near future. In addition, this report describes the dissemination methodologies and activities carried out by partners in the project.
The goal of this document is to inform, improve, streamline, and standardise the procedures concerning the project’s dissemination activities.
The standardisation of procedures will also help the project management in monitoring and reporting activities and outcomes.
As several events took place over the first months of the project, the need arose to provide guidelines to the partners of the project. The mechanisms employed for Linked Heritage dissemination at this early stage, therefore, were already fully described in D7.2 Dissemination material. For this reason, we are not repeating but summarising the information in this plan.