»A machine processor converting art and culture into numbers«

Vera Münch, in Information, Wissenschaft & Praxis, 2011

IAIS-Cortex manages huge quantities of data and establishes new semantic knowledge networks

IAIS-Cortex was conceived and developed for the German Digital Library (Deutsche Digitale Bibliothek, or DDB for short). It is the DDB's core system for integrating information and data on cultural and scientific heritage gathered from Germany’s cultural and scientific institutions.

IAIS-Cortex is modular and flexible. It can be applied to any scenario in which large, heterogeneous data sources need to be aggregated and linked. Its flexible construction allows it to be adapted to specific data models and formats. Using semantic technologies, the gathered factual knowledge can be linked internally and enriched with external knowledge from linked open data, company-specific knowledge databases and ontologies. Because IAIS-Cortex uses a standardized interface, the desired information can be delivered to a wide range of end-user applications, from web portals to smartphone apps. IAIS-Cortex can also be integrated into existing infrastructure environments.

IAIS-Cortex has proved its scalability and reliability in the German Digital Library, which already contained more than five million objects in autumn 2012 and continues to expand.

Features of IAIS-Cortex

Flexible data integration (ingest)

An automated process based on the OAIS reference model organizes the parallel integration of information gathered from a number of data sources. Using individual mappings, heterogeneous source data formats are converted to an internal data model, combined at the same time with factual knowledge already gathered, and semantically enriched with external ontologies. At the ingest stage, semantic facets are generated as the basis for the search filters offered in later searches. The loss-free internal data model maintains the semantic depth of a range of multifaceted metadata formats, so the original objects are preserved and can be made available at any time.
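The mapping and facet steps described above can be pictured with a small sketch. It is an illustration only, not the IAIS-Cortex implementation: the source names, field names, the `MAPPINGS` table and the functions `ingest` and `build_facets` are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical per-source mappings: source field name -> internal field name.
MAPPINGS = {
    "museum": {"titel": "title", "kuenstler": "creator", "typ": "type"},
    "archive": {"name": "title", "author": "creator", "kind": "type"},
}

def ingest(source, record):
    """Convert one source record to the internal model, keeping the original."""
    mapping = MAPPINGS[source]
    internal = {mapping[k]: v for k, v in record.items() if k in mapping}
    # Loss-free: the untouched source record travels with the internal object.
    internal["original"] = dict(record)
    return internal

def build_facets(objects, facet_fields=("creator", "type")):
    """Count the values of each facet field across all ingested objects."""
    facets = defaultdict(lambda: defaultdict(int))
    for obj in objects:
        for field in facet_fields:
            if field in obj:
                facets[field][obj[field]] += 1
    return {field: dict(counts) for field, counts in facets.items()}

objects = [
    ingest("museum", {"titel": "Selbstbildnis", "kuenstler": "Dürer",
                      "typ": "painting", "inventar": "A1"}),
    ingest("archive", {"name": "Letter", "author": "Dürer", "kind": "manuscript"}),
]
facets = build_facets(objects)
```

Fields without a mapping, such as the invented `inventar` above, do not appear in the internal fields but remain available through the preserved original record.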

Reliable data management (persistency)

IAIS-Cortex manages the aggregated information (e.g. indexing data), the dedicated binary data (e.g. multimedia files) and the search indices (e.g. Solr) via a flexible interface, for instance on a freely scalable file system, in a database-based system or in a storage cloud. This makes the system particularly easy to configure for large data quantities, and it can be laid out redundantly in several ways for particularly high system reliability.
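One way to picture such a flexible storage interface is a common contract with interchangeable backends. The sketch below is a minimal illustration under that assumption; the class names `Storage`, `MemoryStorage`, `FileStorage` and `RedundantStorage` are hypothetical and are not taken from IAIS-Cortex.

```python
from abc import ABC, abstractmethod
from pathlib import Path
import tempfile

class Storage(ABC):
    """Hypothetical contract that every storage backend fulfils."""
    @abstractmethod
    def put(self, key: str, data: bytes) -> None: ...
    @abstractmethod
    def get(self, key: str) -> bytes: ...

class MemoryStorage(Storage):
    def __init__(self):
        self._items = {}
    def put(self, key, data):
        self._items[key] = data
    def get(self, key):
        return self._items[key]  # raises KeyError if the key is missing

class FileStorage(Storage):
    def __init__(self, root):
        self._root = Path(root)
        self._root.mkdir(parents=True, exist_ok=True)
    def put(self, key, data):
        (self._root / key).write_bytes(data)
    def get(self, key):
        return (self._root / key).read_bytes()  # raises FileNotFoundError

class RedundantStorage(Storage):
    """Writes to every backend; reads from the first one that has the key."""
    def __init__(self, backends):
        self._backends = list(backends)
    def put(self, key, data):
        for backend in self._backends:
            backend.put(key, data)
    def get(self, key):
        for backend in self._backends:
            try:
                return backend.get(key)
            except (KeyError, FileNotFoundError):
                continue
        raise KeyError(key)

store = RedundantStorage([MemoryStorage(), FileStorage(tempfile.mkdtemp())])
store.put("image-1.bin", b"\x00\x01")
```

Because callers only see the `Storage` contract, a backend can be exchanged or duplicated without changes to the code that reads and writes objects.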

Individual access (information retrieval)

An open interface enables individual access to the stored data from third-party systems or end-user interfaces such as web portals or smartphone apps. Faceted search across the information portfolio includes semantic filtering to systematically narrow the search results, and provides access to objects, binaries and the original objects.
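How facet filters narrow a result set can be shown with a few lines of code. This is a simplified sketch with invented sample data; the function `faceted_search` and the field names are assumptions for illustration, and a production system would delegate this work to a search index such as Solr.

```python
def faceted_search(objects, query, selected):
    """Return objects whose title contains the query and which match
    every selected facet value (facet field -> required value)."""
    query = query.lower()
    hits = []
    for obj in objects:
        if query not in obj["title"].lower():
            continue
        if all(obj.get(field) == value for field, value in selected.items()):
            hits.append(obj)
    return hits

# Invented sample catalogue for the example.
catalogue = [
    {"title": "Self-Portrait", "creator": "Dürer", "type": "painting"},
    {"title": "Portrait of a Young Man", "creator": "Dürer", "type": "drawing"},
    {"title": "Portrait of a Lady", "creator": "Cranach", "type": "painting"},
]

hits = faceted_search(catalogue, "portrait",
                      {"creator": "Dürer", "type": "painting"})
```

Each additional facet in `selected` can only shrink the result list, which is the systematic restriction of search results that the filters are meant to provide.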

