Skip to content
Snippets Groups Projects
user avatar
Philipp Schneider authored
838caea5
History

DHO Knowledge Graph Data Integration

Automated Pipelines and Mappings to integrate new data into the Digital Heraldry Knowledge Graph

Directory Structure

  • data/
    • input/ New data to be integrated into the Knowledge Graph
    • rdf-output/ RDF files created by the transformation pipelines
  • src/
    • rdf-mappings Mapping script to transform data into RDF
  • config/ Includes json files, containing information how to run the scripts. Each config-file has the corresponding script name embedded in its name as well as in its content.

Pipeline

  • Visualisation of the complete Pipeline with Github mermaid

Usage

  • Add usage instructions, when pipeline is complete

Important Dependencies

Mapping Scripts

Map descriptions of Coats of Arms to RDF

Uses the descriptions from the OMA table tblBranch. Mapping is done by the script map-tblBranch.py. The script can be configured through the file config/config-map-tblBranch.json. This config-file contains:

  • csv_input_path: source file from which the coat of arms descriptions shall be mapped.
  • term_mappings: Mapping table, resolving abbreviations for heraldic terms, that are used in tblBranch.
  • initial_ontology_definitions: Decides, if classes and properties are defined before adding new data to the knowledge graph. Can be set with a python file which contains a number class and property definitions, executed by rdflib. These definitions are then executed in map-tblBranch.py before any data is being mapped from tblBranch (set in csv_input_path). If null is given as a value for initial_ontology_definitions, no classes or properties are added in advance.