Data on our cultural heritage hold enormous potential for Europe’s economic growth, for the construction of a more inclusive narrative of its past, and for the future collective identity of its citizens. The biggest obstacle to unlocking that potential is the current lack of integration and interoperability among the countless datasets describing the holdings of European heritage institutions. ManuscriptAI will help remove that obstacle for the data on Europe’s medieval written heritage: its manuscripts. Premodern handwritten books are a pivotal category of our heritage, yet they are currently underrepresented in large research infrastructures, and their catalog data are locked in digital silos. ManuscriptAI will use machine learning to build a model that facilitates the autonomous integration of distinct data sources describing medieval manuscripts under a predefined set of machine-understandable vocabulary terms. The model will be made accessible through a human engagement interface and tested in a pilot in a real-world setting.
The project will meet two important desiderata: (1) a user-friendly AI tool that allows heritage professionals to convert their manuscript metadata to Linked Open Data, and (2) a dedicated ontology for the description of medieval manuscripts, completing the CIDOC-CRM extensions for the cultural heritage domain. The project builds on the achievements of the ERC-2018-StG project PASSIM and is supported by a consortium of research institutes, heritage professionals, and (inter)national research infrastructures. ManuscriptAI will advance the EU’s agenda for digital heritage. The tool will help democratise datafication by making Linked Open Data accessible to small heritage institutions and actively involving them in the tool’s development. This integration tool for data on medieval manuscripts will be a major step forward for the digital preservation and usability of Europe’s unique handwritten heritage.