Data Quality Issues in Current Nanopublications

Imran Asif
Heriot-Watt University

Wednesday 18 September 2019
11:15 - 12:15
CM T.01


Nanopublications are a granular way of publishing scientiļ¬c claims together with their associated provenance and publication information. More than 10 million nanopublications have been published by a handful of researchers covering a wide range of topics within the life sciences. We were motivated to replicate an existing analysis of these nanopublications, but then went deeper into the structure of the existing nanopublications. In this paper, we analyse the usage of nanopublications by investigating the distribution of triples in each part and discuss the data quality issues raised by this analysis. From this analysis we argue that there is a need for the community to develop a set of community guidelines for the modelling of nanopublications.

Host: Alasdair Gray