New Workshop Paper: Wikidata Reference Statement Statistics

Wikidata can attach references to each and every statement that it stores. This paper builds on Seyed’s earlier work on creating topical subsets of Wikidata. Here we focus on the provision of references for statements in 6 topical subsets of Wikidata and report summary statistics on their usage.

  1. Reference Statistics in Wikidata Topical Subsets

    Abstract: Wikidata is the only general-purpose open knowledge graph with the capability of specifying references for every single statement. Currently, about 68% of Wikidata statements have at least one reference but the quality of these references is rarely covered in data quality studies. There is also a lack of a comprehensive framework for evaluating references. In this paper, we investigate the statistics of Wikidata references in 6 topical subsets of Wikidata. We compare these statistics over two Wikidata dumps; one from 2016 and one from 2021.

    Hosseini Beghaeiraveri, Seyed Amir and Gray, Alasdair J. G. and McNeill, Fiona

    In Proceedings of the 2nd Wikidata Workshop (Wikidata 2021), co-located with the 20th International Semantic Web Conference (ISWC 2021), 2021

About Me


I'm an Associate Professor in Computer Science at Heriot-Watt University. My research focuses on linking datasets.