OpenCitations Data Quality Monitor

Meta Monitor Results

Collection: OpenCitations Meta

Endpoint: https://test.opencitations.net/meta/sparql

Date and Time: 13/02/2025, 15:26:55

Total Running Time: 320.958 seconds

Label | Issue Description | Test result | Running Time (seconds) | Raised Error
duplicate_br | There is at least one case of multiple fabio:Expression entities sharing the same ID value for any given scheme (e.g. the same DOI is linked to 2 separate journal articles, as in https://opencitations.net/meta/api/v1/metadata/omid:br/061103623233 and https://opencitations.net/meta/api/v1/metadata/omid:br/061602208852). | Failed | 2.571 | -
multiple_id_values | There are more than 10,000 BRs that have 2 or more values for at least one of their supported ID schemes (e.g. 2 DOIs for a single journal article, as in https://opencitations.net/meta/api/v1/metadata/omid:br/06120410820). | - | 284.812 | Remote end closed connection without response
duplicate_agent | There is at least one case of multiple foaf:Agent entities sharing the same ID value for any given scheme (e.g. different authors have the same ORCID, as for the authors of https://opencitations.net/meta/api/v1/metadata/omid:br/061702782637). | Failed | 1.202 | -
multiple_manifestations | There are more than 15 BRs that are embodied in multiple fabio:Manifestation entities (e.g. https://opencitations.net/meta/br/062503856318.html). | Passed | 12.344 | -
br_in_multiple_venues | There are more than 1,000 BRs that (wrongly) appear to be contained in different venues (e.g. the journal article described at https://test.opencitations.net/meta/api/v1/metadata/omid:br/0603904064). | Failed | 20.028 | -
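
The logic behind the ID-related checks above can be illustrated with a minimal in-memory sketch. It is not the monitor's implementation, which queries the SPARQL endpoint. The record layout `(br, scheme, value)`, the function names, and the sample data are all hypothetical, and the sketch assumes that a test is reported as Failed when the described issue is detected (for threshold tests, when the count exceeds the threshold).

```python
from collections import defaultdict


def find_shared_ids(records):
    """Map each (scheme, value) pair to the BRs sharing it, keeping only
    pairs attached to more than one BR (the duplicate_br situation)."""
    owners = defaultdict(set)
    for br, scheme, value in records:
        owners[(scheme, value)].add(br)
    return {key: sorted(brs) for key, brs in owners.items() if len(brs) > 1}


def count_brs_with_multiple_values(records):
    """Count BRs having 2 or more values for at least one ID scheme
    (the multiple_id_values situation)."""
    values = defaultdict(set)
    for br, scheme, value in records:
        values[(br, scheme)].add(value)
    return len({br for (br, _scheme), vals in values.items() if len(vals) > 1})


def threshold_result(count, threshold):
    """Assumed convention: the test fails when the count exceeds the threshold."""
    return "Failed" if count > threshold else "Passed"


# Hypothetical sample data: br/1 and br/2 share a DOI; br/3 has two DOIs.
records = [
    ("br/1", "doi", "10.1000/a"),
    ("br/2", "doi", "10.1000/a"),
    ("br/3", "doi", "10.1000/b"),
    ("br/3", "doi", "10.1000/c"),
]
shared = find_shared_ids(records)
multi = count_brs_with_multiple_values(records)
```

With the sample data, `shared` holds the single DOI linked to two BRs, and `threshold_result(multi, 10_000)` evaluates the count against the threshold quoted for multiple_id_values.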

Index Monitor Results

Collection: OpenCitations Index

Endpoint: https://test.opencitations.net/index/sparql

Date and Time: 13/02/2025, 15:32:16

Total Running Time: 0.52 seconds

Label Issue Description Test result Running Time (seconds) Raised Error
circular_citation The same entity appears as both citing and cited entity for the same Citation. - 0.52 EndPointInternalError: The endpoint returned the HTTP status code 500. Response: b'{\n "exception": "Tried to allocate 16.1 GB, but only 13.8 GB were available. Clear the cache or allow more memory for QLever during startup",\n "query": "PREFIX cito: \\n\\nSELECT ?citation WHERE {\\n ?citation a cito:Citation ;\\n cito:hasCitingEntity ?entity ;\\n cito:hasCitedEntity ?entity .\\n}\\nLIMIT 1",\n "resultsize": 0,\n "runtimeInformation": {\n "cache_status": "computed",\n "children": [\n {\n "cache_status": "computed",\n "children": [\n {\n "cache_status": "computed",\n "children": [],\n "column_names": [\n "?citation",\n "?entity"\n ],\n "description": "IndexScan ?citation ?entity",\n "details": null,\n "estimated_column_multiplicities": [\n 1.0000373125076294,\n 27.116676330566406\n ],\n "estimated_operation_cost": 2013014332,\n "estimated_size": 2013014332,\n "estimated_total_cost": 2013014332,\n "operation_time": 0,\n "original_operation_time": 0,\n "original_total_time": 0,\n "result_cols": 2,\n "result_rows": 0,\n "status": "failed",\n "total_time": 0\n },\n {\n "cache_status": "computed",\n "children": [],\n "column_names": [\n "?citation",\n "?entity"\n ],\n "description": "IndexScan ?citation ?entity",\n "details": null,\n "estimated_column_multiplicities": [\n 1.0000373125076294,\n 27.661487579345703\n ],\n "estimated_operation_cost": 2013014332,\n "estimated_size": 2013014332,\n "estimated_total_cost": 2013014332,\n "operation_time": 0,\n "original_operation_time": 0,\n "original_total_time": 0,\n "result_cols": 2,\n "result_rows": 0,\n "status": "not started",\n "total_time": 0\n }\n ],\n "column_names": [\n "?citation",\n "?entity"\n ],\n "description": "MultiColumnJoin on ?entity ?citation ",\n "details": null,\n "estimated_column_multiplicities": [\n 1.0000746250152588,\n 27.117687225341797\n ],\n "estimated_operation_cost": 8771447564,\n 
"estimated_size": 72778609,\n "estimated_total_cost": 12797476228,\n "operation_time": 0,\n "original_operation_time": 0,\n "original_total_time": 0,\n "result_cols": 2,\n "result_rows": 0,\n "status": "failed because child failed",\n "total_time": 0\n },\n {\n "cache_status": "computed",\n "children": [],\n "column_names": [\n "?citation"\n ],\n "description": "IndexScan ?citation ",\n "details": null,\n "estimated_column_multiplicities": [\n 1.0\n ],\n "estimated_operation_cost": 2012939079,\n "estimated_size": 2012939079,\n "estimated_total_cost": 2012939079,\n "operation_time": 0,\n "original_operation_time": 0,\n "original_total_time": 0,\n "result_cols": 1,\n "result_rows": 0,\n "status": "not started",\n "total_time": 0\n }\n ],\n "column_names": [\n "?citation",\n "?entity"\n ],\n "description": "Join on ?citation",\n "details": null,\n "estimated_column_multiplicities": [\n 1.0,\n 18.98238182067871\n ],\n "estimated_operation_cost": 2136662712,\n "estimated_size": 50945024,\n "estimated_total_cost": 16947078019,\n "operation_time": 1,\n "original_operation_time": 0,\n "original_total_time": 0,\n "result_cols": 2,\n "result_rows": 0,\n "status": "failed because child failed",\n "total_time": 1\n },\n "status": "ERROR",\n "time": {\n "computeResult": 4,\n "total": 4\n }\n}'
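
The circular_citation test produced no result because the endpoint could not allocate the memory needed to join the citing and cited scans (16.1 GB requested, 13.8 GB available). As an illustration only, the condition the query expresses can be sketched as a streaming check over citation records, which keeps one record in memory at a time. The tuple layout `(citation_id, citing, cited)`, the function name, and the sample data are hypothetical and are not part of the monitor.

```python
def iter_circular_citations(citations):
    """Yield the ID of every citation whose citing and cited entity coincide.

    `citations` may be any iterable, including a lazy reader over a dump,
    so the whole dataset never has to be loaded at once.
    """
    for citation_id, citing, cited in citations:
        if citing == cited:
            yield citation_id


# Hypothetical sample data: ci/2 cites itself.
sample = [
    ("ci/1", "br/10", "br/11"),
    ("ci/2", "br/12", "br/12"),
    ("ci/3", "br/13", "br/10"),
]
circular = list(iter_circular_citations(sample))
```

Because the monitor's query ends with `LIMIT 1`, only the existence of one such citation matters; the equivalent here would be `next(iter_circular_citations(sample), None)`.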
Last updated: 13/02/2025, 16:32:16