Skip to main content

Table 1 Distribution of disease-phenotype associations in the generated datasets by provenance

From: Linking common human diseases to their phenotypes; development of a resource for human phenomics

Provenance Text Mined Text Mined (UKB) Semi-automatic Semi-automatic (UKB)
PubMed 2,755,333 985,511 - -
Wikidata - - 1,838 295
HPO (through OMIM–ICD-10 from Wikidata) - - 32,323 3,914
HPO (through OMIM–ICD-10 from UMLS) - - 2,362 423
UMLS - - 1,287 541
Expert curation - - - 433
Propagation(ICD-10) - - 10,201 1,214
Propagation(HPO) - - 9660 756
TOTAL 2,755,333 985,511 57,671 7,576
  1. UKB denotes the subset covering common diseases only from UK Biobank