Skip to main content

Table 1 Bio2RDF datasets currently available

From: Ontology-Based Querying with Bio2RDF’s Linked Open Data

Dataset Namespace # of triples # of unique subjects # of unique predicates # of unique objects
Affymetrix affymetrix 44469611 1370219 79 13097194
Biomodels* biomodels 589753 87671 38 209005
Comparative Toxicogenomics Database ctd 141845167 12840989 27 13347992
DrugBank drugbank 1121468 172084 75 526976
NCBI Gene ncbigene 394026267 12543449 60 121538103
Gene Ontology Annotations goa 80028873 4710165 28 19924391
HUGO Gene Nomenclature Committee hgnc 836060 37320 63 519628
Homologene homologene 1281881 43605 17 1011783
InterPro*† interpro 999031 23794 34 211346
iProClass iproclass 211365460 11680053 29 97484111
iRefIndex† irefindex 31042135 1933717 32 4276466
Medical Subject Headings mesh 4172230 232573 60 1405919
National Center for Biomedical Ontology*† ncbo 15384622 4425342 191 7668644
National Drug Code Directory* ndc 17814216 301654 30 650650
Online Mendelian Inheritance in Man omim 1848729 205821 61 1305149
Pharmacogenomics Knowledge Base pharmgkb 37949275 5157921 43 10852303
SABIO-RK* sabiork 2618288 393157 41 797554
Saccharomyces Genome Database sgd 5551009 725694 62 1175694
NCBI Taxonomy taxon 17814216 965020 33 2467675
Total 19 1010758291 57850248 1003 298470583
  1. The Bio2RDF datasets currently available for SPARQL querying and download at http://bio2rdf.org. The total number of triples, number of unique subject, number of unique predicates and number of unique objects are listed along with the Bio2RDF namespace for each dataset.
  2. * Datasets new to the Bio2RDF network
  3. † InterPro contains 13 domain resources, iRefIndex contains 13 interaction resources, and NCBO contains 107 OBO ontologies.