Skip to main content

Table 3 All available terminological resources have been compared against each other to identify the most comprehensive and the most universal ones

From: Evaluating gold standard corpora against gene/protein tagging solutions and lexical resources

   

Tagger

       

Match

Corpus

# Entries

SwissProt

[%]

Biolexicon

[%]

SwissProt (GP7)

[%]

GP7

[%]

 

SwissProt

228,893

  

208,069

90.9%

121,369

53.0%

135,018

60.0%

Exact

BioLexicon

653,212

207,976

31.8%

  

243,573

37.3%

422,477

64.7%

 

SP(GP7)

868,050

121,030

13.9%

243,271

28.0%

  

860,094

99.1%

 

GP7

1,725,500

134,275

7.8%

421,520

24.4%

859,536

49.8%

  
 

SwissProt

228,893

  

213,009

93.0%

201,633

88.1%

206,047

90.0%

Alias

BioLexicon

653,212

229,759

35.2%

  

375,550

57.5%

585,205

89.6%

 

SP(GP7)

868,050

219,185

25.3%

364,171

42.0%

  

865,590

99.7%

 

GP7

1,725,500

267,947

15.5%

644,115

37.3%

956,314

55.4%

 Â