From: Evaluating gold standard corpora against gene/protein tagging solutions and lexical resources
 |  |  | Tagger |  |  |  |  |  |  |  |
---|---|---|---|---|---|---|---|---|---|---|
Match | Corpus | # Entries | SwissProt | [%] | BioLexicon | [%] | SwissProt (GP7) | [%] | GP7 | [%] |
Exact | SwissProt | 228,893 |  |  | 208,069 | 90.9% | 121,369 | 53.0% | 135,018 | 60.0% |
Exact | BioLexicon | 653,212 | 207,976 | 31.8% |  |  | 243,573 | 37.3% | 422,477 | 64.7% |
Exact | SP(GP7) | 868,050 | 121,030 | 13.9% | 243,271 | 28.0% |  |  | 860,094 | 99.1% |
Exact | GP7 | 1,725,500 | 134,275 | 7.8% | 421,520 | 24.4% | 859,536 | 49.8% |  |  |
Alias | SwissProt | 228,893 |  |  | 213,009 | 93.0% | 201,633 | 88.1% | 206,047 | 90.0% |
Alias | BioLexicon | 653,212 | 229,759 | 35.2% |  |  | 375,550 | 57.5% | 585,205 | 89.6% |
Alias | SP(GP7) | 868,050 | 219,185 | 25.3% | 364,171 | 42.0% |  |  | 865,590 | 99.7% |
Alias | GP7 | 1,725,500 | 267,947 | 15.5% | 644,115 | 37.3% | 956,314 | 55.4% |  |  |