Skip to main content

Table 2 The PGN tagging solutions are incorporate different components, i.e. lexical resources or trained machine-learning based entity recognizers

From: Evaluating gold standard corpora against gene/protein tagging solutions and lexical resources

Tagger

Acronym

Tagger

Lexical

# Lexical

Id

Training

FP

name

 

type

resource

entries

 

data

filter

Banner

 

ML

–

–

No

BC2

Banner

Chang2

Ch2

ML

–

–

No

BC2

Chang2

Abner (BC1)

 

ML

–

–

No

BC1

Abner (BC1)

Abner (Jnlpba)

 

ML

–

–

No

Jnlpba

Abner (Jnlpba)

SwissProt

SP

Lex

SwissProt

228,893

Yes

–

BNC

SwissProt (GP7)

 

Lex

GP7

868,050

Yes

–

BNC

BioLexicon

 

Lex

BioLexicon

653,212

Yes

–

BNC

GeneProt 7.0

GP7

Lex

GP7

1,725,500

Yes

–

BNC

Wh-Ukpmc

 

Lex+ML

SwissProt

228,893

Yes

–

BNC, Chang2

Wh-Ukpmc (GP7)

WH7

Lex+ML

GP7

868,050

Yes

–

BNC, Chang2

Gnat (human)

 

Lex+ML

Human genes

 

Yes

BC2

–

Gnat (all)

 

Lex+ML

11 species

80,000

Yes

BC2

–

Gnat-GN (all)

 

Lex+ML

11 species

80,000

Yes

BC2

–