Table 1. A number of gold standard corpora have been released to the public for the evaluation of PGN tagging solutions.

From: Evaluating gold standard corpora against gene/protein tagging solutions and lexical resources

| Name | Release | # Annotations | # Units | Topic |
|---|---|---|---|---|
| Jnlpba | 2004 | 6,142 | 401 abstracts | Subset of Genia |
| BioCreative-II | 2005 | 5,144 | 4,171 sentences | Human proteins |
| PennBio | 2006–07 | 18,148 | 1,414 abstracts | Oncology |
| FsuPrge | 2009 | 59,483 | 3,236 abstracts | Gene regulatory processes |