Skip to main content

Table 2 Statistics on the three data sets provided by the BioNLP task.

From: Simple tricks for improving pattern-based information extraction from the biomedical literature

 

Training

Development

Test

Abstracts

800

150

260

Sentences

7,449

1,450

2,447

Words

176,146

33,937

57,367

Gene expression events

1,738

356

722

Protein catabolism events

111

21

14

Transcription events

576

82

137

Phosphorylation events

169

47

135