Skip to main content

Table 2 Statistics on the three data sets provided by the BioNLP task.

From: Simple tricks for improving pattern-based information extraction from the biomedical literature

  Training Development Test
Abstracts 800 150 260
Sentences 7,449 1,450 2,447
Words 176,146 33,937 57,367
Gene expression events 1,738 356 722
Protein catabolism events 111 21 14
Transcription events 576 82 137
Phosphorylation events 169 47 135