Skip to main content

Table 3 Effect of filtering on combined training data (cross-validation folds from development and training corpus) and on the held-back test data set.

From: Simple tricks for improving pattern-based information extraction from the biomedical literature

  Development (per split) Test
  # patterns Aver. pattern length Precision Recall F1 Precision Recall F1
Baseline 590 8.93 24.7 49.2 32.9 17.2 43.9 24.8
Split 1 50 5.34 65.6 51.8 57.9 64.7 42.7 51.4
Split 2 50 4.86 78.1 52.3 62.6 63.0 37.8 47.3
Split 3 60 4.68 67.6 52.9 59.3 60.9 42.5 50.1
Split 4 40 5.02 67.7 49.5 57.2 66.6 36.7 47.3
Split 5 50 4.80 63.7 48.7 55.2 64.2 40.7 49.8
Union of patterns 104 5.65     58.2 46.8 51.9
Best 90 90 5.66     59.7 45.1 51.4
Best 80 80 5.75     64.8 37.7 47.6
Best 70 70 6.01     69.4 26.7 38.6
Best 60 60 6.17     60.0 10.0 17.1
Results of the winner of the shared task [21]       78.5 69.8 73.9
  1. See the definition of splits in text in Results (Evaluation of Test Data)