Skip to main content

Table 7 Mean test \(F_1\) scores of the best models of each category per slot (mean and standard deviation of 10 training runs)

From: Comparing generative and extractive approaches to information extraction from abstracts describing randomized clinical trials

 

Type 2 diabetes \(F_1\)

Glaucoma \(F_1\)

Slot name

Generative

Extractive

Generative

Extractive

AggregationMethod

0.41 (± 0.07)

0.54 (± 0.03)

0.52 (± 0.08)

0.67 (± 0.13)

AllocationRatio

0.0 (± 0.0)

0.92 (± 0.05)

-

-

analysesHealthCondition

0.84 (± 0.05)

0.73 (± 0.06)

0.87 (± 0.02)

0.86 (± 0.03)

Author

0.97 (± 0.03)

0.92 (± 0.01)

0.8 (± 0.02)

0.94 (± 0.04)

AvgAge

0.0 (± 0.0)

0.37 (± 0.06)

-

-

BaselineUnit

0.42 (± 0.03)

0.44 (± 0.05)

0.54 (± 0.06)

0.56 (± 0.07)

BaselineValue

0.49 (± 0.06)

0.3 (± 0.03)

0.67 (± 0.08)

0.59 (± 0.03)

CTDesign

0.82 (± 0.02)

0.9 (± 0.01)

0.8 (± 0.04)

0.82 (± 0.03)

CTduration

0.89 (± 0.03)

0.89 (± 0.04)

0.78 (± 0.06)

0.87 (± 0.04)

ChangeValue

0.41 (± 0.06)

0.19 (± 0.05)

0.59 (± 0.07)

0.52 (± 0.05)

ConclusionComment

0.73 (± 0.04)

0.88 (± 0.04)

0.84 (± 0.02)

0.91 (± 0.02)

ConfIntervalChangeValue

0.0 (± 0.0)

0.0 (± 0.0)

-

-

ConfIntervalDiff

0.46 (± 0.12)

0.43 (± 0.05)

0.29 (± 0.11)

0.28 (± 0.11)

Country

0.68 (± 0.07)

0.64 (± 0.08)

0.86 (± 0.05)

0.86 (± 0.03)

DeliveryMethod

0.0 (± 0.0)

0.0 (± 0.0)

0.34 (± 0.2)

0.42 (± 0.06)

DiffGroupAbsValue

0.45 (± 0.09)

0.43 (± 0.06)

0.31 (± 0.16)

0.43 (± 0.11)

DoseDescription

0.0 (± 0.0)

0.0 (± 0.0)

-

-

DoseUnit

0.77 (± 0.04)

0.24 (± 0.04)

0.8 (± 0.08)

0.6 (± 0.06)

DoseValue

0.79 (± 0.04)

0.77 (± 0.07)

0.75 (± 0.07)

0.65 (± 0.06)

Drug

0.82 (± 0.05)

0.7 (± 0.02)

0.58 (± 0.04)

0.45 (± 0.06)

Duration

-

-

0.0 (± 0.0)

0.2 (± 0.32)

EndoPointDescription

0.34 (± 0.02)

0.3 (± 0.02)

0.26 (± 0.05)

0.25 (± 0.03)

FinalNumPatientsArm

0.0 (± 0.0)

-

0.0 (± 0.0)

0.02 (± 0.06)

FinalNumberPatientsCT

-

-

0.0 (± 0.0)

0.64 (± 0.13)

Frequency

0.61 (± 0.06)

0.62 (± 0.02)

0.77 (± 0.06)

0.71 (± 0.04)

Journal

0.96 (± 0.05)

0.92 (± 0.05)

0.67 (± 0.08)

0.74 (± 0.07)

MeasurementDevice

-

-

0.0 (± 0.0)

0.2 (± 0.32)

MinAge

0.0 (± 0.0)

0.67 (± 0.17)

-

-

NumberAffected

0.16 (± 0.19)

0.08 (± 0.13)

0.4 (± 0.22)

0.0 (± 0.0)

NumberPatientsArm

0.83 (± 0.09)

0.87 (± 0.02)

0.68 (± 0.12)

0.7 (± 0.06)

NumberPatientsCT

0.65 (± 0.08)

0.93 (± 0.04)

0.65 (± 0.09)

0.86 (± 0.02)

ObjectiveDescription

0.43 (± 0.06)

0.49 (± 0.05)

0.49 (± 0.07)

0.51 (± 0.09)

ObservedResult

0.03 (± 0.03)

0.01 (± 0.01)

0.01 (± 0.03)

0.0 (± 0.0)

PMID

0.97 (± 0.03)

1.0 (± 0.0)

0.98 (± 0.02)

0.99 (± 0.01)

PValueChangeValue

0.1 (± 0.11)

0.0 (± 0.0)

0.0 (± 0.0)

0.33 (± 0.05)

PercentageAffected

0.59 (± 0.08)

0.21 (± 0.03)

0.31 (± 0.18)

0.15 (± 0.03)

Precondition

0.22 (± 0.08)

0.4 (± 0.07)

0.27 (± 0.05)

0.18 (± 0.04)

PublicationYear

0.97 (± 0.03)

1.0 (± 0.0)

0.98 (± 0.02)

1.0 (± 0.0)

PvalueDiff

0.31 (± 0.05)

0.48 (± 0.02)

0.24 (± 0.06)

0.39 (± 0.06)

RelativeChangeValue

0.04 (± 0.09)

0.0 (± 0.0)

0.13 (± 0.2)

0.57 (± 0.12)

RelativeFreqTime

-

-

0.0 (± 0.0)

0.36 (± 0.1)

ResultMeasuredValue

0.27 (± 0.07)

0.21 (± 0.04)

0.57 (± 0.07)

0.35 (± 0.02)

SdDevBL

0.18 (± 0.18)

0.14 (± 0.19)

0.53 (± 0.09)

0.62 (± 0.05)

SdDevChangeValue

0.02 (± 0.06)

0.0 (± 0.0)

0.38 (± 0.14)

0.45 (± 0.07)

SdDevResValue

0.2 (± 0.13)

0.19 (± 0.02)

0.62 (± 0.13)

0.34 (± 0.02)

SdErrorChangeValue

-

-

0.0 (± 0.0)

0.57 (± 0.0)

SubGroupDescription

0.0 (± 0.0)

0.0 (± 0.0)

-

-

TimePoint

0.35 (± 0.11)

0.22 (± 0.03)

0.39 (± 0.07)

0.41 (± 0.03)

Title

0.86 (± 0.05)

0.93 (± 0.02)

0.88 (± 0.06)

0.85 (± 0.03)

Total Micro \(F_1\) Score

0.54 (± 0.03)

0.55 (± 0.01)

0.58 (± 0.02)

0.64 (± 0.01)