
Table 3 Distribution of data in our different sets

From: Identifying genotype-phenotype relationships in biomedical text

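| Data set        | Sentences | Instances | Positive instances | Negative instances |
|-----------------|-----------|-----------|--------------------|--------------------|
| Training set    | 509       | 845       | 576                | 269                |
| Test set        | 244       | 823       | 536                | 287                |
| Unlabelled data | 408       | 823       | N/A                | N/A                |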
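
As a quick consistency check on the counts in Table 3, the sketch below confirms that positive and negative instances sum to the instance total for each labelled set, and reports the share of positive instances. The dictionary layout and the helper name `positive_fraction` are illustrative assumptions, not part of the original work; the unlabelled data is left out because it has no positive or negative counts.

```python
# Counts copied from Table 3. The dictionary layout is an illustrative assumption.
table3 = {
    "Training set": {"sentences": 509, "instances": 845, "positive": 576, "negative": 269},
    "Test set": {"sentences": 244, "instances": 823, "positive": 536, "negative": 287},
}


def positive_fraction(row):
    """Return the share of positive instances (hypothetical helper)."""
    # Positive and negative counts should add up to the instance total.
    assert row["positive"] + row["negative"] == row["instances"]
    return row["positive"] / row["instances"]


for name, row in table3.items():
    print(f"{name}: {positive_fraction(row):.1%} positive")
```

Under these counts, both labelled sets are skewed towards positive instances (roughly two thirds), which may be worth keeping in mind when reading accuracy-style metrics.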