Table 6. Protein-residue relation statistics of the silver corpus

From: Literature mining of protein-residue associations with graph rules learned through distant supervision

Parameter                                                                Number
Total number of abstracts                                                18,045
Total number of sentences                                               138,790
Total sentences with protein names                                       41,722
Total sentences with at least one amino acid or mutation                 13,729
Sentences with co-mentions of a protein and an amino acid or mutation     5,256
Sentences with validated protein-residue relations                        2,516
Physically validated protein-residue relations                            2,814
Total abstracts with validated protein-residue relation                   1,728
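
The counts in the table form a filtering funnel, so it can help to express each stage as a proportion of an earlier one. The following is a minimal sketch of that arithmetic. The dictionary keys, the `share` helper, and the choice of denominators are illustrative assumptions and do not come from the paper; only the counts are copied from Table 6.

```python
# Counts copied from Table 6. Key names are hypothetical labels chosen for this sketch.
counts = {
    "abstracts": 18045,
    "sentences": 138790,
    "sentences_with_protein": 41722,
    "sentences_with_residue": 13729,
    "sentences_with_comention": 5256,
    "sentences_with_validated_relation": 2516,
    "validated_relations": 2814,
    "abstracts_with_validated_relation": 1728,
}


def share(part: int, whole: int) -> float:
    """Return part as a percentage of whole, rounded to one decimal place."""
    return round(100.0 * part / whole, 1)


# Each entry pairs a table row with the denominator assumed for this sketch.
summary = {
    "pct_sentences_with_protein": share(
        counts["sentences_with_protein"], counts["sentences"]
    ),
    "pct_sentences_with_residue": share(
        counts["sentences_with_residue"], counts["sentences"]
    ),
    "pct_sentences_with_comention": share(
        counts["sentences_with_comention"], counts["sentences"]
    ),
    "pct_comention_sentences_validated": share(
        counts["sentences_with_validated_relation"],
        counts["sentences_with_comention"],
    ),
    "pct_abstracts_with_validated_relation": share(
        counts["abstracts_with_validated_relation"], counts["abstracts"]
    ),
}

# Average number of validated relations per sentence that has at least one.
relations_per_validated_sentence = round(
    counts["validated_relations"] / counts["sentences_with_validated_relation"], 2
)

for name, value in summary.items():
    print(f"{name}: {value}%")
print(f"relations_per_validated_sentence: {relations_per_validated_sentence}")
```

Because there are more validated relations (2,814) than sentences containing them (2,516), some sentences necessarily carry more than one validated relation.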