Skip to main content

Table 2 List of syntactic and surface features

From: Identifying genotype-phenotype relationships in biomedical text

Features

Description

Syntactic features

Stemmed version of relationship term in the Least Common Ancestor (LCA) node of the two entities

If the head6 of the LCA node of the two entities in the syntax tree is a relationship term then this feature takes a stemmed version of the head word as its value, otherwise it takes a NULL value.

The label of each of the constituents in the path between the LCA and each entity combined with its distance from the LCA node

 

Surface features

Relationship terms and their relative positions

The relationship terms between two entities or within a short distance (4 tokens) from them.