Fig. 3
From: We are not ready yet: limitations of state-of-the-art disease named entity recognizers
NER results for all tested ML algorithms. The F1-score is shown for the test set that belongs to the training set (corresponding test set) and for the test set of the respective other data set.