
Table 2 Statistical results of semantic queries of differing complexity (complexity is represented by the number of query concepts). All statistics were calculated with MedCalc® [29]. Raw data can be found in Additional file 1: Table S2 in the addendum.

From: Integrating terminologies into standard SQL: a new approach for research on routine data

| Query | “Vitium cordis” | “Respiration disorder” | “Lethal malformation” |
| --- | --- | --- | --- |
| Number of query concepts | 1 | 5 | 16 |
| Sample size (percentage of all cards) | 1868 (100%) | 467 (25%) | 1868 (100%) |
| Sensitivity | 93.22% (95% CI: 89.22–96.08%) | 97.76% (95% CI: 95.79–98.97%) | 92.13% (95% CI: 89.48–94.29%) |
| Specificity | 99.94% (95% CI: 99.66–100.00%) | 95.45% (95% CI: 87.29–99.05%) | 96.30% (95% CI: 95.16–97.24%) |
| Positive predictive value | 99.55% (95% CI: 96.87–99.94%) | 99.24% (95% CI: 97.75–99.75%) | 90.57% (95% CI: 87.75–92.92%) |
| Disease prevalence | 12.57% (95% CI: 11.10–14.15%) | 85.90% (95% CI: 82.41–88.92%) | 27.80% (95% CI: 25.78–29.89%) |
| F score | 0.96 | 0.98 | 0.91 |
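
The table does not state how the F score was derived. As a minimal sketch, assuming it is the usual F1 score (the harmonic mean of precision and recall, with the positive predictive value as precision and the sensitivity as recall), the reported values can be reproduced from the point estimates above. The function name `f_score` and the dictionary `table2` are illustrative and not taken from the source.

```python
def f_score(sensitivity: float, ppv: float) -> float:
    """F1 score: harmonic mean of recall (sensitivity) and precision (PPV).

    Assumption: the table's "F score" is the balanced F1 measure.
    """
    if sensitivity + ppv == 0:
        return 0.0
    return 2 * sensitivity * ppv / (sensitivity + ppv)


# Point estimates copied from the table, written as fractions:
# query name -> (sensitivity, positive predictive value)
table2 = {
    "Vitium cordis": (0.9322, 0.9955),
    "Respiration disorder": (0.9776, 0.9924),
    "Lethal malformation": (0.9213, 0.9057),
}

for query, (sens, ppv) in table2.items():
    print(f"{query}: F = {f_score(sens, ppv):.2f}")
```

Under this assumption the three queries give 0.96, 0.98 and 0.91, which matches the F score row.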