From: tESA: a distributional measure for calculating semantic relatedness
 | MEDLINE | PMC OA | Wikipedia |
---|---|---|---|
Size | 14073912 | 1024890 | 3807314 |
Type | Scientific | Scientific | Encyclopedic |
Documents | Abstacts and titles | Mostly fulltext + abstracts + titles | Fulltext + titles |
Snapshot date | Autumn 2015 | September 2015 | December 2015 |
Token count [M] | 2531,14; 264,84 | 3684,89; 15,8 | 2434,55; 11,13 |
Unique token count [M] | 3,85; 1,24 | 35,57; 0,48 | 12,53; 0,98 |