From: tESA: a distributional measure for calculating semantic relatedness
| | MEDLINE | PMC OA | Wikipedia |
|---|---|---|---|
| Size | 14073912 | 1024890 | 3807314 |
| Type | Scientific | Scientific | Encyclopedic |
| Documents | Abstracts and titles | Mostly fulltext + abstracts + titles | Fulltext + titles |
| Snapshot date | Autumn 2015 | September 2015 | December 2015 |
| Token count [M] | 2531.14; 264.84 | 3684.89; 15.8 | 2434.55; 11.13 |
| Unique token count [M] | 3.85; 1.24 | 35.57; 0.48 | 12.53; 0.98 |