Skip to main content

Table 1 The genres of the documents in the MedEval document collection

From: MedEval — A Swedish medical test collection with doctors and patients user groups

Type of source

Number of documents

Percent of documents

Number of tokens

Percent of tokens

Journals and periodicals

8,453

20.0

5.3 million

34.6

Specialized sites

14,631

34.6

2.9 million

19.1

Pharmaceutical companies

9,200

21.8

2.3 million

14.8

Government, faculties, institutes, and hospitals

2,955

7.0

2.0 million

13.3

Health-care communication companies

4,036

9.6

1.7 million

11.3

Media (TV, daily newspapers)

2,980

7.1

1.0 million

6.9

Total

42,255

100.1

15.2 million

100

  1. The genres and sizes of the MedEval document sources. The MedEval document collection is a snapshot of the MedLex corpus in October 2007. (D. Kokkinakis, p.c.)