Table 1 Distribution of relations per article in experimental data sets.

From: Ranking relations between diseases, drugs and genes for a curation task

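PharmGKB data set

| #Rels per Art | Di-Di abs | Di-Di rel (%) | Di-Dr abs | Di-Dr rel (%) | Di-Ge abs | Di-Ge rel (%) | Dr-Dr abs | Dr-Dr rel (%) | Dr-Ge abs | Dr-Ge rel (%) | Ge-Ge abs | Ge-Ge rel (%) | all sum |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 0.1 | 129 | 6.6 | 842 | 43.0 | 29 | 1.5 | 938 | 47.9 | 19 | 1.0 | 1959 |
| 2 | 6 | 0.5 | 138 | 10.9 | 484 | 38.1 | 18 | 1.4 | 611 | 48.1 | 13 | 1.0 | 1270 |
| 3 | 1 | 0.0 | 705 | 26.5 | 925 | 34.8 | 12 | 0.5 | 993 | 37.4 | 22 | 0.8 | 2658 |
| 4 | 9 | 1.2 | 98 | 13.1 | 231 | 30.9 | 21 | 2.8 | 372 | 49.7 | 17 | 2.3 | 748 |
| 5 | 0 | 0.0 | 397 | 24.3 | 575 | 35.2 | 15 | 0.9 | 636 | 38.9 | 12 | 0.7 | 1635 |
| 6 | 7 | 1.1 | 62 | 9.6 | 237 | 36.6 | 19 | 2.9 | 301 | 46.5 | 22 | 3.4 | 648 |
| 7 | 1 | 0.1 | 154 | 20.0 | 293 | 38.1 | 6 | 0.8 | 296 | 38.4 | 20 | 2.6 | 770 |
| 8 | 0 | 0.0 | 155 | 18.5 | 334 | 39.8 | 17 | 2.0 | 320 | 38.1 | 14 | 1.7 | 840 |
| 9 | 12 | 1.6 | 153 | 19.8 | 283 | 36.6 | 12 | 1.6 | 279 | 36.0 | 35 | 4.5 | 774 |
| 10 | 10 | 4.2 | 32 | 13.3 | 74 | 30.8 | 0 | 0.0 | 114 | 47.5 | 10 | 4.2 | 240 |
| 11 | 1 | 0.1 | 205 | 28.2 | 236 | 32.5 | 4 | 0.6 | 270 | 37.2 | 10 | 1.4 | 726 |
| 12 | 12 | 3.4 | 67 | 19.3 | 87 | 25.0 | 11 | 3.2 | 165 | 47.4 | 6 | 1.7 | 348 |
| 13 | 0 | 0.0 | 47 | 18.1 | 100 | 38.5 | 6 | 2.3 | 107 | 41.2 | 0 | 0.0 | 260 |
| 14 | 0 | 0.0 | 52 | 19.5 | 118 | 44.4 | 0 | 0.0 | 93 | 35.0 | 3 | 1.1 | 266 |
| 15 | 7 | 1.7 | 77 | 18.3 | 144 | 34.3 | 0 | 0.0 | 189 | 45.0 | 3 | 0.7 | 420 |
| 16 | 0 | 0.0 | 40 | 19.2 | 100 | 48.1 | 0 | 0.0 | 68 | 32.7 | 0 | 0.0 | 208 |
| 17 | 0 | 0.0 | 39 | 17.6 | 51 | 23.1 | 7 | 3.2 | 106 | 48.0 | 18 | 8.1 | 221 |
| 18 | 0 | 0.0 | 2 | 1.2 | 56 | 34.6 | 0 | 0.0 | 100 | 61.7 | 4 | 2.5 | 162 |
| 19 | 0 | 0.0 | 36 | 23.7 | 59 | 38.8 | 0 | 0.0 | 57 | 37.5 | 0 | 0.0 | 152 |
| 20 | 0 | 0.0 | 127 | 24.4 | 203 | 39.0 | 4 | 0.8 | 166 | 31.9 | 20 | 3.8 | 520 |
| TOTAL | 68 | 0.5 | 2715 | 18.3 | 5432 | 36.6 | 181 | 1.2 | 6181 | 41.7 | 248 | 1.7 | 14825 |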

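CTD data set

| #Rels per Art | Di-Dr abs | Di-Dr rel (%) | Di-Ge abs | Di-Ge rel (%) | Dr-Ge abs | Dr-Ge rel (%) | all sum |
|---|---|---|---|---|---|---|---|
| 1 | 1482 | 23.3 | 1333 | 21.0 | 3539 | 55.7 | 6354 |
| 2 | 2454 | 21.9 | 1539 | 13.7 | 7219 | 64.4 | 11212 |
| 3 | 1806 | 17.6 | 1154 | 11.3 | 7294 | 71.1 | 10254 |
| 4 | 1717 | 15.5 | 994 | 9.0 | 8357 | 75.5 | 11068 |
| 5 | 1144 | 15.3 | 507 | 6.8 | 5824 | 77.9 | 7475 |
| 6 | 1248 | 14.5 | 525 | 6.1 | 6855 | 79.5 | 8628 |
| 7 | 578 | 12.7 | 270 | 6.0 | 3688 | 81.3 | 4536 |
| 8 | 648 | 14.1 | 225 | 4.9 | 3719 | 81.0 | 4592 |
| 9 | 396 | 14.4 | 165 | 6.0 | 2184 | 79.6 | 2745 |
| 10 | 285 | 12.7 | 88 | 3.9 | 1867 | 83.3 | 2240 |
| 11 | 193 | 13.7 | 93 | 6.6 | 1122 | 79.7 | 1408 |
| 12 | 203 | 15.1 | 63 | 4.7 | 1078 | 80.2 | 1344 |
| TOTAL | 12154 | 16.9 | 6956 | 9.7 | 52746 | 73.4 | 71856 |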

  1. This table shows how many relations of each entity-pair type occur per article in both data sets. PharmGKB contains some relations between entities of the same type, whereas CTD contains only relations between entities of different types. To keep the tables for both databases easily comparable, CTD entities of type "chemical" are labeled "Dr" (drugs).
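
The "rel" columns appear to give each "abs" count as a percentage of that row's "sum", rounded to one decimal place. This reading is inferred from the numbers themselves and is not stated in the source. A minimal Python sketch under that assumption, using the first PharmGKB row (the function name `relative_shares` is hypothetical):

```python
def relative_shares(abs_counts):
    """Return each count as a percentage of the row total, rounded to one decimal.

    Assumption: "rel" = 100 * abs / sum, which matches the table values
    but is not spelled out in the source.
    """
    total = sum(abs_counts.values())
    if total == 0:
        # Avoid division by zero for an empty row.
        return {pair: 0.0 for pair in abs_counts}
    return {pair: round(100.0 * count / total, 1) for pair, count in abs_counts.items()}


# First PharmGKB row (articles with one relation), copied from the table.
row_1 = {"Di-Di": 2, "Di-Dr": 129, "Di-Ge": 842, "Dr-Dr": 29, "Dr-Ge": 938, "Ge-Ge": 19}
shares = relative_shares(row_1)
```

Under this assumption, `shares["Di-Ge"]` matches the 43.0 listed for Di-Ge in that row, and the row total matches the listed sum of 1959.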