Skip to main content

Table 3 Performance comparison of THEME, DPMFS, EDCM, and EM-MN on the 20-Newsgroup collection

From: Thematic clustering of text documents using an EM-based approach

  THEME DPMFS EDCM EM-MN
News-Different-3 0.847 0.688 0.734 0.867
News-Similar-3 0.103 0.231 0.163 0.081
News-Moderated-6 0.782 0.663 0.531 0.562
  1. THEME, DPMFS, EDCM, and EM-MN are the proposed clustering method, a Dirichlet process mixture model, a Dirichlet compound multinomial model, and an EM-based mixture model, respectively.