From: Thematic clustering of text documents using an EM-based approach
Datasets
Number of Documents
Number of Clusters
News-Different-3
300
3
News-Similar-3
News-Moderated-6
600
6
Parkinson's Disease
25992
-
Huntington's Disease
5602