一、中文部分
(一)圖書
林傑斌、劉明德、陳湘,「資料採掘與 OLAP 理論與實務」,台北:文魁書局,2002。
(二)期刊
陳光華,"資訊檢索查詢之自然語言處理",中國圖書館學會會報,第 57 期, 85年 12月,頁 141 - 153 。曾元顯,"分類不一致對文件自動分類效果的影響",大學圖書館,9卷1期,2005年3月,頁 11-13簡立峰,"尋易系統(Csmart)與中文智慧型資訊檢索",資訊傳播與圖書館學, 3卷 2期, 85年 12月,頁28-37。二、英文部分
(一)圖書
Wordnet: An Electronic Lexical Database , pp. xviii-xix
(二)期刊
Aggrawal, C. C., & Yu, P. S., "Finding Generalized Projected Clusters in High Dimensional Spaces," Proceedings of the 2000 ACM SIGMOD international conference on Management of data, Dallas, New York: ACM Press, 2000, 70-81.
Cutting, D. R. Karger, J. O. Pedersen, and J. W. Tukey. “Scatter/gather: A cluster-based approach to browsing large document collections,” Proceedings of the 15th ACM-SIGIR Conference, 1992, pp. 318-329.
Dubes, R. C. & Jain, A. K., "Algorithms for Clustering Data," New Jersey: Prentice Hall, 1988.
Frakes, W. B. & Baezay, R., "Information Retrieval: Data Structures and Algorithms," New Jersey: Prentice-Hall, 1992.
Franca Debole and Fabrizio Sebastiani, “An Analysis of the Relative Hardness of Reuters-21578 Subsets” to appear in Journal of the American Society for Information Science and Technology.
Griffith, A., Luckhurst, H. C. & Willet, P., "Using Inter-Document Similarity Information in Document Retrieval Systems," Journal of the American Society for Information Sciences, Vol. 37, No. 1, 1986, pp. 3-11.
Ido Dagan and Ronen Feldman, "Keyword-based browsing and analysis of large document ets," Proceedings of the Symposium on Document Analysis and Information Retrieval (SDAIR-96), Las Vegas, Nevada, 1996.
Joseph B. Kruskal, "Multidimensional Scaling and Other Methods for Discovering Structure," pp. 296-339 in "Statistical Methods for Digital Computers" edited by Kurt Enslein, Anthony Ralston, and Herbert S. Wilf, Wiley: New York, 1977.
Krista Lagus, Samuel Kaski, and Teuvo Kohonen, “Mining Massive Document Collections by the WEBSOM Method,” Information Sciences, Vol 163/1-3, pp. 135-156, 2004.
Lee-Feng Chien, "PAT-Tree Based Keyword Extraction for Chinese Information Retrieval" CM SIGIR 1997.
Liu, T., Liu, S. & Chen, Z., 2003, "An Evaluation on Feature Selection for Text lustering," Proceedings of the Twentieth International Conference on Machine Learning, Washington, CA: AAAI Press, pp. 488-495.
Marti A. Hearst and Jan O. Pedersen, "Reexamining the Cluster Hypothesis: Scatter/
Gather n Retrieval Results," Proceedings of the 19th ACM-SIGIR Conference, 1996, pp. 76-84.
Mehran Sahami, Salim Yusufali, and Michelle Q. W. Baldonaldo, “SONIA: A Service for Organizing Networked Information Autonomously,” Proceedings of the 3rd ACM Conference on Digital Libraries, 1998, pp. 200-209.
Michele Banko, Vibhu O. Mittal, and Michael J. Witbrock, “Headline Generation Based on Statistical Translation,” ACL 2000.
Oren Zamir and Oren Etzioni, “Web document clustering: a feasibility demonstration,” Proceedings of the 21st ACM-SIGIR Conference, 1998, pp. 46-54.
Paul E. Kennedy, Alexander G. Hauptmann, "Automatic title generation for EM," Proceedings of the 5th ACM Conference on Digital Libraries, 2000, pp.
Ron Bekkerman, Ran El-Yaniv, Yoad Winter, Naftali Tishby, “On Feature Distributional Clustering for Text Categorization,” Proceedings of the 24th ACM-SIGIR Conference, 2001, pp.146-153.
Russell Swan and James Allan, "Automatic Generation of Overview Timelines," Proceedings of the 23rd ACM-SIGIR Conference, 2000, pp. 49-56.
Salton, G. & Buckley, C., "Term Weighting Approaches in Automatic Information Retrieval," Journal of Information Proceeding and Management, Vol. 24, No. 5, 1988, pp. 513-524.
Salton, G., "Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer," New York: Addison-Wesley, 1989.
Steinbach, M., Karypis, G. & Kumar, V., "A Comparison of Document Clustering Techniques," Technical Report 00-034, Computer Science and Engineering, University of Minnesota, 2000.
Wei,C.P.,Hu P.J.&Dong,Y.X.,"Managing Document Categories in E-Commerce Environments:an Evolution-Based Approach,,"European Journal of Information Systems,Vol.11,No.3,2002,pp.208-222
William B. Frakes and Ricardo Baeza-Yates, Information Retrieval: Data Structure and Algorithms, Prentice Hall, 1992.
(三)網路資源
Antti Arppe, "Term Extraction from Unrestricted Text,",1995 <http://www.lingsoft.fi/doc/nptool/term-extraction.html>
David D. Lewis, “Reuters-21578 text categorization test collection, Distribution 1.0” README file (v 1.2), 1997 <http://www.research.att.com/~lewis/>
Document Understanding Conferences<http://www-nlpir.nist.gov/projects/duc/.>
Jean Godby, "Two Techniques for the Identification of Phrases in Full Text," <http://www.oclc.org/oclc/research/publications/review94/part1/twotech.htm>
Jen-Nan Chen, Jyun-Sheng, Chang and Huey-Chyun Chen, "Using Word Segmentation Model for Compression of Chinese Text"
<http:// nlplab.cs.nhtu.edu.tw/~mathis/own/html/PAPER/JNL/95/cpcol/ CPCOL95.htm>
Mathis H. C. Chen, Tsong-Yi Tseng, Jason J. S. Chang, "Automatic Generation of Indices or Chinese Books," <http://nlplab.cs.nthu.edu.tw/~mathis/own/html/ PAPER/JNL/96/cpcol/BookIdx.htm>