中文參考書目(本資料按作者姓氏筆劃、發表時間依次進行排序)
[1]卜小蝶,"網路使用者檢索詞彙主題分類探析",台灣大學圖書資訊學系四十週年系慶研討會, 2001年11月16日,頁113。
[2]沈時宇,”網路新聞分類及訂閱系統”,中正大學資訊工程所碩士論文, 2002年。[3]陳淑美,”財經新聞自動分類之研究”,台灣大學圖書資訊學系碩士論文,1992年。[4]張政義,”網際網路上電子新聞追蹤系統的建立與評估”,輔仁大學圖書資訊學系碩士論文,1998年。[5]曾元顯、莊大衛,”文件自我擴展於自動分類之應用”,第十五屆計算機語言學研討會論文集,P129-141,2003年。
[6]曾元顯,”文件主題自動分類成效因素探討”,「中國圖書館學會會報」,2002年6月,第 68 期,P62-83.[7]曾元顯, 第一章數位文件關鍵特徵之自動擷取, 數位文件之資訊擷取與檢索, 269 頁, 2000年9月, ISBN 957-99750-3-2 , 全壘打文化事業有限公司出版.
[8]楊允言,”文件自動分類及其相似性排序”, 清華大學資訊科學學系碩士論文,1993年。[9]錢炳全、廖雙德,”中文試題自動分類方法”, 第七屆人工智慧與應用研討會(TAAI2002)論文集,A4-5 P125-130頁,2002年。
[10]蔣俊霞,”中文文件自動分類之探討”,淡江大學資訊工程研究所碩士論文,1994年。[11]顧皓光、莊裕澤,”網路文件自動分類”,八十六年全國計算機會議論文集,D25-30頁,1997年。
英文參考書目(本資料按作者姓氏字母序、發表時間依次進行排序)
[1]A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society, Series B, 39(1):1-38, 1977.
[2]Amit Singhal and Fernando Pereira, “Document Expansion for Speech Retrieval,” Proceedings of the 22th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1999, P.34-41.
[3]Amit Singhal, Gerard Salton, and Chris Buckley, "Length Normalization in Degraded Text Collections," Proceedings of Fifth Annual Symposium on Document Analysis and Information Retrieval, April 15-17, 1996, pp. 149-162.
[4]Anne Kao, “Re: Reuters Corpus problems,” trecfiltering@list.research.microsoft. com, Oct. 2, 2001.
[5]Da-Wei Juang and Yuen-Hsien Tseng, "Uniform Indexing and Retrieval Scheme for Chinese, Japanese, and Korean," Proceedings of the Third NTCIR Workshop on Evaluation of Information Retrieval, Automatic Text Summarization and Question Answering, Oct. 8-10, 2002, Tokyo, Japan, P.137-141.
[6]David D. Lawis, Yiming Yang, Tony Rose and Fan Li, ”RCV1:A New Benchmark Collection for Text Categorization Research”, Journal of Machine Learning Research 5, 2004, P361-397.
[7]Dunja Mladenic, etc, "Feature selection for unbalanced class distribution and Naive Bayes," Proceedings of the International Conference on Machine Learning (ICML’98), 1998, http://www.cs.cmu.edu/~TextLearning/pww/yplanet.html.
[8]Chidanand Apt, Fred Damerau and Sholom M. Weiss, “Towards Language Independent Automated Learning of Text Categorization Models,” Proceedings of the 17th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1994, P23 – 30.
[9]Hwee Tou Ng, Wei Boon Goh and Kok Leong Low, "Feature Selection, Perception Learning, and a Usability Case Study for Text Categorization," Proceedings of the 20th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1997, P67 - 73.
[10]K. Nigam, A. McCallum, S. Thrun, and T. Mitchell, "Text classification from labeled and unlabeled documents using EM," Machine Learning, 39(2/3):103-134, 2000.
[11]Kamal Nigam and Rayid Ghani, "Analyzing the Effectiveness and Applicability of Co-training," Proceedings of the ninth international conference on information and knowledge management CIKM 2000, McLean, Virginia, United States, P86 – 93.
[12]Khalid Al-Kofahi, Alex Tyrrell, Arun Vachher, Tim Travers, and Peter Jackson, "Combining Multiple Classifiers for Text Categorization," Proceedings of the Tenth International Conference on Information and Knowledge Management 2001, Atlanta, Georgia, USA, P97-104.
[13]Leah S. Larkey and W. Bruce Croft, “Combining Classifiers in Text Categorization,” Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1996, P289 – 297.
[14]M. Kreines, “Reuters Corpus problems,” trecfiltering@list.research.microsoft. com, Oct. 2, 2001.
[15]Ron Bekkerman, Ran El-Yaniv, Yoad Winter, Naftali Tishby, “On Feature Distributional Clustering for Text Categorization,” Proceedings of the 24th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2001, P146-153.
[16]Susan Dumais, John Platt, David Heckerman and Mehran Sahami, “Inductive Learning Algorithms and Representations for Text Categorization,” Proceedings of the 1998 ACM 7th international Conference on Information and Knowledge Management, 1998, P148 – 155.
[17]Thorsten Joachims, SVMlight: Support Vector Machine, version 5, http://svmlight.joachims.org/, 2002/03/07.
[18]Thorsten Joachims, "A Statistical Learning Model of Text Classification for Support Vector Machines," Proceedings of the 23rd Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2001, P.128-136.
[19]Thorsten Joachims, “Text Categorization with Support Vector Machines: Learning with Many Relevant Features,” Proceedings of the European Conference on Machine Learning, 1998, Berlin, pp. 137-142.
[20]Vladimir N. Vapnik, The Nature of Statistical Learning Theory. Springer, 1995.
Platt, J. “Fast Training of SVMs using Sequential Minimal Optimization,” in B. Scholkopf, C. Burges, and A. Smola (Eds.) Advances in Kernel Methods – Support Vector Learning, MIT Press, 1998.
[21]Wai Lam, Kwok-Yin Lai, “A Meta-Learning Approach for Text Categorization,” Proceedings of the 23rd Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2001, pp.303-309.
[22]William B. Frakes and Ricardo Baeza-Yates, Infomation Retrieval: Data Structure and Algorithms, Prentice Hall, 1992.
[23]William W. Cohen and Yoram Singer, “Context-Sensitive Learning Methods for Text Categorization,” Proceedings of the 19th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1996, P307 – 315.
[24]Yiming Yang, “A Study on Thresholding Strategies for Text Categorization”, Proceedings of the 23rd Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 2001, P137-145.
[25]Yiming Yang and Xin Liu, “A Re-Examination of Text Categorization Methods,” Proceedings of the 22nd Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, 1999, P42 – 49.
[26]Yiming Yang and J. Pedersen, “A Comparative Study on Feature Selection in Text Categorization,” Proceedings of the International Conference on Machine Learning (ICML’97), 1997, P412-420.
[27]Yuen-Hsien Tseng and Da-Wei Juang, "Document-Self Expansion for Text Categorization," Proceedings of the 26th International ACM SIGIR Conference on Research and Development in Information Retrieval - SIGIR '03, July 28 - Aug. 1, Toronto, Canada, 2003, P.399-400.
[28]Yuen-Hsien Tseng, "Automatic Cataloguing and Searching for Retrospective Data by Use of OCR Text", Journal of American Society for Information Science and Technology, Vol. 52, No. 5, 2001, pp. 378-390.
[29]Yuen-Hsien Tseng and Douglas W. Oard, "Document Image Retrieval Techniques for Chinese" Proceedings of the Fourth Symposium on Document Image Understanding Technology, Columbia Maryland, April 23-25th, 2001, pp. 151-158.