中文文獻
[1]曾韋榮 (2006),結合潛在語意檢索及資訊粒化於資料探勘,碩士論文,國立臺北科技大學商業自動化與管理研究所,臺北。[2]張麗新、王家廞、趙雁南、楊澤紅 (2004),「基於Rel ief 的組合式特徵選擇」,復旦學報(自然科學版),第43卷,第5期,第893-898頁。
[3]廖闊、付建勝、楊萬麟 (2010),「改進的ReliefF 算法用於雷達距離像目標識別」,電子測量與儀器學報,第24卷,第9期,第831-836頁。
英文文獻
[1]Abbasi, A., and Chen, H. (2005), “Applying authorship analysis to extremist-group web forum messages,” IEEE Intelligent Systems, vol. 20, no. 5, pp. 67–75.
[2]Abbasi, A., Chen, H., and Salem, A. (2008), “Sentiment analysis in multiple languages: feature selection for opinion classification in web forums,” ACM Transactions on Information Systems, vol. 26, no. 3, pp. 12:1-12:34.
[3]Arun Kumar, M., and Gopal, M. (2010), “A comparison study on multiple binary-class SVM methods for unilabel text categorization,” Pattern Recognition Letters, vol. 31, no. 11, pp. 1437-1444.
[4]Chang, C. C. and Lin, C. J. (2001), “LIBSVM: A Library for Support Vector Machines,” Software, available at: http://www.csie.ntu.edu.tw/~cjlin/libsvm.
[5]Chaovalit, P., and Zhou, L. (2005), “Movie review mining: a comparison between supervised and unsupervised classification approaches,” In Proceedings of the 38th Hawaii International Conference on System Sciences, pp.1-9.
[6]Chen, E., Lin, Y., Xiong, H., Luo, Q., and Ma, H. (2010), “Exploiting probabilistic topic models to improve text categorization under class imbalance,” Information Processing and Management, vol. 47, no. 2, pp. 202-214.
[7]Chen, L. S., Liu, C. H., and Chiu, H. J. (2011), “A neural network based approach for sentiment classification in the blogosphere,” Journal of Informetrics, vol. 5, no. 2, pp. 313-322.
[8]Chien, W. T., and Tsai, C. S. (2003), “The investigation on the prediction of tool wear and the determination of optimum cutting conditions in machining 17-4PH stainless steel,” Journal of Materials Processing Technology, vol. 140, no. 1-3, pp. 340-345.
[9]Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., and Harshman, R. (1990), “Indexing by latent semantic analysis,” Journal of the American Society for Information Science, vol. 41, no. 6, pp. 391-407.
[10]Das Mohapatra, P.K., Maity, C., Rao, R.S., Pati, B.R., and Mondal, K.C. (2009), “Tannase production by bacillus licheniformis KBR6: optimization of submerged culture conditions by taguchi DOE methodology,” Food Research International, vol. 42, no. 4, pp. 430-435.
[11]Fern&;aacute;ndez, A., del Jesus, M. J., and Herrera, F. (2009), “On the influence of an adaptive inference system in fuzzy rule based classification systems for imbalanced data sets,” Expert Systems with Application, vol. 36, pp. 9805-9812.
[12]Garc&;iacute;a, V., S&;aacute;nchez, J. S., and Mollineda, R. A. (2011), “On the effectiveness of preprocessing methods when dealing with different levels of class imbalance,” Knowledge-Based Systems, vol. 25, no. 1, pp. 13-21.
[13]Gunn, S. R. (1998), “Support vector machines for classification and regression,” Technical Report, University of Southampton, UK.
[14]Gomez, J. C., and Moens, M. F. (2012), “PCA document reconstruction for email classification,” Computational Statistics and Data Analysis, vol. 56, no. 3, pp. 741-751.
[15]Huang, Y., McCullagh, P. J., Black, N. D. (2009), “An optimization of ReliefF for classification in large datasets,” Data &; Knowledge Engineering, vol. 68, no. 11, pp. 1348-1356.
[16]Hong, C. W. (2011), “Using the Taguchi method for effective market segmentation,” Expert Systems with Applications, doi:10.1016/j.eswa.2011.11.040.
[17]Kira, K., and Rendell, L. A. (1992), “The feature selection problem traditional methods and a new algorithm,” In Proceedings of 9th National Conference on Artificial Intelligence, pp. 129-134.
[18]Kononenko, I. (1994), “Estimating attributes: analysis and extensions of Relief,” In Proceedings of the European Conference on Machine Learning, pp. 171-182.
[19]Kontostathis, M., and Pottenger, W. M. (2006), “A framework for understanding latent semantic indexing (LSI) performance,” Information Processing and Management, vol. 42, no. 1, pp. 56-73.
[20]Keshtkar, F., and Inkpen, D. (2009), “Using sentiment orientation features for mood classification in blogs,” IEEE International Conference on Natural Language Processing and Knowledge Engineering.
[21]Li, B., Xu, S., and Zhang, J. (2007), “Enhancing clustering blog documents by utilizing author/reader comments,” In Proceedings of the 45th Annual Southeast Regional Conference, pp. 94-99.
[22]Li, S., Zhou, G., Wang, Z., Lee, S. Y. M., and R. Wang (2011), “Imbalanced sentiment classification,” Proceedings of the 20th ACM International Conference on Information and Knowledge Management , pp. 2469-2472.
[23]Liu, B., Hu, M., and Cheng, J. (2005), “Opinion observer: analyzing and comparing opinions on the web,” In Proceedings of the 14th International Conference on World Wide Web, pp. 342-351.
[24]Liu, Y., Loh, H. T., and Sun, A., (2009), “Imbalanced text classification: a term weighting approach,” Expert Systems with Applications, vol. 37, no. 1, pp. 690-701.
[25]Liu, Y., Yu, X., Huang, J. X., and An, A. (2010), “Combining integrated sampling with SVM ensembles for learning from imbalanced datasets,” Information Processing and Management, vol. 47, no. 4, pp. 617-631.
[26]Meng, J., Lin, H., and Yu, Y. (2011), “A two-stage feature selection method for text categorization,” Computers and Mathematics with Applications, vol. 62, no. 7, pp. 2793-2800.
[27]Mallick, K., and Bhattacharyya, S. (2012), “Uncorrelated local maximum margin criterion: an efficient dimensionality reduction method for text classification,” Procedia Technology, vol. 4, pp. 370-374.
[28]Ogura, H., Amano, H., and Kondo, M. (2010), “Distinctive characteristics of a metric using deviations from Poisson for feature selection,” Expert Systems with Applications, vol. 37, no. 3, pp. 2273–2281.
[29]Ogura, H., Amano, H., and Kondo, M. (2011), “Comparison of metrics for feature selection in imbalanced text classification,” Expert Systems with Applications, vol. 38, no. 5, pp. 4978–4989.
[30]O’Keefe, T., and Koprinska, I. (2009), “Feature selection and weighting methods in sentiment analysis,” In proceedings of the 14th Australasian Document Computing Symposium.
[31]Quinlan, J. R. (1993), “C4.5: programs for machine learning,” Morgan kaufmann, San Mateo, CA.
[32]Sebastiani, F. (2002), “Machine learning in automated text categorization,” ACM Computing Surveys, vol. 34, no. 1, pp. 1–47.
[33]Simeon, M., and Hilderman, R. (2008), “Categorical proportional difference: A feature selection method for text categorization,” In Proceedings of the 17th Australasian Data Mining Conference.
[34]Stamatatos, E. (2008), “Author identification: using text sampling to handle the class imbalance problem,” Information Processing and Management, vol. 44, no. 2, pp. 790-799.
[35]Sun, A., Lim, E. P., and Liu, Y. (2009), “On strategies for imbalanced text classification using SVM: a comparative study,” Decision Support Systems, vol. 48, no. 1, pp. 191-201.
[36]Tan, S., and Zhang, J. (2008), “An empirical study of sentiment analysis for chinese documents,” Expert Systems with Applications, vol. 34, no. 4, pp. 2622-2629.
[37]Tang, H., Tan, S., and Cheng, X. (2009), “A survey on sentiment detection of reviews,” Expert Systems with Applications, vol. 36, no. 7, pp. 10760-10773.
[38]Tong, L. I., Chang, Y. C., and Lin, S. H. (2011), “Determining the optimal re-sampling strategy for a classification model with imbalanced data using design of experiments and response surface methodologies,” Expert Systems with Applications, vol. 38, no. 4, pp. 4222-4227.
[39]Uğuz, H. (2011), “A two-stage feature selection method for text categorization by using information gain, principal component analysis and genetic algorithm,” Knowledge-Based Systems, vol. 24, no. 7, pp. 1024-1032.
[40]Uysal, A., and Gunal, S. (2012), “A novel probabilistic feature selection method for text classification” Knowledge-Based Systems, doi: 10.1016/j.knosys.2012.06.005.
[41]van Halteren, H. (2004), “Linguistic profiling for author recognition and verification,” Proceedings of the 42nd annual meeting of the association for computational linguistics, pp. 199–206.
[42]Vapnik, V. N. (1995), The Nature of Statistical Learning Theory, Springer-Verlag.
[43]Weiss, G., and Provost, F. (2003), “Learning when training data are costly: the effect of class distribution on tree induction,” Journal of Artificial Intelligence Research, vol. 19, no. 1, pp. 315-354.
[44]Whitelaw, C., Garg, N., and Argamon, S. (2005), “Using appraisal groups for sentiment analysis,” In proceedings of the ACM 14th Conference on Information and Knowledge Management, pp. 625-631.
[45]Wu, C. H., Chuang, Z. J., and Lin, Y. C. (2006), “Emotion recognition from text using semantic labels and separable mixture models,” ACM Transactions on Asian Language Information Processing, vol. 5, no. 2, pp. 165-182.
[46]Ye, Q., Zhang, Z., and Law, R. (2009), “Sentiment classification of online reviews to travel destinations by supervised machine learning approaches,” Expert Systems with Applications, vol. 36, no. 3, pp. 6527-6535.
[47]Yusoff, N., Ramasamy, M., and Yusup, S. (2011), “Taguchi’s parametric design approach for the selection of optimization variables in a refrigerated gas plant,” Chemical Engineering Research and Design, vol. 89, no. 6, pp. 665-675.
[48]Yang, J., Liu, Y., Zhu, X., Liu, Z., Zhang, X. (2012), “A new feature selection based on comprehensive measurement both in inter-category and intra-category for text categorization,” Information Processing and Management, vol. 48, no. 4, pp. 741-754.
[49]Zhang, J. Z., Chen, J. C., and Kirby, E. D. (2007), “Surface roughness optimization in an end-milling operation using the Taguchi design method,” Journal of Materials Processing Technology, vol. 184, no. 1-3, pp. 233-239.
[50]Zheng, Z., Wu, X., and Srihari, R. (2004), “Feature selection for text categorization on imbalanced data,” ACM SIGKDD Explorations Newsletter, vol. 6, no. 1, pp. 80-89.
[51]Zhang, W., Yoshida, T., and Tang, X. (2011), “A comparative study of TF*IDF, LSI and multi-words for text classification,” Expert Systems with Applications, vol. 38, no. 3, pp. 2758-2765.