Wednesday, March 23, 2016

Topic modeling in sentiment analysis: A systematic review

 

Abstract

With the expansion and acceptance of Word Wide Web, sentiment analysis has become progressively popular research area in information retrieval and web data analysis. 

Due to the huge amount of user-generated contents over blogs, forums, social media, etc., sentiment analysis has attracted researchers both in academia and industry, since it deals with the extraction of opinions and sentiments. 

In this paper, we have presented a review of topic modeling, especially LDA-based techniques, in sentiment analysis. 

We have presented a detailed analysis of diverse approaches and techniques, and compared the accuracy of different systems among them. 

The results of different approaches have been summarized, analyzed and presented in a sophisticated fashion. 

This is the really effort to explore different topic modeling techniques in the capacity of sentiment analysis and imparting a comprehensive comparison among them.

References


Liu, B., Sentiment Analysis and Opinion Mining, Synthesis Lectures on Human Language Technologies, 5(1), pp. 1-167, 2012.


Pang, B. & Lee, L., Opinion Mining and Sentiment Analysis, Foundations and Trends in Information Retrieval, 2(1-2), pp. 1-135, 2008.


Hu, M. & Liu, B., Mining Opinion Features in Customer Reviews, in Proceedings of the Nineteenth National Conference on Artificial Intelligence (AAAI-04), San Jose, USA, vol. 4, pp. 755-760, July 2004.


Hu, M. & Liu, B., Mining and Summarizing Customer Reviews, in Proceedings of the tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-04), Washington, USA, pp. 168-177, ACM, Aug. 2004.


Zhang, L. & Liu, B., Aspect and Entity Extraction for Opinion Mining, Data Mining and Knowledge Discovery for Big Data, pp. 1-40, Springer Berlin Heidelberg, 2014.


Kitchenham, B. A. & Mendes, E., A Comparison of Cross-Company and Within-Company Effort Estimation Models for Web Applications, in Proceedings of the 8th International Conference on Empirical Assessment in Software Engineering (EASE-04), Edinburgh, Scotland, UK, pp. 47-55, May 2004.


Hofmann, T., Unsupervised Learning by Probabilistic Latent Semantic Analysis, Machine Learning, 42(1-2), pp. 177-196, 2001.


Blei, D. M., Ng, A. Y. & Jordan, M. I., Latent Dirichlet Allocation, The Journal of Machine Learning Research, 3, pp. 993-1022, 2003.


Fang, L. & Huang, M., Fine Granular Aspect Analysis Using Latent Structural Models, in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, South Korea: Short Papers-Volume 2, pp. 333-337, Association for Computational Linguistics, July 2012.


Lin, Z., Jin, X., Xu, X., Wang, W., Cheng, X. & Wang, Y., A Cross-Lingual Joint Aspect/Sentiment Model for Sentiment Analysis, in Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management (CIKM-14), Shanghai, China, pp. 1089-1098, ACM, Nov. 2014.


Xueke, X., Xueqi, C. ,Songbo, T., Yue, L. & Huawei, S., Aspect-Level Opinion Mining of Online Customer Reviews, China Communications, 10(3), pp. 25-41, 2013.


Zhai, Z., Liu, B., Xu, H. & Jia, P., Constrained LDA for Grouping Product Features in Opinion Mining, Advances in knowledge discovery and data mining, pp. 448-459, Springer, 2011.


Moghaddam, S. & Ester, M., ILDA: Interdependent LDA Model for Learning Latent Aspects and their Ratings from Online Product Reviews, in Proceedings of the 34th International ACM SIGIR Conference on Research and Development in Information Retrieval(SIGIR-11), Beijing, China, pp. 665-674, ACM, July 2011.


Brody, S. & Elhadad, N., An Unsupervised Aspect-Sentiment Model for Online Reviews, in Human Language Technologies: in Proceedings of the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT-10), Los Angeles, USA, pp. 804-812, Association for Computational Linguistics, June 2010.


Zhao, W.X., Jiang, J., Yan, H. & Li, X., Jointly Modeling Aspects and Opinions with a Maxent-LDA Hybrid, in Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP-10), Massachusetts, USA, pp. 56-65, Association for Computational Linguistics, Oct. 2010.


Jo, Y. & Oh, A. H., Aspect and Sentiment Unification Model for Online Review Analysis, in Proceedings of the Fourth ACM International Conference on Web Search and Data Mining (WSDM-11), Hong Kong, pp. 815-824, ACM, Feb. 2011.


Xu, X., Tan, S., Liu, Y., Cheng, X. & Lin, Z., Towards Jointly Extracting Aspects and Aspect-Specific Sentiment Knowledge, in Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM-12), Maui Hawaii, USA, pp. 1895-1899, ACM, Oct. 2012.


Mukherjee, A. & Liu, B., Aspect Extraction through Semi-Supervised Modeling, in Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Jeju, South Korea: Long Papers-Volume 1, pp. 339-348, Association for Computational Linguistics, July 2012.


Kim, S., Zhang, J., Chen, Z., Oh, A. H. & Liu, S., A Hierarchical Aspect-Sentiment Model for Online Reviews, in Proceedings of the Twenty-Seventh AAAI conference on Artificial Intelligence(AAAI-13), Washington, USA, July 2013.


[Bagheri, A., Saraee, M. & De Jong, F., ADM-LDA: An Aspect Detection Model Based on Topic Modelling Using the Structure of Review Sentences, Journal of Information Science, 40(5), pp. 621-636, 2014.


Gruber, A., Weiss, Y. & Rosen-Zvi, M., Hidden Topic Markov Models, in Proceeding of the 11th International Conference on Artificial Intelligence and Statistics (AISTATS-07), San Juan, Puerto Rico, pp. 163-170, Mar. 2007.


Wang, T., Cai, Y., Leung, H.-f., Lau, R. Y., Li, Q. & Min, H., Product Aspect Extraction Supervised with Online Domain Knowledge, Knowledge-Based Systems, 71, pp. 86-100, 2014.


Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M. & Ghosh, R., R., Leveraging Multi-Domain Prior Knowledge in Topic Models, in Proceedings of the Twenty-Third international joint conference on Artificial Intelligence (IJCAI-13), Beijing, China, pp. 2071-2077, AAAI Press, Aug. 2013.


Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M. & Ghosh, R., Discovering Coherent Topics Using General Knowledge, in Proceedings of the 22nd ACM international conference on Conference on information & knowledge management (CIKM-13), San Francisco, USA, pp. 209-218, ACM, Oct. 2013.


Chen, Z., Mukherjee, A., Liu, B., Hsu, M., Castellanos, M. & Ghosh, R., Exploiting Domain Knowledge in Aspect Extraction, in Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing (EMNLP-13), Seattle, USA, pp. 1655-1667, Oct. 2013.


Chen, Z., Mukherjee, A. & Liu, B., Aspect Extraction with Automated Prior Knowledge Learning, in Proceedings of the 52nd Annual Meeting of the Association of Computational Linguistics (ACL-214), Baltimore, USA, pp. 347-358, June 2014.


Han, J., Cheng, H., Xin, D. & Yan, X., Frequent Pattern Mining: Current status and Future Directions, Data Mining and Knowledge Discovery, 15(1), pp. 55-86, 2007.


Rosen-Zvi, M., Chemudugunta, C., Griffiths, T., Smyth, P. & Steyvers, M., Learning Author-Topic Models from Text Corpora, ACM Transactions on Information Systems (TOIS), 28(1), pp. 1-38, 2010.


Chen, Z. & Liu, B., Topic Modeling Using Topics from Many Domains, Lifelong Learning and Big Data, in Proceedings of the 31st International Conference on Machine Learning (ICML-14), Beijing, China, pp. 703-711, June 2014.


Chen, Z. & Liu, B., Mining Topics in Documents: Standing on the Shoulders of Big Data, in Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-14), New York City, USA, pp. 1116-1125, ACM, Aug. 2014.





DOI: http://dx.doi.org/10.5614%2Fitbj.ict.res.appl.2016.10.1.6

http://journals.itb.ac.id/index.php/jictra/article/view/1442