Product Review Analysis using Big Data Analytics

Jovitha Christina .R.H, Loyola-ICAM College of Engineering and Technology Chennai, India; Mary Caroline Nikitha ,Loyola-ICAM College of Engineering and Technology Chennai, India; Kavitha .V ,Loyola-ICAM College of Engineering and Technology Chennai, India

Big Data, HDFS, HIVE, Key Generation, Sentiment Analysis, Sqoop

Sentiment Analysis is the process of using text analytics to mine various data sources for opinions. Often, sentiment analysis is done on the data that is got from the Internet and from various social media platforms. Because the content collected from the internet is unstructured, we need tools that can process and analyze this disparate data. Hence we make use of Big Data to handle the different sources and formats of the structured and unstructured data. In particular consumer reviews of a product are given in textual format, we first parse the reviews and classify them into positive and negative and then send these datasets to the Hadoop File System (HDFS) to analyze them. This helps the purchaser to have some knowledge about the product’s pros and cons and decide which product to buy.
    [1] Bing Liu,"Exploring User Opinions in Recommender Systems", Proceeding of the second KDD workshop on Large Scale Recommender Systems and the Netflix Prize Competition", Aug 24, 2008, Las Vegas, Nevada, USA. [2] Lina Zhou, Pimwadee Chaovalit, “Movie Review Mining: a Comparison between Supervised and Unsupervised Classification Approaches”, Proceedings of the 38th Hawaii International Conference on system sciences, 2005. [3] B. B. Khairullah Khan, Aurangzeb Khan, “Sentence based sentiment classification from online customer reviews,” ACM, 2010. [4] Tomasz Jacha, Ewa Magieraa, Wojciech Froelicha “Application of HADOOP to Store and Process Big Data Gathered from an Urban Water Distribution System “ Procedia Engineering 119 on 2015,pp.1375 – 1380 [5] Y. Chen, S. Alspaugh, D. Borthakur, R. Katz, “Energy efficiency for large-scale mapreduce workloads with significant interactive analysis”, in: Proceedings of the 7th ACM European conference on Computer Systems, EuroSys ’12, 2012,pp. 43–56. [6] F. L. Gang Li, “A clustering-based approach on sentiment analysis,” IEEE, 2010. [7] M. X. Y. S. Haiping Zhang, Zhengang Yu, “Feature- level sentiment analysis for chinese product reviews,” IEEE, 2011. [8] Suresh Ramanujam,R. Nancyamala, J.Nivedha, J.Kokila “Sentiment analysis using big data” [9] Andry Alamsyah, Marisa Paryasto, Feriza J. Putra, Rizal Himmawan, "Network text analysis to summarize online conversations for marketing intelligence efforts in telecommunication industry", Information and Communication Technology (ICoICT) 2016 4th International Conference on, pp. 1-5, 2016.
Paper ID: GRDCF003007
Published in: Conference : National Conference on Computational Intelligence Systems (NCCIS - 2017)
Page(s): 19 - 26