Please use this identifier to cite or link to this item: http://hdl.handle.net/11718/14038
Title: Statistical Modeling Approach Using Unstructured Data
Authors: Roy, Dipanmoy
D Costa, Daphne
Krishna, Praveen
Keywords: Customer Sentiments;Shapley Value Regression;Star Rating;Ordinal Logistic Regression
Issue Date: 2015
Publisher: Indian Institute of Management, Ahmedabad
Citation: Roy, D., D Costa, D., Krishna, P.. (2015). Statistical Modeling Approach Using Unstructured Data. 4th IIMA International Conference on Advanced Data Analysis, Business Analytics and Intelligence. Indian Institute of Management, Ahmedabad
Series/Report no.: IC 15;046
Abstract: This paper aims at finding the key factors that drive the average star ratings at e-commerce site for any product. The text reviews provided by customers on the websites are analyzed. The customer sentiments scores are calculated from the text reviews using text mining and are placed into various categories. A model is built with sentiment scores of the categories as the independent variable and average star rating as the dependent. The method also finds which categories drive customers from lower star ratings to higher star ratings. Shapley Value regression is used to find major drivers driving average star rating while ordinal logistic regression is used to identify drivers that drive customers from low star ratings to higher star ratings. We have used this methodology on customer reviews about HPs Inkjet printers on Amazon.com website and the recommendations have helped HP to improve its star ratings on the website for its Inkjet products.
URI: http://hdl.handle.net/11718/14038
Appears in Collections:4th IIMA International Conference on Advanced Data Analysis, Business Analytics and Intelligence

Files in This Item:
File Description SizeFormat 
IC 15-046.pdf
  Restricted Access
97.79 kBAdobe PDFView/Open Request a copy


Items in IIMA Institutional Repository are protected by copyright, with all rights reserved, unless otherwise indicated.