Article

The effects of pre-processing strategies in sentiment analysis of online movie reviews

Harnani Mat Zin, Norwati Mustapha, Masrah Azrifah Azmi Murad, Nurfadhlina Mohd Sharef
Faculty of Computer Science and Information Technology, University Putra Malaysia, Selangor, Malaysia
2017, en
ABI

Abstract

With the ever-increasing use of internet applications and social networking sites, people can now easily express their feelings about products and services. These online reviews are an important source for further analysis and improved decision making. They are mostly unstructured and therefore need processing, such as sentiment analysis and classification, to yield meaningful information for future use. In text analysis tasks, the selection of appropriate words/features has a large impact on the effectiveness of the classifier. This paper therefore explores the effect of pre-processing strategies on the sentiment analysis of online movie reviews. A supervised machine learning method was used to classify the reviews: a support vector machine (SVM) with linear and non-linear kernels. The performance of the classifier is examined in terms of precision, recall, F-measure, and accuracy. Two feature representations were used: term frequency and term frequency-inverse document frequency. The results show that the pre-processing strategies have a significant impact on the classification process.
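
The abstract names the building blocks without giving details, so the following is only a minimal sketch of how they might fit together, not the authors' pipeline. The stop-word list, the tokenization rule, the `log(N / df)` form of inverse document frequency, and all function names are assumptions made for illustration. The SVM training step is left out; the sketch covers pre-processing, the two feature representations, and the four evaluation metrics.

```python
import math
import re
from collections import Counter

# Hypothetical stop-word list, for illustration only.
STOP_WORDS = {"the", "a", "an", "is", "was", "and", "of", "this", "it"}


def preprocess(text, remove_stop_words=True):
    """Lowercase, keep alphabetic tokens, optionally drop stop words."""
    tokens = re.findall(r"[a-z]+", text.lower())
    if remove_stop_words:
        tokens = [t for t in tokens if t not in STOP_WORDS]
    return tokens


def term_frequency(tokens):
    """Raw count of each term in one document."""
    return dict(Counter(tokens))


def tf_idf(docs_tokens):
    """TF-IDF weights per document, assuming idf = log(N / df)."""
    n_docs = len(docs_tokens)
    # Document frequency: number of documents containing each term.
    df = Counter(term for tokens in docs_tokens for term in set(tokens))
    return [
        {term: count * math.log(n_docs / df[term])
         for term, count in Counter(tokens).items()}
        for tokens in docs_tokens
    ]


def binary_metrics(y_true, y_pred, positive=1):
    """Precision, recall, F-measure, and accuracy for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    correct = sum(t == p for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f_measure = 2 * precision * recall / total if total else 0.0
    return {
        "precision": precision,
        "recall": recall,
        "f_measure": f_measure,
        "accuracy": correct / len(pairs),
    }
```

Switching `remove_stop_words` on and off changes the vocabulary that the classifier sees, which is the kind of pre-processing choice whose effect the paper evaluates. Under the assumed idf, a term that occurs in every document receives a TF-IDF weight of zero, whereas plain term frequency keeps its raw count.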

Citations and sources

Citations: 3; sources used: 0