Open Access Open Access  Restricted Access Subscription or Fee Access

Synonyms Weighted LDA for Product Aspects Extraction

Yun Peng, Changxuan Wan

Abstract


It is important to extract the aspects from the comments of shoppers about certain products. Product aspect descriptions often contain words of same meaning, and discriminating these synonyms effectively can improve the efficiency of aspects extraction. Standard topic model in identifying those words, because of the absent introduction of priori knowledge, will largely ignore the relations of words meaning. An improved aspects extracting model of LDA by adding synonymous words constraint is proposed in this paper. In the process of words clustering analysis, words with high similarity are aggregated in the same cluster. In the stage of aspects extraction, we introduce words clustering and cluster membership weights as priori knowledge to strengthen the original LDA topic model for features clustering, which can improve the accuracy of product aspects extraction. Our experiments show that synonyms weighted LDA proposed in this paper outperforms the original LDA by a large margin.

Keywords


clustering analysis, LDA model, aspects extraction, synonyms cluster.

Full Text:

PDF

Refbacks

  • There are currently no refbacks.


Disclaimer/Regarding indexing issue:

We have provided the online access of all issues and papers to the indexing agencies (as given on journal web site). It’s depend on indexing agencies when, how and what manner they can index or not. Hence, we like to inform that on the basis of earlier indexing, we can’t predict the today or future indexing policy of third party (i.e. indexing agencies) as they have right to discontinue any journal at any time without prior information to the journal. So, please neither sends any question nor expects any answer from us on the behalf of third party i.e. indexing agencies.Hence, we will not issue any certificate or letter for indexing issue. Our role is just to provide the online access to them. So we do properly this and one can visit indexing agencies website to get the authentic information.