Abstract
The emerging research area of opinion mining deals with computational methods in order to find, extract and systematically analyze people’s opinions, attitudes and emotions towards certain topics. While providing interesting market research information, the user generated content existing on the Web 2.0 presents numerous challenges regarding systematic analysis, the differences and unique characteristics of the various social media channels being one of them. This article reports on the determination of such particularities, and deduces their impact on text preprocessing and opinion mining algorithms. The
effectiveness of different algorithms is evaluated in order to determine their applicability to the various social media channels. Our research shows that text preprocessing algorithms are mandatory for mining opinions on the Web 2.0 and that part of these algorithms are sensitive to errors and mistakes contained in the user generated content.
Original language | English |
---|---|
Pages (from-to) | 899-908 |
Number of pages | 10 |
Journal | INFORMATION PROCESSING & MANAGEMENT |
Volume | 50 |
Issue number | 6 |
DOIs | |
Publication status | Published - Nov 2014 |
Keywords
- Opinion Mining
- Noisy text
- Text Preprocessing
- User generated Content
- Data Mining
- Text preprocessing
- User generated content
- Opinion mining
- Data mining