On text preprocessing for opinion mining outside of laboratory environments

Gerald Petz, Michal Jan Karpowicz, Harald Fürschuß, Andreas Auinger, Stephan Winkler, Susanne Schaller, Andreas Holzinger

Research output: Chapter in Book/Report/Conference proceedingsConference contributionpeer-review

30 Citations (Scopus)

Abstract

Opinion mining deals with scientific methods in order to find, extract and systematically analyze subjective information. When performing opinion mining to analyze content on the Web, challenges arise that usually do not occur in laboratory environments where prepared and preprocessed texts are used. This paper discusses preprocessing approaches that help coping with the emerging problems of sentiment analysis in real world situations. After outlining the identified shortcomings and presenting a general process model for opinion mining, promising solutions for language identification, content extraction and dealing with Internet slang are discussed.

Original languageEnglish
Title of host publicationActive Media Technology - 8th International Conference, AMT 2012, Proceedings
PublisherSpringer
Pages618-629
Number of pages12
ISBN (Print)9783642352355
DOIs
Publication statusPublished - 2012
Event8th International Conference on Active Media Technology, AMT 2012 - Macau, China
Duration: 4 Dec 20127 Dec 2012

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume7669 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference8th International Conference on Active Media Technology, AMT 2012
Country/TerritoryChina
CityMacau
Period04.12.201207.12.2012

Keywords

  • content extraction
  • Internet slang
  • language detection
  • Opinion mining
  • sentiment analysis
  • text mining
  • Web analytics

Fingerprint

Dive into the research topics of 'On text preprocessing for opinion mining outside of laboratory environments'. Together they form a unique fingerprint.

Cite this