Evaluation of Clustering with PCA for Market Segmentation: A Study Using Simulated and Surrogate Data

Research output: Contribution to journalConference articlepeer-review

Abstract

Market segmentation is a strategy crucial for manufacturing enterprises, enabling targeted approaches and actions aligned with specific business objectives. By segmenting the market, companies can identify profitable segments and gain a competitive advantage through superior market positioning. Principal Component Analysis (PCA) is a common statistical technique used for dimensionality reduction and feature extraction, and PCA-based clustering is a data-driven approach applied for identifying market segments. However, effective segmentation requires systematic application of methods, appropriate selection of principal components, and proper clustering validation measures. In this paper, we propose a framework to evaluate PCA-based clustering for segmentation using simulated and surrogate data along with several internal and external validation clustering measures. This framework can be applied to dimensionality reduction methods to obtain stable clustering solutions, which can be effectively optimized using various validation measures; it can also provide explainable insight into segments identified by clustering with PCA. To illustrate the effectiveness of our proposed framework, we present a case study. We simulate clustering data and use e-car data as surrogate data to evaluate the clustering results, employing k-means as the clustering method.

Original languageEnglish
Pages (from-to)2063-2075
Number of pages13
JournalProcedia Computer Science
Volume253
DOIs
Publication statusPublished - 2025
Event6th International Conference on Industry 4.0 and Smart Manufacturing, ISM 2024 - Prague, Czech Republic
Duration: 13 Nov 202415 Nov 2024

Keywords

  • clustering
  • clustering data simulation
  • data-driven business analytics
  • external validation measures
  • internal
  • k-means
  • Market segmentation

Fingerprint

Dive into the research topics of 'Evaluation of Clustering with PCA for Market Segmentation: A Study Using Simulated and Surrogate Data'. Together they form a unique fingerprint.

Cite this