TY - GEN
T1 - Applications of Large Language Models (LLMs) in Business Analytics – Exemplary Use Cases in Data Preparation Tasks
AU - Nasseri, Mehran
AU - Brandtner, Patrick
AU - Zimmermann, Robert
AU - Falatouri, Taha
AU - Darbanian, Farzaneh
AU - Obinwanne, Tobechi
N1 - Publisher Copyright:
© 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2023
Y1 - 2023
N2 - The application of data analytics in management has become a crucial success factor for the modern enterprise. To apply analytical models, appropriately prepared data must be available. Preparing this data can be cumbersome, time-consuming, and error prone. In the current era of Artificial Intelligence (AI), Large Language Models (LLMs) like OpenAI’s ChatGPT offer a promising pathway to support these tasks. However, their potential in enhancing the efficiency and effectiveness of data preparation remains largely unexplored. In this paper, we apply and evaluate the performance of OpenAI’s ChatGPT for data preparation. Based on four real-life use cases we show, that ChatGPT demonstrates high performance in the context of translating text, assigning products to given categories, classifying sentiments of customer reviews, and extracting information from textual requests. The results of our paper indicate that ChatGPT can be a valuable tool for many companies, helping with daily data preparation tasks. We demonstrated that ChatGPT can handle different languages and formats of data and have shown that LLMs can perform multiple tasks with minimal or no fine-tuning, leveraging their pre-trained knowledge and generalization abilities. However, we have also observed that ChatGPT may sometimes produce incorrect outputs, especially when input data is noisy or ambiguous. We have also noticed that ChatGPT may struggle with tasks that require more complex reasoning or domain-specific knowledge. Future research should focus on improving the robustness and reliability of LLMs for data preparation tasks, as well as on developing more efficient and user-friendly ways to deploy and interact with them.
AB - The application of data analytics in management has become a crucial success factor for the modern enterprise. To apply analytical models, appropriately prepared data must be available. Preparing this data can be cumbersome, time-consuming, and error prone. In the current era of Artificial Intelligence (AI), Large Language Models (LLMs) like OpenAI’s ChatGPT offer a promising pathway to support these tasks. However, their potential in enhancing the efficiency and effectiveness of data preparation remains largely unexplored. In this paper, we apply and evaluate the performance of OpenAI’s ChatGPT for data preparation. Based on four real-life use cases we show, that ChatGPT demonstrates high performance in the context of translating text, assigning products to given categories, classifying sentiments of customer reviews, and extracting information from textual requests. The results of our paper indicate that ChatGPT can be a valuable tool for many companies, helping with daily data preparation tasks. We demonstrated that ChatGPT can handle different languages and formats of data and have shown that LLMs can perform multiple tasks with minimal or no fine-tuning, leveraging their pre-trained knowledge and generalization abilities. However, we have also observed that ChatGPT may sometimes produce incorrect outputs, especially when input data is noisy or ambiguous. We have also noticed that ChatGPT may struggle with tasks that require more complex reasoning or domain-specific knowledge. Future research should focus on improving the robustness and reliability of LLMs for data preparation tasks, as well as on developing more efficient and user-friendly ways to deploy and interact with them.
KW - Business Analytics
KW - ChatGPT
KW - Data Preparation
KW - Language Models (LLMs)
KW - Natural Language Processing (NLP)
UR - http://www.scopus.com/inward/record.url?scp=85178642254&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-48057-7_12
DO - 10.1007/978-3-031-48057-7_12
M3 - Conference contribution
AN - SCOPUS:85178642254
SN - 9783031480560
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 182
EP - 198
BT - HCI International 2023 – Late Breaking Papers - 25th International Conference on Human-Computer Interaction, HCII 2023, Proceedings
A2 - Degen, Helmut
A2 - Ntoa, Stavroula
A2 - Moallem, Abbas
PB - Springer
T2 - 25th International Conference on Human-Computer Interaction, HCII 2023
Y2 - 23 July 2023 through 28 July 2023
ER -