AutoML analysis, optimisation and comparison

  • Claudia Klausgraber

Student thesis: Master's Thesis

Abstract

This master’s thesis deals with the examination of the fundamental properties of Automated Machine Learning (AutoML) and its distinction from traditional methods of
machine learning (ML). The central research question is how the three AutoML libraries
Auto-sklearn, TPOT, and AutoGluon function and in what ways they can be optimised.
To answer this question, a detailed comparative analysis of the three AutoML frameworks was conducted. The methodology included a comprehensive literature review and
the experimental evaluation of the frameworks using binary classification and regression tasks, with particular focus on the optimisation potential of the individual libraries
through data preprocessing.
The attempt to optimise the AutoML frameworks through data preprocessing showed
that not every type of preprocessing is suitable. It was found that the traditional preprocessing steps LDA and kPCA led to poorer results than when the models were trained
on the original data. It was also shown that conventional preprocessing steps led to the
best results in the classification task.
Therefore, it can be said that data preprocessing offers possibilities to improve the results of AutoML, but should be applied with caution. These findings provide valuable
insights for further optimisations and developments in the field of AutoML.
Date of Award2024
Original languageEnglish (American)
SupervisorBogdan Burlacu (Supervisor)

Cite this

'