Data-based prediction of microbial contamination in herbs and identification of optimal harvest parameters

Stefan Anlauf, Andreas Haghofer, Karl Dirnberger, Stephan Winkler

Research output: Contribution to journalArticlepeer-review

1 Citation (Scopus)

Abstract

The quality of freshly harvested herbs is affected by several crucial factors, such as weather, tillage, fertilization, drying, and the harvesting process, e.g. Our main goal is to learn models that are able to predict spore contaminations in different types of herbs on the basis of information about the harvesting process, transport conditions, drying, and storage conditions. This shall enable us to identify optimal processing parameters, which will allow more effective and cost efficient contamination prevention. Using machine learning, we have generated ensembles of models that predict the risk for spore contamination on the basis of harvest processing parameters. The training information about contamination in herbs is given as results of laboratory analysis data. We applied different modeling algorithms (random forests, gradient boosting trees, genetic programming, and neural networks). In this paper we report on modeling results for yeast and mold contaminations in peppermint and nettle; e.g., for yeast contamination in peppermint we obtained models with 78.13% accuracy. Additionally, we use descriptive statistics to identify those parameters that have a statistically significant influence on the contamination; for example, our analysis shows that there seems to be a relationship between mold in peppermint and information about harrowing and the growth height (p = 0.001).
Translated title of the contributionDatengestützte Vorhersage der mikrobiellen Verunreinigung von Kräutern und Ermittlung der optimalen Ernteparameter
Original languageEnglish
Article number20210027
Pages (from-to)171-183
Number of pages13
JournalInternational Journal of Food Engineering
Volume18
Issue number3
DOIs
Publication statusPublished - 9 Aug 2021

Keywords

  • data science
  • herbs
  • machine learning
  • microbial contamination
  • statistics

Cite this