TY - GEN
T1 - Identification of Similarities and Clusters of Bread Baking Recipes Based on Data of Ingredients
AU - Anlauf, Stefan
AU - Lasslberger, Melanie
AU - Grassmann, Rudolf
AU - Himmelbauer, Johannes
AU - Winkler, Stephan
N1 - Funding Information:
The research reported in this paper has been funded by BMK, BMDW, and the State of Upper Austria in the frame of the COMET Programme managed by FFG. The project is a cooperation between University of Applied Sciences Upper Austria, Software Competence Center Hagenberg and backaldrin International The Kornspitz Company GmbH. The data was provided by the company backaldrin International The Kornspitz Company GmbH. Special thanks to Sebastian Dorl for proofreading this paper.
Publisher Copyright:
© 2022 The Authors.
PY - 2022
Y1 - 2022
N2 - We define the similarity of bakery recipes and identify groups of similar recipes using different clustering algorithms. Our analyses are based on the relative amounts of ingredients included in the recipes. We use different clustering algorithms to find the optimal clusters for all recipes, namely k-means, k-medoid, and hierarchical clustering. In addition to standard similarity measures we define a similarity measure using the logarithm of the original data to reduce the impact of raw materials that are used in large quantities. Clustering recipes based on their ingredients can improve the search for similar recipes and therefore help with the time-consuming process of developing new recipes. Using the k-medoid method, we can separate 1271 recipes into six different clusters. We visualize our results via dendrograms that represent the hierarchical separation of the recipes into individual groups and sub-groups.
AB - We define the similarity of bakery recipes and identify groups of similar recipes using different clustering algorithms. Our analyses are based on the relative amounts of ingredients included in the recipes. We use different clustering algorithms to find the optimal clusters for all recipes, namely k-means, k-medoid, and hierarchical clustering. In addition to standard similarity measures we define a similarity measure using the logarithm of the original data to reduce the impact of raw materials that are used in large quantities. Clustering recipes based on their ingredients can improve the search for similar recipes and therefore help with the time-consuming process of developing new recipes. Using the k-medoid method, we can separate 1271 recipes into six different clusters. We visualize our results via dendrograms that represent the hierarchical separation of the recipes into individual groups and sub-groups.
KW - baking recipes
KW - clustering
KW - ingredient
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85142923054&partnerID=8YFLogxK
U2 - 10.46354/i3m.2022.foodops.002
DO - 10.46354/i3m.2022.foodops.002
M3 - Conference contribution
AN - SCOPUS:85142923054
T3 - 8th International Food Operations and Processing Simulation Workshop, FoodOPS 2022
BT - 8th International Food Operations and Processing Simulation Workshop, FoodOPS 2022
A2 - Bruzzone, Agostino G.
A2 - Longo, Francesco
A2 - Vignali, Giuseppe
PB - DIME UNIVERSITY OF GENOA
T2 - 8th International Food Operations and Processing Simulation Workshop, FoodOPS 2022
Y2 - 19 September 2022 through 21 September 2022
ER -