TY - GEN
T1 - Simultaneous Unlearning of Multiple Protected User Attributes From Variational Autoencoder Recommenders Using Adversarial Training
AU - Escobedo, Gustavo
AU - Ganhör, Christian
AU - Brandl, Stefan
AU - Augstein, Mirjam
AU - Schedl, Markus
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2025.
PY - 2025
Y1 - 2025
N2 - In widely used neural network-based collaborative filtering models, users’ history logs are encoded into latent embeddings that represent the users’ preferences. In this setting, the models are capable of mapping users’ protected attributes (e.g., gender or ethnicity) from these user embeddings even without explicit access to them, resulting in models that may treat specific demographic user groups unfairly and raise privacy issues. While prior work has approached the removal of a single protected attribute of a user at a time, multiple attributes might come into play in real-world scenarios. In the work at hand, we present AdvXMultVAE which aims to unlearn multiple protected attributes (exemplified by gender and age) simultaneously to improve fairness across demographic user groups. For this purpose, we couple a variational autoencoder (VAE) architecture with adversarial training (AdvMultVAE) to support simultaneous removal of the users’ protected attributes with continuous and/or categorical values. Our experiments on two datasets, LFM-2b-100k and Ml-1m, from the music and movie domains, respectively, show that our approach can yield better results than its singular removal counterparts (based on AdvMultVAE) in effectively mitigating demographic biases whilst improving the anonymity of latent embeddings.
AB - In widely used neural network-based collaborative filtering models, users’ history logs are encoded into latent embeddings that represent the users’ preferences. In this setting, the models are capable of mapping users’ protected attributes (e.g., gender or ethnicity) from these user embeddings even without explicit access to them, resulting in models that may treat specific demographic user groups unfairly and raise privacy issues. While prior work has approached the removal of a single protected attribute of a user at a time, multiple attributes might come into play in real-world scenarios. In the work at hand, we present AdvXMultVAE which aims to unlearn multiple protected attributes (exemplified by gender and age) simultaneously to improve fairness across demographic user groups. For this purpose, we couple a variational autoencoder (VAE) architecture with adversarial training (AdvMultVAE) to support simultaneous removal of the users’ protected attributes with continuous and/or categorical values. Our experiments on two datasets, LFM-2b-100k and Ml-1m, from the music and movie domains, respectively, show that our approach can yield better results than its singular removal counterparts (based on AdvMultVAE) in effectively mitigating demographic biases whilst improving the anonymity of latent embeddings.
KW - Collaborative Filtering
KW - Debiasing
KW - Privacy
KW - Recommender Systems
KW - Variational Autoencoder
UR - http://www.scopus.com/inward/record.url?scp=85208607937&partnerID=8YFLogxK
U2 - 10.1007/978-3-031-71975-2_7
DO - 10.1007/978-3-031-71975-2_7
M3 - Conference contribution
AN - SCOPUS:85208607937
SN - 9783031719745
T3 - Communications in Computer and Information Science
SP - 91
EP - 102
BT - Advances in Bias and Fairness in Information Retrieval - 5th International Workshop, BIAS 2024, Revised Selected Papers
A2 - Bellogin, Alejandro
A2 - Boratto, Ludovico
A2 - Malloci, Francesca Maridina
A2 - Marras, Mirko
A2 - Kleanthous, Styliani
A2 - Lex, Elisabeth
PB - Springer
T2 - 5th International Workshop on Algorithmic Bias in Search and Recommendation, BIAS 2024
Y2 - 18 July 2024 through 18 July 2024
ER -