On the relation of causality- versus correlation-based feature selection on model fairness
Saarela, M. (2024). On the relation of causality- versus correlation-based feature selection on model fairness. In SAC '24 : Proceedings of the 39th ACM/SIGAPP Symposium on Applied Computing (pp. 56-64). ACM. https://doi.org/10.1145/3605098.3636018
Authors
Date
2024Copyright
© 2024 Copyright held by the owner/author(s)
As machine learning models are used increasingly in the educational domain, ensuring that they are fair and do not discriminate against certain groups or individuals is imperative. Although there are a few recent attempts to ensure fairness in these models, the majority of fairness literature tends to overlook the feature selection (FS) process despite its critical role as one of the foundational steps in the machine learning pipeline. Moreover, traditional FS methods identify features by examining the correlational relationships between predictive features and the target variable without seeking to uncover causal connections between them. To address these issues, we compare for four openly available datasets---two educational ones and two benchmark datasets regularly used in the fairness literature---the impact of these two different ways of FS (i.e., causality- versus correlation-based) on the performance and fairness of the resulting models. Our results show that causality-based FS generally leads to fairer models, while the models built after correlation-based FS manifest higher performance.
...
Publisher
ACMParent publication ISBN
979-8-4007-0243-3Conference
ACM/SIGAPP Symposium on Applied ComputingIs part of publication
SAC '24 : Proceedings of the 39th ACM/SIGAPP Symposium on Applied ComputingKeywords
Publication in research information system
https://converis.jyu.fi/converis/portal/detail/Publication/215928897
Metadata
Show full item recordCollections
Related funder(s)
Research Council of FinlandFunding program(s)
Academy Research Fellow, AoFAdditional information about funding
This work was supported by the Otto A. Malm Foundation and the Academy of Finland (project no. 356314).License
Related items
Showing items with similar title or keywords.
-
Simulating Counterfactuals
Karvanen, Juha; Tikka, Santtu; Vihola, Matti (AI Access Foundation, 2024)Counterfactual inference considers a hypothetical intervention in a parallel world that shares some evidence with the factual world. If the evidence specifies a conditional distribution on a manifold, counterfactuals may ... -
Estimating Causal Effects from Panel Data with Dynamic Multivariate Panel Models
Helske, Jouni; Tikka, Santtu (Elsevier, 2024)Panel data are ubiquitous in scientific fields such as social sciences. Various modeling approaches have been presented for observational causal inference based on such data. Existing approaches typically impose restrictive ... -
Enhancing Identification of Causal Effects by Pruning
Tikka, Santtu; Karvanen, Juha (MIT Press, 2018)Causal models communicate our assumptions about causes and e ects in real-world phenomena. Often the interest lies in the identification of the e ect of an action which means deriving an expression from the observed ... -
Causality-Aware Convolutional Neural Networks for Advanced Image Classification and Generation
Terziyan, Vagan; Vitko, Oleksandra (Elsevier, 2023)Smart manufacturing uses emerging deep learning models, and particularly Convolutional Neural Networks (CNNs) and Generative Adversarial Networks (GANs), for different industrial diagnostics tasks, e.g., classification, ... -
Part-of-speech tagging in written slang
Korolainen, Valtteri (2014)Erilaiset kieliteknologiasovellukset ovat olleet jo vuosikymmeniä arkipäiväises-sä käytössä. Esimerkiksi ennustava tekstinsyöttö ja automaattinen korjaus ovat olleet käytössä jo vuosikymmeniä. Puheen tunnistus ja kielen ...