They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias

Abstract

Vision Language Models (VLMs) such as CLIP are powerful, but they can exhibit unwanted biases that make them less safe when deployed directly in applications such as text-to-image or text-to-video retrieval, reverse search, or classification. In this work, we propose a novel framework that generates synthetic counterfactual images to create a diverse and balanced dataset for fine-tuning CLIP. Given a set of diverse synthetic base images from text-to-image models, we leverage off-the-shelf segmentation and inpainting models to place humans with diverse visual appearances in context. We show that CLIP trained on such datasets learns to disentangle human appearance from the context of an image, i.e., what makes a doctor is not correlated with the person's visual appearance, such as skin color or body type, but with the context, such as the background, the attire they are wearing, or the objects they are holding. We demonstrate that our fine-tuned CLIP model, \(CF_\alpha\), improves key fairness metrics such as MaxSkew, MinSkew, and NDKL by 40-66% for image retrieval tasks, while still achieving similar levels of performance in downstream tasks. We show that, by design, our model retains maximal compatibility with the original CLIP models and can be easily controlled to support different accuracy versus fairness trade-offs in a plug-and-play fashion.
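The abstract names MaxSkew, MinSkew, and NDKL without defining them. The sketch below shows one way these metrics could be computed for a single ranked retrieval result, assuming the definitions commonly used in the fair-ranking literature: Skew@k for an attribute value is the log ratio of its observed share in the top k to its desired share, and NDKL is a position-discounted average of the KL divergence between each prefix's attribute distribution and the desired one. The function names, the input format (a list of attribute labels in rank order), and the uniform desired distribution are illustrative assumptions, not the authors' evaluation code.

```python
import math
from collections import Counter


def skew_at_k(ranked_attrs, desired, k):
    """Skew@k per attribute value: ln(observed share in top-k / desired share).

    Assumption: an attribute absent from the top-k gets -inf.
    """
    top = ranked_attrs[:k]
    counts = Counter(top)
    n = len(top)
    skews = {}
    for attr, q in desired.items():
        p = counts.get(attr, 0) / n
        skews[attr] = math.log(p / q) if p > 0 else float("-inf")
    return skews


def max_skew(ranked_attrs, desired, k):
    """Largest over-representation among attribute values in the top-k."""
    return max(skew_at_k(ranked_attrs, desired, k).values())


def min_skew(ranked_attrs, desired, k):
    """Largest under-representation among attribute values in the top-k."""
    return min(skew_at_k(ranked_attrs, desired, k).values())


def kl_divergence(p, q):
    """KL(p || q) in nats; terms with p == 0 contribute nothing."""
    return sum(p[a] * math.log(p[a] / q[a]) for a in q if p.get(a, 0) > 0)


def ndkl(ranked_attrs, desired):
    """Normalized discounted KL divergence over every prefix of the ranking."""
    counts = Counter()
    total = 0.0
    norm = 0.0
    for i, attr in enumerate(ranked_attrs, start=1):
        counts[attr] += 1
        weight = 1.0 / math.log2(i + 1)  # earlier ranks weigh more
        prefix_dist = {a: counts[a] / i for a in desired}
        total += weight * kl_divergence(prefix_dist, desired)
        norm += weight
    return total / norm


# Hypothetical retrieval result: attribute label of each returned image, in rank order.
desired = {"a": 0.5, "b": 0.5}
balanced = ["a", "b", "a", "b"]
one_sided = ["a", "a", "a", "a"]
```

Under these definitions a perfectly balanced top-k gives MaxSkew and MinSkew of 0, and all three metrics move away from 0 as one attribute value dominates the ranking. For `one_sided`, NDKL equals ln 2 because every prefix has the same divergence from the uniform target.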

Author and article information

Publication date Created: 17 June 2024

Article

arXiv ID: 2406.11331

SO-VID: 247f221c-fbb1-4ff3-a3b1-40d670070e4b

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

Custom metadata

Categories cs.CV cs.IR cs.LG

ScienceOpen disciplines: Computer vision & Pattern recognition, Information & Library science, Artificial intelligence

