FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Tian, Yu; Shi, Min; Luo, Yan; Kouhana, Ava; Elze, Tobias; Wang, Mengyu

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.02189 (cs)

[Submitted on 3 Nov 2023 (v1), last revised 1 May 2024 (this version, v5)]

Title:FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Authors:Yu Tian, Min Shi, Yan Luo, Ava Kouhana, Tobias Elze, Mengyu Wang

View PDF

Abstract:Fairness in artificial intelligence models has gained significantly more attention in recent years, especially in the area of medicine, as fairness in medical models is critical to people's well-being and lives. High-quality medical fairness datasets are needed to promote fairness learning research. Existing medical fairness datasets are all for classification tasks, and no fairness datasets are available for medical segmentation, while medical segmentation is an equally important clinical task as classifications, which can provide detailed spatial information on organ abnormalities ready to be assessed by clinicians. In this paper, we propose the first fairness dataset for medical segmentation named Harvard-FairSeg with 10,000 subject samples. In addition, we propose a fair error-bound scaling approach to reweight the loss function with the upper error-bound in each identity group, using the segment anything model (SAM). We anticipate that the segmentation performance equity can be improved by explicitly tackling the hard cases with high training errors in each identity group. To facilitate fair comparisons, we utilize a novel equity-scaled segmentation performance metric to compare segmentation metrics in the context of fairness, such as the equity-scaled Dice coefficient. Through comprehensive experiments, we demonstrate that our fair error-bound scaling approach either has superior or comparable fairness performance to the state-of-the-art fairness learning models. The dataset and code are publicly accessible via this https URL.

Comments:	ICLR 2024; Codes available at this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.02189 [cs.CV]
	(or arXiv:2311.02189v5 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.02189

Submission history

From: Yu Tian [view email]
[v1] Fri, 3 Nov 2023 18:44:21 UTC (587 KB)
[v2] Fri, 19 Jan 2024 22:39:05 UTC (594 KB)
[v3] Tue, 27 Feb 2024 18:38:50 UTC (594 KB)
[v4] Sun, 10 Mar 2024 14:47:57 UTC (594 KB)
[v5] Wed, 1 May 2024 01:32:34 UTC (595 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:FairSeg: A Large-Scale Medical Image Segmentation Dataset for Fairness Learning Using Segment Anything Model with Fair Error-Bound Scaling

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators