Denoising Individual Bias for Fairer Binary Submatrix Detection

Date
2020-10
Language
English
Embargo Lift Date
Committee Members
Degree
Degree Year
Department
Grantor
Journal Title
Journal ISSN
Volume Title
Found At
ACM
Abstract

Low rank representation of binary matrix is powerful in disentangling sparse individual-attribute associations, and has received wide applications. Existing binary matrix factorization (BMF) or co-clustering (CC) methods often assume i.i.d background noise. However, this assumption could be easily violated in real data, where heterogeneous row- or column-wise probability of binary entries results in disparate element-wise background distribution, and paralyzes the rationality of existing methods. We propose a binary data denoising framework, namely BIND, which optimizes the detection of true patterns by estimating the row- or column-wise mixture distribution of patterns and disparate background, and eliminating the binary attributes that are more likely from the background. BIND is supported by thoroughly derived mathematical property of the row- and column-wise mixture distributions. Our experiment on synthetic and real-world data demonstrated BIND effectively removes background noise and drastically increases the fairness and accuracy of state-of-the arts BMF and CC methods.

Description
item.page.description.tableofcontents
item.page.relation.haspart
Cite As
Wan, C., Chang, W., Zhao, T., Cao, S., & Zhang, C. (2020). Denoising Individual Bias for Fairer Binary Submatrix Detection. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (pp. 2245–2248). Association for Computing Machinery. https://doi.org/10.1145/3340531.3412156
ISSN
Publisher
Series/Report
Sponsorship
Major
Extent
Identifier
Relation
Journal
Proceedings of the 29th ACM International Conference on Information & Knowledge Management
Rights
Publisher Policy
Source
ArXiv
Alternative Title
Type
Conference proceedings
Number
Volume
Conference Dates
Conference Host
Conference Location
Conference Name
Conference Panel
Conference Secretariat Location
Version
Author's manuscript
Full Text Available at
This item is under embargo {{howLong}}