Inv-SENet: Invariant Self Expression Network for clustering under biased data
Ashutosh Singh (Northeastern University); Ashish Singh (University of Massachusetts Amherst); Aria Masoomi (Northeastern University); Tales Imbiriba (Northeastern University); Erik Learned-Miller (University of Massachusetts, Amherst); Deniz Erdogmus (Northeastern University)
SPS
Subspace clustering algorithms are used to uncover the cluster structure that best explains the patterns in a dataset. These methods are widely used for data-exploration tasks across the natural sciences. However, most of them fail to handle confounding attributes in the dataset. For datasets where each data sample represents multiple attributes, naively applying a clustering approach can produce undesired output. To this end, we propose a novel framework that jointly removes confounding attributes while learning to cluster data points in individual subspaces. Assuming label information is available for these confounding attributes, we regularize the clustering method by adversarially learning to minimize the mutual information between the data representation and the confounding attribute labels. Our experimental results on synthetic and real-world datasets demonstrate the effectiveness of our approach.
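For readers unfamiliar with self-expression, the sketch below illustrates only the basic idea that subspace clustering builds on: each sample is reconstructed as a linear combination of the other samples, and the size of the coefficients serves as a pairwise affinity. It is not the network proposed here. It uses a closed-form ridge-regularized least-squares solution as a stand-in, and it omits the adversarial mutual-information regularizer entirely. The function name `self_expressive_affinity`, the parameter `lam`, and the toy data are all assumptions made for illustration.

```python
import numpy as np

def self_expressive_affinity(X, lam=1e-2):
    """Ridge-regularized self-expression (illustrative stand-in, not Inv-SENet).

    X has shape (n_samples, n_features). We solve
        min_C ||X - C X||_F^2 + lam * ||C||_F^2,
    whose closed form is C = (G + lam I)^{-1} G with G = X X^T.
    """
    G = X @ X.T                                   # Gram matrix of the samples
    n = G.shape[0]
    C = np.linalg.solve(G + lam * np.eye(n), G)   # self-expressive coefficients
    return np.abs(C) + np.abs(C).T                # symmetric, non-negative affinity

# Toy data (hypothetical): two orthogonal 2-D subspaces embedded in R^4.
rng = np.random.default_rng(0)
n_per = 5
X = np.zeros((2 * n_per, 4))
X[:n_per, :2] = rng.normal(size=(n_per, 2))   # samples in subspace 1
X[n_per:, 2:] = rng.normal(size=(n_per, 2))   # samples in subspace 2

A = self_expressive_affinity(X)
cross = A[:n_per, n_per:].max()    # largest affinity between the two subspaces
within = A[:n_per, :n_per].max()   # largest affinity inside subspace 1
```

On this toy input the affinity between samples from different subspaces is numerically negligible, so a spectral clustering step applied to `A` would separate the two groups. A confounder-aware method in the spirit described above would additionally penalize representations from which the confounding labels can be predicted.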