arxiv:2402.11682

Learning Conditional Invariances through Non-Commutativity

Published on Feb 18, 2024

Authors:

Abstract

Invariance learning algorithms that conditionally filter out domain-specific random variables as distractors, do so based only on the data semantics, and not the target domain under evaluation. We show that a provably optimal and sample-efficient way of learning conditional invariances is by relaxing the invariance criterion to be non-commutatively directed towards the target domain. Under domain asymmetry, i.e., when the target domain contains semantically relevant information absent in the source, the risk of the encoder varphi^* that is optimal on average across domains is strictly lower-bounded by the risk of the target-specific optimal encoder Phi^*_tau. We prove that non-commutativity steers the optimization towards Phi^*_tau instead of varphi^*, bringing the H-divergence between domains down to zero, leading to a stricter bound on the target risk. Both our theory and experiments demonstrate that non-commutative invariance (NCI) can leverage source domain samples to meet the sample complexity needs of learning Phi^*_tau, surpassing SOTA <PRE_TAG>invariance learning algorithms</POST_TAG> for domain adaptation, at times by over 2%, approaching the performance of an oracle. Implementation is available at https://github.com/abhrac/nci.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

No model linking this paper

Cite arxiv.org/abs/2402.11682 in a model README.md to link it from this page.

No dataset linking this paper

Cite arxiv.org/abs/2402.11682 in a dataset README.md to link it from this page.

No Space linking this paper

Cite arxiv.org/abs/2402.11682 in a Space README.md to link it from this page.

No Collection including this paper

Add this paper to a collection to link it from this page.