Causal Generative Domain Adaptation Networks [Daily Reading]
文章目录
causal graph
话不多说,先上图,这篇文章的因果图和我之前读过的,都有所不同。As the variable that need to be predicted, Y is on the cause position instead of effect position. In the following we will see how this makes sense.
Framework
In this paper they propose a Generative Domain Adaptation Network (G-DAN) with specific latent variables θ \theta θ to capture changes in P X ∣ Y P_{X|Y} PX∣Y. X X Xdenotes features and Y Y Y the class label. P X ∣ Y P_{X|Y} PX∣Y is implicitly represents by functional model X = g ( Y , E , θ ) X=g(Y,E,\theta) X=g(Y,E,θ). In this function, latent variables θ \theta θ may change across domains. E, the noise
Identifiability theory to support θ \theta θ
Causality: Testing Identifiability
(The above blog explains how to identify joint distribution under confounding in detail.)
In the 3.1 section , the authors use identifiability to make sure whether the estimated
θ
^
\widehat {\theta}
θ
can capture all the distribution changes.
Model Learning
θ \theta θ is estimated by