Theoretical consistency

Question

Theoretical consistency

cshen6 opened this issue 8 years ago · comments

cshen6 commented 8 years ago

Important:

we are showing the diffusion process U is exchangeable, thus conditional iid. Therefore we are testing independence of the diffusion process with the nodal attributes. Make that part more explicit in intro, tell people why diffusion process is more advantageous than anything else (add citation too).
There is still a gap from condition i.i.d. of U, to dCov(U,X)=0 if and only if U and X is independent. Let us think about it.

Tentative:

extend exchangeability of U to the diffusion distance for the directed and/or weighted graph.
compare to and think about the case when we replace U by A.

cshen6 · Answer 1 · Fri Sep 23 2016 08:32:49 GMT+0800 (China Standard Time)

Main:

theorem x:
distance correlation on exchangeable random variables converge to 0 if and only if under independence.

Proof forward: most of thm1 and thm2 holds directly without iid (in dcorr07), so the only step to justify is the SLLN for v-statistic in thm 2.

Carefully check through the SLLN for exchangeable random variables & the V-statistic. Need finite first moment in SLLN for conditional iid, I think.

corollary: MGC and Dcorr are consistent, when testing network dependency for the euclidean distance of the diffusion map, or Euclidean distance of the adjacency matrix, or the estimated network factors, because they are all exchangeable.

Also need finite second moment above; and note that the adjacency matrix is 2-array joint exchangeable.

What about directed & weighted graph model?

Youjin Lee · Answer 2 · Sun Sep 25 2016 12:31:45 GMT+0800 (China Standard Time)

@cshen6 Some useful properties valid for exchangeable variables are actually proven in http://www.sciencedirect.com/science/article/pii/S0047259X13000262.

cshen6 · Answer 3 · Sun Sep 25 2016 20:41:24 GMT+0800 (China Standard Time)

@youjin1207
yes indeed! although it is for coordinate exchangeability (dimension-wise), rather than observation exchangeability (sample-wise).

Youjin Lee · Answer 4 · Mon Sep 26 2016 04:58:58 GMT+0800 (China Standard Time)

@cshen6 On page 6, they actually mentioned sample exchangeability, even though they didn't specifically prove it. In my understanding, we apply conditional CLT for conditionally independence random variables. I don't know if we actually have to PROVE it, since the results are already there - I don't see the difference between Theorem in my current draft and conditional CLT - instead of sub-sigma-algebra notion, I used g function or eta.

cshen6 · Answer 5 · Tue Sep 27 2016 03:20:39 GMT+0800 (China Standard Time)

@youjin1207
I listed the forward and backward directions above for thm X, which is the last theoretical piece.
Either way, it will lead to proving the theorem and the corollary.

Youjin Lee · Answer 6 · Sat Oct 01 2016 01:14:10 GMT+0800 (China Standard Time)

@cshen6 Now I don't think distance in conditional variable and that in un-conditional variable are the same so we cannot approach in that way.

cshen6 · Answer 7 · Sat Oct 01 2016 03:47:31 GMT+0800 (China Standard Time)

ok I will think about it before our next meeting.

also, I think adjacency matrix is in fact exchangeable, if we look at the definition and let the exchangeability to be defined on both row and column of the adjacency matrix.