YujiaBao / Distributional-Signatures

"Few-shot Text Classification with Distributional Signatures" ICLR 2020

Home Page: https://arxiv.org/abs/1908.06039

The function _subset_selection in src/dataset/stats.py is never called

pvij opened this issue

According to the paper, the source pool during meta-training is all of the training data except the data from the training classes used for the current episode, and during meta-testing it is all of the training data. The code doesn't seem to do this: even during meta-training, I think the source pool is constructed from the entire training set. I suspect the function _subset_selection has a role to play here, but it isn't called anywhere. If I am wrong, can you point me to the part of the code that constructs the source pool during meta-training? I am not able to find it.
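
To make the question concrete, the behaviour described in the paper could be sketched roughly as below. This is only an illustration of the rule as stated above: source_pool_classes and its parameters are hypothetical names, not functions from the repository.

```python
def source_pool_classes(train_classes, episode_classes, meta_training):
    """Return the classes whose examples form the source pool.

    Hypothetical helper illustrating the rule from the paper; it is not
    taken from the repository.
    """
    if meta_training:
        # Meta-training: drop the classes sampled for the current episode.
        excluded = set(episode_classes)
        return [c for c in train_classes if c not in excluded]
    # Meta-testing: every training class contributes to the source pool.
    return list(train_classes)
```

Under this sketch, an episode would first sample its classes and then pass the remaining training classes on to whatever computes the source-pool statistics.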

Hi pvij, thanks for checking out the code. _compute_idf and _compute_iwf return the statistics over the source pool. Both methods look at the classes passed in the classes argument, aggregate their statistics, and return the idf and the iwf (inverse word frequency).
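
As a rough sketch of that idea (not the actual code in src/dataset/stats.py), restricting the statistics to a set of classes might look like the following. The function name compute_idf_iwf, the token-list input format, the eps default, and the exact formulas (a standard log idf and a smoothed inverse frequency) are all assumptions made for illustration.

```python
import math
from collections import Counter


def compute_idf_iwf(docs, labels, classes, eps=1e-3):
    """Aggregate idf and iwf over documents whose label is in `classes`.

    Hypothetical illustration: `docs` is a list of token lists and
    `labels` holds the class of each document. Assumes at least one
    document belongs to `classes`.
    """
    keep = set(classes)
    pool = [doc for doc, y in zip(docs, labels) if y in keep]

    doc_freq = Counter()   # number of pool documents containing each word
    word_freq = Counter()  # total occurrences of each word in the pool
    for doc in pool:
        word_freq.update(doc)
        doc_freq.update(set(doc))

    n_docs = len(pool)
    n_tokens = sum(word_freq.values())
    # Assumed formulas: log(N / df) for idf, eps / (eps + P(w)) for iwf.
    idf = {w: math.log(n_docs / df) for w, df in doc_freq.items()}
    iwf = {w: eps / (eps + c / n_tokens) for w, c in word_freq.items()}
    return idf, iwf
```

Words that appear only in documents outside `classes` get no entry in either dictionary, so the caller controls the source pool purely through the classes it passes in.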

You are right. I am wrong. Thanks for pointing it out.

You are welcome!