facebookresearch / ppuda

Code for Parameter Prediction for Unseen Deep Architectures (NeurIPS 2021)


Training the GHN on other datasets

pivettamarcos opened this issue

Is there a way to train the GHN on other types of datasets, such as with 1D inputs?

The overall approach of GHNs should work for 1D inputs as well, but this code does not support them.

The main steps to achieve that would be to:

  1. write your own net_generator.py based on ours and generate training architectures that can process 1D inputs.
  2. write a Network class to process 1D inputs.
  3. set spatial dimensions to 1 in max_shape https://github.com/facebookresearch/ppuda/blob/main/ppuda/config.py#L160

Other minor steps may be required as we assume 2D inputs in the code.
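
For step 2, a minimal sketch of what a 1D-input network could look like is below. The class name `Network1D` and all layer sizes are illustrative assumptions, not code from this repo; a real implementation would need to mirror the structure of our `Network` class so that its computational graph can be parsed by the GHN code. For step 3, the two spatial entries of `max_shape` in `ppuda/config.py` would then be set to 1, so that predicted weight tensors have no 2D spatial extent.

```python
import torch.nn as nn

# Hypothetical sketch of a Network class for 1D inputs (e.g. audio or time series).
# Network1D and its layer sizes are illustrative assumptions, not code from this repo.
class Network1D(nn.Module):
    def __init__(self, in_channels=1, num_classes=10):
        super().__init__()
        # Conv1d ops replace the Conv2d ops used for image architectures.
        self.features = nn.Sequential(
            nn.Conv1d(in_channels, 32, kernel_size=7, stride=2, padding=3),
            nn.ReLU(inplace=True),
            nn.Conv1d(32, 64, kernel_size=5, stride=2, padding=2),
            nn.ReLU(inplace=True),
            nn.AdaptiveAvgPool1d(1),  # collapse the temporal dimension
        )
        self.classifier = nn.Linear(64, num_classes)

    def forward(self, x):  # x: (batch, channels, length)
        x = self.features(x).flatten(1)
        return self.classifier(x)
```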

Will you be providing more examples with different modalities like text or audio?

I just created pull request #5 with an example of predicting parameters for a generic MLP, which should be possible to adapt to 1D inputs, text, or audio. However, since the GHN was trained on images, the predicted parameters are very likely to be meaningless in those cases; the PR is intended only as an example.
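
For reference, a generic MLP of the kind targeted by that pull request could look roughly like the sketch below (an illustrative assumption, not the code from PR #5):

```python
import torch.nn as nn

# Illustrative generic MLP over flat feature vectors; names and sizes are
# assumptions for this sketch, not taken from pull request #5.
class MLP(nn.Module):
    def __init__(self, in_features=128, hidden=256, num_classes=10):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(in_features, hidden),
            nn.ReLU(inplace=True),
            nn.Linear(hidden, num_classes),
        )

    def forward(self, x):  # x: (batch, in_features)
        return self.net(x)
```

As noted above, parameters predicted for such an MLP by the released image-trained GHNs are unlikely to be useful.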

To make the predicted parameters useful for 1D inputs, text, or audio, the GHN must be trained on such data. This will require research, but I'm hopeful it will be possible in the near future.

Feel free to close this issue if your questions are resolved.

@bknyaz - If I want to train on the CelebA dataset, would I have to generate new NNs using the generator and edit the Network class to handle the CelebA inputs?

You can generate new NNs to handle CelebA, but it may be easier to just change the existing CIFAR-10/ImageNet graphs on the fly in the graph loader (perhaps somewhere in this function: https://github.com/facebookresearch/ppuda/blob/main/ppuda/deepnets1m/loader.py#L167) by replacing the classification nodes with those appropriate for CelebA.
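
For intuition, that graph-level edit corresponds to something like the following module-level change in plain PyTorch. This is only a sketch: it assumes the network exposes its final linear layer as `classifier` (a naming assumption), whereas the actual fix would operate on the graph nodes inside the loader.

```python
import torch.nn as nn

# CelebA provides 40 binary attribute labels per image.
NUM_CELEBA_ATTRIBUTES = 40

def adapt_head_for_celeba(model: nn.Module) -> nn.Module:
    # Assumes the final classification layer is exposed as `model.classifier`
    # (a naming assumption for this sketch); replace it so the output
    # dimensionality matches CelebA's attribute targets.
    in_features = model.classifier.in_features
    model.classifier = nn.Linear(in_features, NUM_CELEBA_ATTRIBUTES)
    return model
```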

I see, yes that makes sense! Thanks for the super quick response!