chenxin061 / pdarts

Codes for our paper "Progressive Differentiable Architecture Search: Bridging the Depth Gap between Search and Evaluation"


For other datasets, is it better to use the pre-searched network from CIFAR, or to search on the dataset itself?

rtrobin opened this issue · comments

I want to try PDARTS on other datasets. Is it better to use the network transferred from the pre-searched CIFAR model, or to search on the dataset itself? For other NAS methods, perhaps only the former is feasible. Since PDARTS is time-efficient in searching, maybe I should search on the dataset directly?

Any suggestion is appreciated. Thanks. :)

@rtrobin Hello, I'm also working on applying PDARTS to different tasks!

Let me get straight to it: in general, I believe searching for a model is always better than using a pre-searched one, because the algorithm optimizes the architecture with respect to your data.

From my experience, though, it is quite hard to search directly on another dataset, because you still need to set the hyperparameters carefully, taking the data into consideration.

I also retrained the PDARTS architecture on my own data; it turned out quite good (but it does not beat the human-designed baselines, since I didn't change many of the parameters). It seems that more effort is required to fine-tune the pre-searched model.

GL,

@Catosine Thanks for the reply.

That's exactly my concern too. Compared to traditional deep learning architectures, it is even harder to tune the hyperparameters here. Is there any guidance on what each parameter actually means in practice?

@rtrobin Would you mind raising a specific case for your question?

@Catosine I don't have anything useful to share right now. I'm doing some rough investigation and trials on a simple, small dataset. I'll share something once I expand my trials.

@rtrobin I've done some rough searches and gained a little experience with the parameters. You can show me what you have now, and I may be able to tell you something useful if I've run into the same situation.
Again, thanks for keeping me updated :)

As @Catosine said, you should tune some hyperparameters if you want to search on a new dataset. However, if you expect better performance on the new dataset, my suggestion is to search on it rather than transfer existing architectures, although I believe the released PDARTS architecture can work well.
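For anyone landing here, the "progressive" part of the search being discussed can be sketched in plain Python: the search runs in stages, and at each stage the supernet gets deeper while the weakest candidate operations are pruned (this is what per-stage flags like the repo's `--add_layers` and `--dropout_rate` control). The stage depths and kept-op counts below follow my reading of the paper's CIFAR setup, and the scores are made-up numbers for illustration, not real search results:

```python
# Toy sketch of P-DARTS' progressive search schedule (illustration only,
# not code from the repo). At each stage the network is made deeper and
# low-scoring candidate operations are dropped.

OPS = ["none", "max_pool_3x3", "avg_pool_3x3", "skip_connect",
       "sep_conv_3x3", "sep_conv_5x5", "dil_conv_3x3", "dil_conv_5x5"]

# (depth in cells, number of candidate ops kept) per stage
STAGES = [(5, 8), (11, 5), (17, 3)]

def progressive_prune(op_scores, stages=STAGES):
    """Keep the top-k candidate ops at each stage, k shrinking as depth grows."""
    candidates = list(op_scores)
    schedule = []
    for depth, keep in stages:
        # rank the surviving candidates by their architecture weight
        candidates = sorted(candidates, key=op_scores.get, reverse=True)[:keep]
        schedule.append((depth, list(candidates)))
    return schedule

# Made-up architecture weights for demonstration:
scores = dict(zip(OPS, [0.01, 0.05, 0.04, 0.20, 0.30, 0.15, 0.14, 0.11]))
for depth, kept in progressive_prune(scores):
    print(depth, kept)
```

The point of the sketch: the hyperparameters that need tuning on a new dataset are largely the per-stage ones (how fast depth grows, how aggressively ops are pruned, how much dropout is placed on skip-connections at each stage).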

@rtrobin Hi there! I've got some data: compared to the pre-searched model, the model searched on my own data does achieve better accuracy.

@Catosine How much improvement did you get on your own data, compared to a manually designed architecture?

@JarveeLee It went from 95% top-1 validation accuracy to 99%.

Wow! Great!

@Catosine Thanks for the update. Good job.

I haven't done NAS work since June. Maybe you could share your hyperparameter strategy here for others' further research, in case they find this post.

@Catosine @rtrobin Hi, I've been enjoying your conversation. But I ran into some trouble when applying PDARTS to other datasets.

One of the datasets is VOC2012, which is generally used for semantic segmentation. For this dataset, I just changed the input, the criterion, and other related parameters, like the number of channels. Unfortunately, the result is bad: the loss does not converge, and instead oscillates between 4 and 6.
The other is ECSSD, which is used for saliency detection. I did the same as with the previous dataset: I trained the searched architecture on ECSSD, and the results are still bad.
So I wonder whether I forgot some necessary operation, or whether PDARTS simply performs well only on classification tasks like CIFAR and ImageNet.

Any suggestion is appreciated. Thanks. :)
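One sanity check worth adding for the VOC2012 setup described above: for segmentation, the network must emit per-pixel logits of shape `(N, C, H, W)`, and the criterion must average the negative log-likelihood over every pixel against integer targets of shape `(N, H, W)`. A minimal NumPy sketch of that loss (nothing here is from the PDARTS repo):

```python
import numpy as np

def pixelwise_cross_entropy(logits, targets):
    """Mean per-pixel cross-entropy.

    logits:  float array, shape (N, C, H, W) -- class scores per pixel
    targets: int array,   shape (N, H, W)    -- class index per pixel
    """
    # numerically stable log-softmax over the class axis
    shifted = logits - logits.max(axis=1, keepdims=True)
    log_probs = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    n, c, h, w = logits.shape
    # pick the log-probability of the true class at every pixel
    ni, hi, wi = np.ix_(np.arange(n), np.arange(h), np.arange(w))
    picked = log_probs[ni, targets, hi, wi]
    return -picked.mean()

# Uniform logits over 21 classes (VOC: 20 classes + background):
logits = np.zeros((2, 21, 4, 4))
targets = np.random.randint(0, 21, size=(2, 4, 4))
loss = pixelwise_cross_entropy(logits, targets)
```

A useful reference point: a model predicting uniformly over C classes gives a loss of log(C), which is about 3.04 for 21 VOC classes. A loss that sits well above that (4 to 6, as reported above) suggests the logit/target shapes or the criterion, rather than the architecture, may be at fault.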

@LiuTingWed Hello!
First things first: PDARTS does work well on other datasets, such as face identification. I've run experiments with it and it is much better than ResNet. But I cannot guarantee it works on every dataset. My personal take is that, without major changes to the supernet or the operations, it can only handle classification tasks. It is definitely a good idea to try different candidate operations and block structures, because at least one thing is certain: the operations and blocks were carefully designed for CIFAR and ImageNet.
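Concretely, the candidate set referred to above is the `PRIMITIVES` list in the DARTS family of repos (in the actual code, each name must also map to an `nn.Module` in the `OPS` dict in `operations.py`). Swapping candidates is mostly a matter of editing that list; the sketch below uses the standard DARTS/PDARTS names, and `dil_conv_5x5_r4` is a hypothetical new op for illustration:

```python
# The standard candidate operations searched over in DARTS/P-DARTS
# (mirrors the PRIMITIVES list in those repos):
PRIMITIVES = [
    "none",
    "max_pool_3x3",
    "avg_pool_3x3",
    "skip_connect",
    "sep_conv_3x3",
    "sep_conv_5x5",
    "dil_conv_3x3",
    "dil_conv_5x5",
]

def swap_op(primitives, old, new):
    """Return a copy of the candidate list with one op replaced."""
    if old not in primitives:
        raise ValueError(f"{old!r} is not a candidate op")
    return [new if op == old else op for op in primitives]

# Hypothetical example: trade the 5x5 separable conv for a wider
# dilated conv when the task needs a larger receptive field.
custom = swap_op(PRIMITIVES, "sep_conv_5x5", "dil_conv_5x5_r4")
```

Keeping the list the same length means the rest of the search code (architecture-weight tensors, pruning schedule) needs no other changes.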

In addition, there are follow-up works on DARTS. The latest should be FairDARTS, which recognized the convergence issue and gave a pretty good solution. You might be interested in it.

GL,
PF