Fine-tuning this model in TensorFlow gives a 'nan' problem

Question

lisztfrancis opened this issue 8 years ago

I've converted this model to TensorFlow and launched the graph with the .npy data file. The data file loads without any problem; I printed every value in tf.all_variables() to check. But backprop does not work properly: the trainable variables become NaN at the first backprop step. I'm not very experienced with CNN tricks, since my background is in physics. Do I need to adjust some special layers of this net? And what is the proper optimization method?
Thanks for any insightful and helpful analysis and advice!
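
For readers hitting the same symptom, here is a minimal, framework-agnostic sketch of two checks that are often tried first when weights turn NaN after a single update: rejecting non-finite gradients and clipping the gradient norm before a small-learning-rate step. It is an illustration under assumptions, not a confirmed fix for this model. The function names `all_finite`, `clip_by_global_norm`, and `sgd_step` are hypothetical helpers written in plain Python; in TensorFlow the analogous built-ins are `tf.check_numerics` and `tf.clip_by_global_norm`.

```python
import math


def all_finite(values):
    """Return True if every number in the flat list is finite (no NaN or inf)."""
    return all(math.isfinite(v) for v in values)


def clip_by_global_norm(grads, max_norm):
    """Rescale a flat list of gradients so its L2 norm is at most max_norm.

    Returns the (possibly rescaled) gradients and the norm before clipping.
    """
    norm = math.sqrt(sum(g * g for g in grads))
    if norm > max_norm:
        scale = max_norm / norm
        return [g * scale for g in grads], norm
    return list(grads), norm


def sgd_step(weights, grads, lr, max_norm):
    """Apply one plain SGD update, refusing to use non-finite gradients."""
    if not all_finite(grads):
        # Stop early so the NaN does not propagate into the weights.
        raise ValueError("non-finite gradient: check inputs, loss, and learning rate")
    clipped, _ = clip_by_global_norm(grads, max_norm)
    return [w - lr * g for w, g in zip(weights, clipped)]


# Hypothetical values: a gradient of norm 5 is clipped to norm 1 before the update.
new_weights = sgd_step([1.0, 1.0], [3.0, 4.0], lr=0.1, max_norm=1.0)
```

If the check raises on the very first step, the problem is likely upstream of the optimizer (input scaling, the loss, or a mis-converted layer) rather than the learning rate alone.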

Loreto Parisi · Answer 1 · Wed May 30 2018 22:14:59 GMT+0800 (China Standard Time)

@lisztfrancis I was thinking the same thing; in the end I preferred Caffe. The performance of the API using Tornado Web Server + Docker is excellent with this ResNet-50 pre-trained model.
What advantage did you find in converting to TF?