train the belief tracker

Question

train the belief tracker

yahuvi opened this issue 7 years ago · comments

I use default config and run the tracker training on macOS:
python nndial.py -config config/tracker.cfg -mode train

logs below：

init net from scrach ...
loading model settings from config file ...
prepare slot value templates ...
formatting DB ...
semi-supervised action examples: 0.00%
Corpus VMC : 97.34%
Corpus Success : 91.57%
===============
Data statistics
===============
Train : 405
Valid : 135
Test : 136
===============
Voc : 598
===============
Venue : 68
===============
setting network structures using theano variables ...
init n2n SDS ...
init rnn requestable trackers ...
init OfferChange tracker ...
init rnn informable trackers ...
init normal policy network ...
loss function
including informable tracker loss ...
including informable tracker loss ...
including informable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including OfferChange tracker loss ...
gradient w.r.t inftrk
gradient w.r.t reqtrk

issue：
Program is blocked here,the log is no longer printed.
Apple Activity Monitor status:CPU is 98%,Memory is 15.56GB.

Shawn Wen · Answer 1 · Fri Aug 11 2017 15:18:53 GMT+0800 (China Standard Time)

Theano is very slow in compiling computational graphs for this model because the architecture is non-trivial. You can put theano flags optimizer=fast_compile to run it. The run-time is relatively faster because both the model and dataset are small.

robotzheng · Answer 2 · Fri Aug 18 2017 14:55:45 GMT+0800 (China Standard Time)

THEANO_FLAGS=optimizer=fast_compile,device=gpu,floatX=float32 python nndial.py -config config/tracker.cfg -mode train
also：
including informable tracker loss ...
including informable tracker loss ...
including informable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including requestable tracker loss ...
including OfferChange tracker loss ...
gradient w.r.t inftrk
gradient w.r.t reqtrk

robotzheng · Answer 3 · Fri Aug 18 2017 14:56:31 GMT+0800 (China Standard Time)

I use centos 7.5 K40

robotzheng · Answer 4 · Fri Aug 18 2017 15:01:37 GMT+0800 (China Standard Time)

start work：
number of parameters : 1103292
number of training parameters : 1096842
start network training ...
Finishing 25 dialog in epoch 1
thanks to shawnwun

xiw54 · Answer 5 · Wed Aug 23 2017 01:50:57 GMT+0800 (China Standard Time)

Found the example_run, sorry!

Hai Liang W. · Answer 6 · Sun Dec 02 2018 16:53:22 GMT+0800 (China Standard Time)

I came to the same problem.
The program starts to train by suppling THEANO_FLAGS="optimizer=fast_compile".