Prediction is not working

Question

Prediction is not working

BenjamWhite opened this issue 7 years ago · comments

I'm running the same code on test data and get strange weights back.

import h5py
with h5py.File('data/%s.hdf5'%FN1, mode='r') as f:
    if 'layer_names' not in f.attrs and 'model_weights' in f:
        f = f['model_weights']
    weights = [np.copy(v) for v in f['timedistributed_1'].itervalues()]

and
map(lambda x: x.shape, weights)
is giving me back:
[(2,)]

Also run the code with Keras 2.0.0 and the actual version. Can it be due to different versions?

Thanks in advance!

imranshaikmuma · Answer 1 · Mon Jun 05 2017 14:24:02 GMT+0800 (China Standard Time)

@BenjamWhite i am trying to fix this problem from two days. please help me if you find it too..

BenjamWhite · Answer 2 · Mon Jun 05 2017 14:26:28 GMT+0800 (China Standard Time)

I thought first it might be something with theano and tensorflow as they store data in different order, but changing channel_first to channel_last didn't help.

imranshaikmuma · Answer 3 · Mon Jun 05 2017 14:28:41 GMT+0800 (China Standard Time)

what i feel is file is not having weights of 'time_distributed_1' layer. i googled and find nothing on this. it has only [array(['bias:0', 'kernel:0'], dtype='<U8')] inside.

BenjamWhite · Answer 4 · Mon Jun 05 2017 14:34:35 GMT+0800 (China Standard Time)

well, that's a different issue, when TimeDistributed gets defined, there is a
), missing before name.
So it assigns the name to the Dense method.
Before:

model.add(TimeDistributed(Dense(vocab_size,
                                kernel_regularizer=regularizer, 
                                bias_regularizer=regularizer, name = 'timedistributed_1')))

After:

model.add(TimeDistributed(Dense(vocab_size,
                                kernel_regularizer=regularizer, 
                                bias_regularizer=regularizer),
                                name = 'timedistributed_1'))

There is also a method to show the content of the h5py.

Haha, that's the next error I'm getting with those values [array(['bias:0', 'kernel:0'], dtype='<U8')]
the

# out very own softmax
def output2probs(output):
    print 'output:', type(output), output
    print 'weights[0]:', type(weights[0]), weights[0]
    #print 'weights[1]:', type(weights[1])
    output = np.dot(output, weights[0]) + weights[1]
    output -= output.max()
    output = np.exp(output)
    output /= output.sum()
    return output

is saying it doesn't know dtype='<U8'

imranshaikmuma · Answer 5 · Mon Jun 05 2017 15:01:22 GMT+0800 (China Standard Time)

weights[0] and weights[1] should be an array of weights. we are not getting that from our trained model file. if you can solve this, everything will be automatically solved

imranshaikmuma · Answer 6 · Mon Jun 05 2017 16:17:55 GMT+0800 (China Standard Time)

check this link it helped me a lot (Cell 94)
https://github.com/rtlee9/recipe-summarization/blob/master/src/predict.ipynb
here when we are using fit_generator, weights are produced till dropout3 only. and there is no layer for time distributed..everyone has the same problem but they ignored and moved on..

BenjamWhite · Answer 7 · Mon Jun 05 2017 16:49:59 GMT+0800 (China Standard Time)

where you able to run this?

imranshaikmuma · Answer 8 · Wed Jun 21 2017 00:04:36 GMT+0800 (China Standard Time)

i am able to run but i am not getting the desired output.
i am using a different data like job description and generating a job title
my data is huge and i am looking for gpu to train the model
what about you?

BenjamWhite · Answer 9 · Wed Jun 21 2017 14:19:48 GMT+0800 (China Standard Time)

didn't really work on it anymore (had no time so far), so basically still the same error.
how did you get around that error before?

What is the diff between expected and real output?
Is it a data problem?

Elaine · Answer 10 · Sun May 27 2018 14:16:38 GMT+0800 (China Standard Time)

If you open the hdf5 file you should see the file structure.
For me, I had to modify it to

import h5py
with h5py.File('data/%s.hdf5'%FN1, mode='r') as f:
    if 'layer_names' not in f.attrs and 'model_weights' in f:
        f = f['model_weights']
    weights = [np.copy(v) for v in f['time_distributed_1']['time_distributed_1'].values()]
weights = np.array([weights[1], weights[0]])

Nauman A Butt · Answer 11 · Sun Jun 10 2018 22:00:46 GMT+0800 (China Standard Time)

@elainelinlin did you get any prediction out of it? For your own data?

Elaine · Answer 12 · Mon Jun 11 2018 13:20:06 GMT+0800 (China Standard Time)

@fzr2009 Yes. What is not working for you?

Nauman A Butt · Answer 13 · Mon Jun 11 2018 13:42:51 GMT+0800 (China Standard Time)

@elainelinlin I get this KeyError every time with different string in it, Please help if you can