Outdated RQBottleneckTransformer model

Question

Subuday opened this issue 5 months ago

Maksym Sutkovenko commented 5 months ago

It looks like the architecture of RQBottleneckTransformer has been changed, but the model has not been retrained or re-uploaded.
Trying to load RQBottleneckTransformer using
vq_model = vq_stoks.RQBottleneckTransformer.load_model(ref="collabora/whisperspeech:whisper-vq-stoks-medium-en+pl.model").cuda()
leads to this error:

Error(s) in loading state_dict for RQBottleneckTransformer:
	Missing key(s) in state_dict: "rq.project_in.weight", "rq.project_in.bias", "rq.project_out.weight", "rq.project_out.bias". 
	Unexpected key(s) in state_dict: "rq.layers.0.project_in.weight", "rq.layers.0.project_in.bias", "rq.layers.0.project_out.weight", "rq.layers.0.project_out.bias".

Maksym Sutkovenko · Answer 1 · Thu Mar 07 2024 16:55:00 GMT+0800 (China Standard Time)

Okay, I was using an incorrect version of vector_quantize_pytorch.
The correct one is specified in the settings.ini file.
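
For anyone hitting the same error, a quick way to spot this mismatch is to compare the requirement listed in settings.ini with what is actually installed. The sketch below is only an illustration: it assumes the requirements are a space-separated list under a `requirements` key in the `[DEFAULT]` section, and the helper names `pinned_requirement` and `installed_version` are made up for this example.

```python
import configparser
import re
from importlib import metadata


def _normalize(name):
    # Treat "-", "_" and "." as equivalent, as package indexes do.
    return re.sub(r"[-_.]+", "-", name).lower()


def pinned_requirement(settings_text, package):
    """Return the requirement entry for `package` from settings.ini text, or None."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(settings_text)
    # Assumption: requirements are a space-separated list under [DEFAULT].
    requirements = parser["DEFAULT"].get("requirements", "")
    for entry in requirements.split():
        # Strip any version specifier, extras, or markers to get the bare name.
        name = re.split(r"[<>=!~\[;]", entry, maxsplit=1)[0]
        if _normalize(name) == _normalize(package):
            return entry
    return None


def installed_version(package):
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None
```

Calling `pinned_requirement(open("settings.ini").read(), "vector_quantize_pytorch")` and `installed_version("vector_quantize_pytorch")` side by side should show whether the installed version differs from the one the project asks for.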

Jakub Piotr Cłapa · Answer 2 · Thu Mar 07 2024 17:09:08 GMT+0800 (China Standard Time)

Yeah, that's unfortunate. It would probably make sense to update the checkpoint and use the newest version of vector_quantize_pytorch since AFAIR the math did not change at all, just the layer names.
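
If only the layer names changed, updating the checkpoint could in principle come down to renaming four keys in the state dict. Here is a minimal sketch of that idea, using exactly the key names from the error message above. It has not been run against the real checkpoint, it takes the "math did not change" recollection on trust, and it does not cover how the state dict is stored inside the .model file; `rename_keys` and `CHECKPOINT_TO_MODEL` are hypothetical names.

```python
# Key names taken from the error message: the names found in the checkpoint
# ("Unexpected key(s)") mapped to the names the model expected ("Missing key(s)").
CHECKPOINT_TO_MODEL = {
    f"rq.layers.0.{proj}.{param}": f"rq.{proj}.{param}"
    for proj in ("project_in", "project_out")
    for param in ("weight", "bias")
}


def rename_keys(state_dict, renames):
    """Return a copy of `state_dict` with keys renamed per `renames` (old -> new).

    Keys that are not listed in `renames` are kept unchanged, and the values
    (the tensors, in a real checkpoint) are passed through untouched.
    """
    return {renames.get(key, key): value for key, value in state_dict.items()}
```

The renamed dict would then go through the usual `load_state_dict` call. If the tensor shapes turn out to differ between versions, the load will still fail, which would mean more than the names changed.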

Jakub Piotr Cłapa · Answer 3 · Thu Mar 07 2024 17:09:44 GMT+0800 (China Standard Time)

Maybe we could do it when we start working on new languages, @zoq?