Contribution: checkpoints AVAILABLE!

freds0 opened this issue 2 years ago · comments

Frederico S. Oliveira commented 2 years ago

Hi guys,
First, I would like to thank @junjun3518 for the excellent work of developing and sharing the code. I trained the model following the paper's settings for two weeks on a V100 GPU, with ratio=2 and ratio=3. I would like to contribute to the project by sharing the checkpoints. Below are the download links.

nuwave x2:
https://drive.google.com/file/d/1pegayKs-i78yWlPuLIp-BCU8KxxCpBzd/view?usp=sharing

nuwave x3:
https://drive.google.com/file/d/12RUMjEALAs0EoEw6Fqf9ZkpTm3COX6sf/view?usp=sharing
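
For readers unsure what the two ratios mean in practice, here is a minimal sketch of the arithmetic. It assumes the 48 kHz target sample rate described in the NU-Wave paper; the function names are hypothetical and are not part of the repository.

```python
from typing import Tuple

# Assumption: the model outputs 48 kHz audio, as described in the NU-Wave paper.
TARGET_SR = 48_000


def input_sample_rate(ratio: int, target_sr: int = TARGET_SR) -> int:
    """Sample rate of the low-resolution input for a given upsampling ratio."""
    if ratio <= 0 or target_sr % ratio != 0:
        raise ValueError("ratio must be a positive divisor of the target sample rate")
    return target_sr // ratio


def missing_band(ratio: int, target_sr: int = TARGET_SR) -> Tuple[float, float]:
    """Frequency band (Hz) the model has to generate: from the Nyquist
    frequency of the input up to the Nyquist frequency of the output."""
    low_sr = input_sample_rate(ratio, target_sr)
    return (low_sr / 2, target_sr / 2)


if __name__ == "__main__":
    for ratio in (2, 3):
        low, high = missing_band(ratio)
        print(f"x{ratio}: input {input_sample_rate(ratio)} Hz, "
              f"generated band {low:.0f}-{high:.0f} Hz")
```

Under that assumption, the x2 checkpoint expects 24 kHz input and the x3 checkpoint expects 16 kHz input.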

The following are images of the training logs.

nuwave x2: epoch, loss, val loss [images not preserved]

nuwave x3: epoch, loss, val loss [images not preserved]

Frederico S. Oliveira · Answer 1 · Tue Apr 12 2022 21:50:45 GMT+0800 (China Standard Time)

I also ran the test scripts for nuwave_x2 and nuwave_x3. [result images not preserved]

이준혁(Junhyeok Lee) · Answer 2 · Wed Apr 13 2022 07:16:56 GMT+0800 (China Standard Time)

Thank you for your great contribution! I added a link to this issue in README.md!

Nathan Raw · Answer 3 · Thu Jun 09 2022 05:11:59 GMT+0800 (China Standard Time)

Hey there @junjun3518 and @freds0. Can I (or @junjun3518, if they'd like) share this model on the Hugging Face Hub?

Edwin · Answer 4 · Sun Sep 18 2022 14:35:31 GMT+0800 (China Standard Time)

Thank you for your great contribution! I want to use the checkpoint to upsample music. What kind of hardware do I need? Can a personal laptop run this model?

이준혁(Junhyeok Lee) · Answer 5 · Sun Sep 18 2022 14:39:02 GMT+0800 (China Standard Time)

Yes, it can also run on a CPU (but very slowly).
For music, I don't recommend using this project, since it was trained only on clean speech, without music.

Edwin · Answer 6 · Sun Sep 18 2022 16:17:51 GMT+0800 (China Standard Time)

Thanks for getting back to me so quickly. If I use music data to train the model, and then use the model to upsample, is that feasible?

이준혁(Junhyeok Lee) · Answer 7 · Sun Sep 18 2022 16:25:28 GMT+0800 (China Standard Time)

I think it depends on the instruments in the source.
For example, it is hard to apply to electronic music, since the high-frequency sounds of electronic music do not correlate with its low-frequency sounds.
For classical instruments or piano, it is applicable.
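
To illustrate the correlation argument, here is a rough sketch using an idealized harmonic model, in which overtones sit at integer multiples of the fundamental. This is an assumption made for illustration only: real pianos are slightly inharmonic and their upper partials decay quickly. The 8 kHz and 24 kHz band edges also assume the x3 setting with a 48 kHz target, and the function name is hypothetical.

```python
from typing import List


def harmonics_in_band(f0: float, band_low: float, band_high: float) -> List[int]:
    """Return the harmonic numbers k for which band_low < k * f0 <= band_high.

    Idealized model: partials are exact integer multiples of f0.
    """
    if f0 <= 0:
        raise ValueError("f0 must be positive")
    result = []
    k = 1
    while k * f0 <= band_high:
        if k * f0 > band_low:
            result.append(k)
        k += 1
    return result


if __name__ == "__main__":
    # A4 = 440 Hz; band assumed to be missing from a 16 kHz input: 8-24 kHz.
    ks = harmonics_in_band(440.0, 8_000.0, 24_000.0)
    print(f"harmonics {ks[0]}..{ks[-1]} ({len(ks)} partials) fall in the missing band")
```

In this model, every partial in the missing band sits at a frequency fixed by the fundamental, which is visible in the low band. A synthesizer can put arbitrary content up there, so the low band gives far less information about it.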

Saurabh · Answer 8 · Mon Jan 16 2023 06:22:57 GMT+0800 (China Standard Time)

Would you say 700 epochs are enough for training?