chavinlo / musicgen_trainer

simple trainer for musicgen/audiocraft

Plans for Encodec fine-tuning

0xprincess opened this issue

Hi! Do you have any plans for writing encodec fine-tuning code? The current checkpoint seems to perform really poorly on most sorts of electronic music so it's definitely a barrier for achieving high-quality generations.

commented

Training LoRAs would be great! That way you can expand the Encodec's capabilities to infinity by automatically loading the appropriate LoRA for the genre.

commented

> Hi! Do you have any plans for writing encodec fine-tuning code? The current checkpoint seems to perform really poorly on most sorts of electronic music so it's definitely a barrier for achieving high-quality generations.
@0xprincess

Yes

commented

> Training LoRAs would be great! That way you can expand the Encodec's capabilities to infinity by automatically loading the appropriate LoRA for the genre.

will try to add LoRAs (LM, not encodec yet) for the next release

@chavinlo btw, I noticed that the current release of MusicGen uses a mono version of EnCodec, but the original EnCodec release has a 48 kHz stereo model. If we're talking about fine-tuning, do you think swapping MusicGen's current 32 kHz EnCodec for the 48 kHz model is possible? I personally think that having stereo audio would be great.
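
For anyone weighing the swap asked about above, here is a rough sketch of why it is probably not a drop-in change. It only compares the token streams the two codecs produce; it does not load either model. The numbers are assumptions taken from the published configurations as I understand them (MusicGen's codec: 32 kHz mono, 50 Hz frame rate, 4 codebooks of 2048 entries; EnCodec 48 kHz stereo: 150 Hz frame rate, up to 16 codebooks of 1024 entries at 24 kbps), and `CodecConfig` and `is_drop_in_replacement` are hypothetical names made up for this illustration.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class CodecConfig:
    """Token-stream shape of a neural audio codec (illustrative only)."""
    name: str
    sample_rate: int     # Hz
    channels: int
    frame_rate: int      # latent frames per second
    num_codebooks: int   # residual quantizer stages used
    codebook_size: int   # entries per codebook

    @property
    def tokens_per_second(self) -> int:
        # Each frame emits one token per codebook.
        return self.frame_rate * self.num_codebooks

    @property
    def bitrate_kbps(self) -> float:
        # Each token carries log2(codebook_size) bits.
        return self.tokens_per_second * math.log2(self.codebook_size) / 1000


def is_drop_in_replacement(current: CodecConfig, candidate: CodecConfig) -> bool:
    """True only if the LM would see the same token layout from both codecs."""
    return (
        current.frame_rate == candidate.frame_rate
        and current.num_codebooks == candidate.num_codebooks
        and current.codebook_size == candidate.codebook_size
    )


# Assumed values, see the note above; verify against the actual checkpoints.
MUSICGEN_CODEC = CodecConfig("musicgen-32khz-mono", 32_000, 1, 50, 4, 2048)
ENCODEC_48K = CodecConfig("encodec-48khz-stereo", 48_000, 2, 150, 16, 1024)

if __name__ == "__main__":
    for cfg in (MUSICGEN_CODEC, ENCODEC_48K):
        print(cfg.name, cfg.tokens_per_second, "tokens/s,", cfg.bitrate_kbps, "kbps")
    print("drop-in:", is_drop_in_replacement(MUSICGEN_CODEC, ENCODEC_48K))
```

Under these assumptions the language model would see a different frame rate, codebook count, and vocabulary size, so swapping the codec would likely mean retraining the LM on the new tokens rather than only fine-tuning the codec.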