Yubai Wei's repositories
continual_finetuning_with_kl
made a adaption to training language models with kl penalty to alleviate catastrophic forgetting
code-accumulation
record some basic code writing in work
Language:Python000
Language:Python000
Language:Python000
paper
paper recently read
000