Merge Transformers language models by use of gradient parameters.
Repository from Github https://github.comoobabooga/BlockMerge_GradientRepository from Github https://github.comoobabooga/BlockMerge_Gradient