[ICLR 2024] Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
[NeurIPS 2024] Can LLMs Learn by Teaching? A Preliminary Study
Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs