Zhenyu (Allen) Zhang's starred repositories
LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
FlagEmbedding
Retrieval and Retrieval-augmented LLMs
schedule_free
Schedule-Free Optimization in PyTorch
Awesome-Graph-LLM
A collection of AWESOME things about Graph-Related LLMs.
TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
Triton-Puzzles
Puzzles for learning Triton
Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
Awesome-LLM-Long-Context-Modeling
📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥
Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
recurrentgemma
Open weights language model from Google DeepMind, based on Griffin.
long-llms-learning
A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks
LLMs-Finetuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
scattermoe
Triton-based implementation of Sparse Mixture of Experts.
tinyBenchmarks
Evaluating LLMs with fewer examples