Xiang Zhao's repositories
quant_gptq_series
GPTQ-based LLM quantization methods
quant_omniquant_series
OmniQuant-based LLM quantization methods
structured_pruning_llm
LLM Structured Pruning Methods
unstructured_pruning_llm
LLM Unstructured Pruning Methods
nlp_merge_series
NLP merge methods
Language: Jupyter Notebook
Language: Jupyter Notebook
Language: Jupyter Notebook
Language: Cuda