B. Shen (sbwww)

sbwww

Geek Repo

Company:IIE, UCAS

Location:Beijing

Home Page:sbwww.github.io

Github PK Tool:Github PK Tool

B. Shen's repositories

COST-EFF

[EMNLP 2022] COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models

Language:PythonLicense:Apache-2.0Stargazers:7Issues:1Issues:0

sbwww.github.io

personal homepage

Language:HTMLLicense:NOASSERTIONStargazers:2Issues:1Issues:0

LLM-Shearing

Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:1Issues:0Issues:0
Language:HTMLStargazers:0Issues:1Issues:0

bert-squeeze

🛠️ Tools for Transformers compression using PyTorch Lightning ⚡

Language:PythonStargazers:0Issues:0Issues:0

ChineseNLPCorpus

中文自然语言处理数据集,平时做做实验的材料。欢迎补充提交合并。

Stargazers:0Issues:0Issues:0

cs-self-learning

计算机自学指南

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

Daxuexi

北京 青年大学习 使用Github Actions自动完成

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Diffusion-BERT

Implementation of DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

Diffusion-LM

Diffusion-LM

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

License:MITStargazers:0Issues:0Issues:0

IoT-For-Beginners

12 Weeks, 24 Lessons, IoT for All!

Language:C++License:MITStargazers:0Issues:0Issues:0

lighteval

LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.

License:Apache-2.0Stargazers:0Issues:0Issues:0

lm-evaluation-harness

A framework for few-shot evaluation of autoregressive language models.

License:MITStargazers:0Issues:0Issues:0

mace

MACE is a deep learning inference framework optimized for mobile heterogeneous computing platforms.

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mlc-llm

Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

mlx

MLX: An array framework for Apple silicon

Language:C++License:MITStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

RWKV-Android

使用Android cpu 运行 RWKV V4 ONNX

Language:JavaStargazers:0Issues:0Issues:0
Stargazers:0Issues:1Issues:0

smoothquant

SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

tvm

Open deep learning compiler stack for cpu, gpu and specialized accelerators

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

UCAS_exam_review

**科学院大学网安-计算机相关课程资源,高级人工智能,深度学习,应用密码学,机器学习,信息隐藏,信息论与编码,多媒体编码等

Stargazers:0Issues:0Issues:0

wanda

A simple and effective LLM pruning approach.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0