jiqing-feng's repositories
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, and mixed-precision support
bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
ClipBERT
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Diffusion-MU-Attack
The official implementation of the paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now". This work introduces a fast and efficient attack method for generating toxic content with safety-driven diffusion models.
FlexFlow
A distributed deep learning framework.
GEAR
GEAR: An Efficient KV Cache Compression Recipe for Near-Lossless Generative Inference of LLM
intel-extension-for-transformers
Extends Hugging Face Transformers APIs for Transformer-based models and improves the productivity of inference deployment. With extremely compressed models, the toolkit can greatly improve inference efficiency on Intel platforms.
lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
neural-compressor
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool) provides unified APIs for network compression technologies, such as low-precision quantization, sparsity, pruning, and knowledge distillation, across different deep learning frameworks to pursue optimal inference performance.
optimum
🏎️ Accelerate training and inference of 🤗 Transformers with easy-to-use hardware optimization tools
optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch_geometric
Graph Neural Network Library for PyTorch
tau
Pipeline Parallelism for PyTorch
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
models
Intel® AI Reference Models: contains Intel optimizations for running deep learning workloads on Intel® Xeon® Scalable processors and Intel® Data Center GPUs
ProtST
Camera-ready repo for ProtST
q-diffusion
[ICCV 2023] Q-Diffusion: Quantizing Diffusion Models.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs