Ruonan Wang's repositories
bloomz.cpp
C++ implementation for BLOOM
Folder-Structure-Conventions
Folder / directory structure options and naming conventions for software projects
HelloGitHub
Share interesting, entry-level open source projects on GitHub.
ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max). A PyTorch LLM library that seamlessly integrates with HuggingFace, LangChain, LlamaIndex, DeepSpeed, vLLM, FastChat, ModelScope, etc.
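As a rough illustration of the HuggingFace-style integration described above, here is a minimal sketch of low-bit loading with ipex-llm; the checkpoint name is a placeholder and the `load_in_4bit` flag follows the library's documented low-bit loading path, so treat the details as assumptions rather than a definitive recipe.

```python
# Minimal sketch: load a HuggingFace checkpoint through ipex-llm's drop-in
# AutoModel class and run a short generation. The model path is a placeholder.
from transformers import AutoTokenizer
from ipex_llm.transformers import AutoModelForCausalLM  # drop-in replacement for transformers' class

model_path = "meta-llama/Llama-2-7b-chat-hf"  # placeholder checkpoint

# load_in_4bit=True quantizes the weights to INT4 at load time for low-memory local inference
model = AutoModelForCausalLM.from_pretrained(model_path, load_in_4bit=True)
tokenizer = AutoTokenizer.from_pretrained(model_path)

inputs = tokenizer("What does ipex-llm accelerate?", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```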
lightning
The most intuitive, flexible way to build PyTorch models and Lightning apps that glue together everything around the models, without the pain of infrastructure, cost management, scaling, and everything else.
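To make that description concrete, a minimal, self-contained sketch of a LightningModule trained by the Trainer; the toy dataset and layer sizes are purely illustrative and not tied to any repository listed here.

```python
# Minimal sketch: a LightningModule with a training step and optimizer,
# trained for one epoch on random data via the Trainer.
import torch
import lightning as L
from torch.utils.data import DataLoader, TensorDataset


class LitRegressor(L.LightningModule):
    def __init__(self):
        super().__init__()
        self.net = torch.nn.Linear(8, 1)

    def training_step(self, batch, batch_idx):
        x, y = batch
        loss = torch.nn.functional.mse_loss(self.net(x), y)
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=1e-3)


# Toy dataset: 64 random samples, batch size 16
data = DataLoader(TensorDataset(torch.randn(64, 8), torch.randn(64, 1)), batch_size=16)
L.Trainer(max_epochs=1, accelerator="cpu", logger=False).fit(LitRegressor(), data)
```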
llama.cpp
LLM inference in C/C++
LongLoRA
Code and documentation for LongLoRA and LongAlpaca
neural-compressor
Intel® Neural Compressor (formerly Intel® Low Precision Optimization Tool), which provides unified APIs for network compression techniques such as low-precision quantization, sparsity, pruning, and knowledge distillation across different deep learning frameworks, in pursuit of optimal inference performance.
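For illustration only, a short post-training dynamic quantization sketch; it assumes the 2.x API (`PostTrainingQuantConfig`, `quantization.fit`), which may differ in other releases, and the toy model is a placeholder.

```python
# Sketch under the assumed 2.x API: dynamically quantize a small PyTorch model.
# PostTrainingQuantConfig and fit are taken from that API version; the model is a toy.
import torch
from neural_compressor import PostTrainingQuantConfig
from neural_compressor.quantization import fit

float_model = torch.nn.Sequential(
    torch.nn.Linear(16, 16), torch.nn.ReLU(), torch.nn.Linear(16, 4)
)

# "dynamic" post-training quantization requires no calibration dataloader
q_model = fit(model=float_model, conf=PostTrainingQuantConfig(approach="dynamic"))
q_model.save("./quantized_model")  # writes the quantized weights and config to disk
```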