Ruonan Wang (rnwang04)

Company: Intel

Location: Shanghai, China

Ruonan Wang's starred repositories

ollama

Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
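A minimal sketch of querying a locally running ollama server through its HTTP API (assumes `ollama serve` is running on the default localhost:11434 endpoint and the model tag shown has already been pulled; both are illustrative):

    # Query a local ollama server via its REST API.
    import requests

    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": "llama3.2",          # illustrative model tag
            "prompt": "Why is the sky blue?",
            "stream": False,              # return a single JSON object instead of a stream
        },
        timeout=120,
    )
    resp.raise_for_status()
    print(resp.json()["response"])        # generated text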

annotated_deep_learning_paper_implementations

🧑‍🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language: Python | License: MIT | Stargazers: 54386 | Issues: 448 | Issues: 132

text-generation-webui

A Gradio web UI for Large Language Models.

Language: Python | License: AGPL-3.0 | Stargazers: 39811 | Issues: 327 | Issues: 3619

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 31740 | Issues: 201 | Issues: 4905

faiss

A library for efficient similarity search and clustering of dense vectors.
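A minimal sketch of exact nearest-neighbor search with faiss (dimensions and data below are made up for illustration):

    # Build a flat (exact) L2 index and query it; the data here is random.
    import numpy as np
    import faiss

    d = 64                                          # vector dimension
    xb = np.random.random((10_000, d)).astype("float32")   # database vectors
    xq = np.random.random((5, d)).astype("float32")        # query vectors

    index = faiss.IndexFlatL2(d)                    # exact L2 search
    index.add(xb)                                   # index the database
    distances, ids = index.search(xq, 4)            # 4 nearest neighbors per query
    print(ids)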

milvus

A cloud-native vector database and storage layer for next-generation AI applications

Language: Go | License: Apache-2.0 | Stargazers: 29681 | Issues: 277 | Issues: 11816

LLMs-from-scratch

Implementing a ChatGPT-like LLM in PyTorch from scratch, step by step

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 28003 | Issues: 301 | Issues: 89

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language: Python | License: Apache-2.0 | Stargazers: 27640 | Issues: 226 | Issues: 4646
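A minimal sketch of offline batch generation with vLLM's Python API (the model name is illustrative and must be available locally or downloadable):

    # Offline batched generation with vLLM.
    from vllm import LLM, SamplingParams

    prompts = ["Hello, my name is", "The capital of France is"]
    sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

    llm = LLM(model="facebook/opt-125m")    # illustrative small model
    outputs = llm.generate(prompts, sampling_params)

    for output in outputs:
        print(output.prompt, "->", output.outputs[0].text)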

jan

Jan is an open-source alternative to ChatGPT that runs 100% offline on your computer, with support for multiple engines (llama.cpp, TensorRT-LLM).

Language: TypeScript | License: AGPL-3.0 | Stargazers: 22443 | Issues: 126 | Issues: 1766

Awesome-pytorch-list

A comprehensive list of PyTorch-related content on GitHub, such as different models, implementations, helper libraries, tutorials, etc.

onnxruntime

ONNX Runtime: a cross-platform, high-performance ML inference and training accelerator
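A minimal sketch of running an exported model with the ONNX Runtime Python API (the model path and input shape are placeholders):

    # Run inference on an exported ONNX model; "model.onnx" is a placeholder path.
    import numpy as np
    import onnxruntime as ort

    sess = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
    input_name = sess.get_inputs()[0].name              # name of the first graph input
    dummy = np.random.randn(1, 3, 224, 224).astype(np.float32)  # illustrative shape
    outputs = sess.run(None, {input_name: dummy})       # None -> return all outputs
    print([o.shape for o in outputs])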

axolotl

Go ahead and axolotl questions

Language: Python | License: Apache-2.0 | Stargazers: 7649 | Issues: 49 | Issues: 649

Qwen2

Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.
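A minimal sketch of chatting with a Qwen2 checkpoint through Hugging Face transformers (the checkpoint name and generation settings are illustrative):

    # Load an instruction-tuned Qwen2 checkpoint and generate one reply.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = "Qwen/Qwen2-7B-Instruct"     # illustrative checkpoint
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto", device_map="auto")

    messages = [{"role": "user", "content": "Give me a one-line summary of ONNX."}]
    text = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
    inputs = tokenizer(text, return_tensors="pt").to(model.device)

    generated = model.generate(**inputs, max_new_tokens=64)
    reply = tokenizer.decode(generated[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True)
    print(reply)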

chainlit

Build Conversational AI in minutes ⚡️

Language: Python | License: Apache-2.0 | Stargazers: 6872 | Issues: 53 | Issues: 769
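A minimal sketch of a chainlit echo app (save as app.py and launch with `chainlit run app.py`; the handler simply echoes the user message back):

    # Minimal chainlit app: reply to every incoming message with an echo.
    import chainlit as cl

    @cl.on_message
    async def on_message(message: cl.Message):
        await cl.Message(content=f"You said: {message.content}").send()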

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language: Python | License: BSD-3-Clause | Stargazers: 5559 | Issues: 63 | Issues: 98

koboldcpp

Run GGUF models easily with a KoboldAI UI. One File. Zero Install.

Language: C++ | License: AGPL-3.0 | Stargazers: 4950 | Issues: 69 | Issues: 750

llm-numbers

Numbers every LLM developer should know

cortex

Drop-in, local AI alternative to the OpenAI stack. Multi-engine (llama.cpp, TensorRT-LLM, ONNX). Powers 👋 Jan

Language: C++ | License: Apache-2.0 | Stargazers: 1885 | Issues: 14 | Issues: 411

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: Python | License: NOASSERTION | Stargazers: 1475 | Issues: 28 | Issues: 85

onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

Language: C++ | License: MIT | Stargazers: 1155 | Issues: 38 | Issues: 156

intel-npu-acceleration-library

Intel® NPU Acceleration Library

Language: Python | License: Apache-2.0 | Stargazers: 447 | Issues: 29 | Issues: 71

chatllm.cpp

Pure C++ implementation of several models for real-time chatting on your computer (CPU)

Language: C++ | License: MIT | Stargazers: 356 | Issues: 17 | Issues: 35

level-zero

oneAPI Level Zero Specification Headers and Loader

Language: C++ | License: MIT | Stargazers: 209 | Issues: 33 | Issues: 136

fp6_llm

Efficient GPU support for LLM inference with x-bit quantization (e.g., FP6, FP5).

Language: Cuda | License: Apache-2.0 | Stargazers: 179 | Issues: 5 | Issues: 9

linux-npu-driver

Intel® NPU (Neural Processing Unit) Driver

Language: C++ | License: MIT | Stargazers: 151 | Issues: 12 | Issues: 17

Data-Paralle-Cpp

A personal Chinese translation of "Data Parallel C++"

Language: TeX | License: Apache-2.0 | Stargazers: 68 | Issues: 3 | Issues: 0

Langchain-Chatchat

Knowledge Base QA using RAG pipeline on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with IPEX-LLM

Language: Python | License: Apache-2.0 | Stargazers: 14 | Issues: 1 | Issues: 0