Dinghow

Dinghow Yang's starred repositories

ollama

Get up and running with Llama 3, Mistral, Gemma 2, and other large language models.

Language:GoMIT79868 482 3691

grok-1

Grok open release

Language:PythonApache-2.049195 561 202

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION23610 197 199

ZLUDA

CUDA on AMD GPUs

Language:RustApache-2.08466 120 153

Sccache is a ccache-like tool. It is used as a compiler wrapper and avoids compilation when possible. Sccache has the capability to utilize caching in remote storage environments, including various cloud storage options, or alternatively, in local storage.

Language:RustApache-2.05602 52 854

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonApache-2.04334 25 82

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonApache-2.03376 24 425

baize-chatbot

Let ChatGPT teach your own chatbot in hours with a single GPU!

Language:PythonGPL-3.03148 50 52

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

MIT3115 25 71

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonNOASSERTION2850 33 138

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonMIT2160 24 159

ThunderKittens

Tile primitives for speedy kernels

Language:CudaMIT1405 25 20

splatter-image

Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024

Language:PythonBSD-3-Clause761 23 53

EAGLE

Official Implementation of EAGLE-1 and EAGLE-2

Language:PythonApache-2.0673 12 92

code-act

Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

Language:PythonMIT400 5 9

chat_templates

Chat Templates for 🤗 HuggingFace Large Language Models

Language:JinjaMIT369 6 9

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language:PythonApache-2.0351 8 22

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonApache-2.0300 7 20

Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

Apache-2.0300 4 10

LLMDebugger

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step

Language:PythonApache-2.0296 6 8

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

Apache-2.0280 13 1

Q-Bench

①[ICLR2024 Spotlight] (GPT-4V/Gemini-Pro/Qwen-VL-Plus+16 OS MLLMs) A benchmark for multi-modality LLMs (MLLMs) on low-level vision and visual quality assessment.

Language:Jupyter Notebook215 1 11

Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Language:PythonApache-2.0119 1 11

ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios

Language:Python119 7 13

3dgcn

Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis

Language:PythonMIT114 4 11

LM-Infinite

Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"

Language:PythonMIT99 4 10

PointMetaBase

This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"

Language:PythonMIT85 4 20

Grounded_3D-LLM

Code&Data for Grounded 3D-LLM with Referent Tokens

Language:Python60 6 2

llm-scheduling-artifact

Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“

Language:PythonApache-2.046 4 1

Backdoor_DPR

Code for "Backdoor Attacks on Dense Passage Retrievers for Disseminating Misinformation"

Language:Python4 2 1