Dinghow Yang (Dinghow)

Dinghow

Geek Repo

Company:Peking University

Location:Hangzhou, China

Home Page:https://dinghow.site

Github PK Tool:Github PK Tool


Organizations
TJMSC

Dinghow Yang's starred repositories

aria-ng-gui

一个 Aria2 图形界面客户端 | An Aria2 GUI for Windows & Linux & MacOS

Language:JavaScriptLicense:MITStargazers:1688Issues:0Issues:0

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language:PythonLicense:MITStargazers:1474Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:35Issues:0Issues:0

api-for-open-llm

Openai style api for open large language models, using LLMs just as chatgpt! Support for LLaMA, LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, Xverse, SqlCoder, CodeLLaMA, ChatGLM, ChatGLM2, ChatGLM3 etc. 开源大模型的统一后端接口

Language:PythonLicense:Apache-2.0Stargazers:2213Issues:0Issues:0

MInference

To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an A100 while maintaining accuracy.

Language:PythonLicense:MITStargazers:572Issues:0Issues:0

Point-SAM

Point-SAM: This is the official repository of "Point-SAM: Promptable 3D Segmentation Model for Point Clouds". We provide codes for running our demo and links to download checkpoints.

Language:PythonLicense:MITStargazers:72Issues:0Issues:0

Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

Stargazers:907Issues:0Issues:0

DIF-Gaussian

MICCAI 2024: Learning 3D Gaussians for Extremely Sparse-View Cone-Beam CT Reconstruction

Language:PythonStargazers:21Issues:0Issues:0

RecFlex

A recommendation model kernel optimizing system

Language:PythonLicense:Apache-2.0Stargazers:6Issues:0Issues:0

DistServe

Disaggregated serving system for Large Language Models (LLMs).

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:199Issues:0Issues:0

Bench2Drive

Closed-loop multi-ability evaluation of end-to-end autonomous driving algorithms

Language:PythonLicense:Apache-2.0Stargazers:518Issues:0Issues:0

LLMDebugger

LDB: A Large Language Model Debugger via Verifying Runtime Execution Step by Step

Language:PythonLicense:Apache-2.0Stargazers:297Issues:0Issues:0

qserve

QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving

Language:PythonLicense:Apache-2.0Stargazers:353Issues:0Issues:0

3dgcn

Convolution in the Cloud: Learning Deformable Kernels in 3D Graph Convolution Networks for Point Cloud Analysis

Language:PythonLicense:MITStargazers:115Issues:0Issues:0

sccache

Sccache is a ccache-like tool. It is used as a compiler wrapper and avoids compilation when possible. Sccache has the capability to utilize caching in remote storage environments, including various cloud storage options, or alternatively, in local storage.

Language:RustLicense:Apache-2.0Stargazers:5602Issues:0Issues:0

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language:PythonLicense:Apache-2.0Stargazers:4334Issues:0Issues:0

llm-awq

[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration

Language:PythonLicense:MITStargazers:2163Issues:0Issues:0

ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios

Language:PythonStargazers:119Issues:0Issues:0

Grounded_3D-LLM

Code&Data for Grounded 3D-LLM with Referent Tokens

Language:PythonStargazers:61Issues:0Issues:0

SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

License:Apache-2.0Stargazers:281Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2852Issues:0Issues:0

ThunderKittens

Tile primitives for speedy kernels

Language:CudaLicense:MITStargazers:1405Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:3117Issues:0Issues:0

splatter-image

Official implementation of `Splatter Image: Ultra-Fast Single-View 3D Reconstruction' CVPR 2024

Language:PythonLicense:BSD-3-ClauseStargazers:764Issues:0Issues:0

code-act

Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

Language:PythonLicense:MITStargazers:400Issues:0Issues:0

LM-Infinite

Implementation of paper "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"

Language:PythonLicense:MITStargazers:100Issues:0Issues:0

llm-scheduling-artifact

Artifact of OSDI '24 paper, ”Llumnix: Dynamic Scheduling for Large Language Model Serving“

Language:PythonLicense:Apache-2.0Stargazers:46Issues:0Issues:0

PointMetaBase

This is a PyTorch implementation of PointMetaBase proposed by our paper "Meta Architecure for Point Cloud Analysis"

Language:PythonLicense:MITStargazers:85Issues:0Issues:0

ChunkLlama

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Language:PythonLicense:Apache-2.0Stargazers:301Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:23662Issues:0Issues:0