Dinghow

Peking University

Hangzhou, China

https://dinghow.site

Organizations

TJMSC

Dinghow Yang's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | 35868 349 1728

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language: Jupyter Notebook | License: CC-BY-4.0 | 28703 362 1473

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | 18324 158 1411

codellama

Inference code for CodeLlama models

Language: Python | License: NOASSERTION | 15512 176 191

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Language: Python | License: Apache-2.0 | 13148 99 758

tvm

Open deep learning compiler stack for CPUs, GPUs, and specialized accelerators

Language: Python | License: Apache-2.0 | 11449 382 3315

triton

Development repository for the Triton language and compiler

Language: C++ | License: MIT | 11235 179 1192

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language: Python | License: NOASSERTION | 10711 62 2699

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | 9489 159 614

text-generation-inference

Large Language Model Text Generation Inference

Language: Python | License: Apache-2.0 | 8453 99 1218

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | 7641 89 1627

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript | License: Apache-2.0 | 6869 82 513

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | 6382 60 78

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language: Python | License: MIT | 2682 35 216

safetensors

Simple, safe way to store and distribute tensors

Language: Python | License: Apache-2.0 | 2631 40 170

tvm_mlir_learn

A collection of compiler learning resources.

Language: Python | 1961 35 4

spikingjelly

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

Language: Python | License: NOASSERTION | 1238 18 401

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook | License: MIT | 1080 14 33

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Language: Python | 1008 37 20

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Language: Python | 982 41 1

codereview.gpt

Reviews your Pull/Merge Requests using ChatGPT

Language: JavaScript | License: MIT | 534 10 23

Point-Bind_Point-LLM

Align 3D Point Cloud with Multi-modalities for Large Language Models

Language: Python | License: MIT | 384 15 12

MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Language: Python | License: Apache-2.0 | 355 7 27

OpenPCSeg

OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark

Language: Python | 327 13 24

segformer-pytorch

Implementation of Segformer, an Attention + MLP neural network for segmentation, in PyTorch

Language: Python | License: MIT | 324 9 13

SeqGPT

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Language: Python | License: Apache-2.0 | 201 4 14

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language: Cuda | License: Apache-2.0 | 160 5 4

Uni3DETR

Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer".

Language: Python | License: Apache-2.0 | 67 4 7

recom

An Optimizing Compiler for Recommendation Model Inference

Language: C++ | License: Apache-2.0 | 21 4 1

llm-code-review

A container GitHub Action that reviews a pull request using a HuggingFace LLM.

Language: Python | License: Apache-2.0 | 1800