Dinghow Yang (Dinghow)

Company: Peking University

Location: Hangzhou, China

Home Page: https://dinghow.site


Organizations
TJMSC

Dinghow Yang's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python, License: Apache-2.0, Stargazers: 35868, Issues: 349, Issues: 1728

autogen

A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap

Language: Jupyter Notebook, License: CC-BY-4.0, Stargazers: 28703, Issues: 362, Issues: 1473

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python, License: Apache-2.0, Stargazers: 18324, Issues: 158, Issues: 1411

codellama

Inference code for CodeLlama models

Language: Python, License: NOASSERTION, Stargazers: 15512, Issues: 176, Issues: 191

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs

Language: Python, License: Apache-2.0, Stargazers: 13148, Issues: 99, Issues: 758

tvm

Open deep learning compiler stack for CPU, GPU, and specialized accelerators

Language: Python, License: Apache-2.0, Stargazers: 11449, Issues: 382, Issues: 3315

triton

Development repository for the Triton language and compiler

litellm

Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)

Language: Python, License: NOASSERTION, Stargazers: 10711, Issues: 62, Issues: 2699

Megatron-LM

Ongoing research training transformer models at scale

Language: Python, License: NOASSERTION, Stargazers: 9489, Issues: 159, Issues: 614

text-generation-inference

Large Language Model Text Generation Inference

Language: Python, License: Apache-2.0, Stargazers: 8453, Issues: 99, Issues: 1218

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++, License: Apache-2.0, Stargazers: 7641, Issues: 89, Issues: 1627

chat-ui

Open source codebase powering the HuggingChat app

Language: TypeScript, License: Apache-2.0, Stargazers: 6869, Issues: 82, Issues: 513

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python, License: MIT, Stargazers: 6382, Issues: 60, Issues: 78

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language: Python, License: MIT, Stargazers: 2682, Issues: 35, Issues: 216

safetensors

Simple, safe way to store and distribute tensors

Language: Python, License: Apache-2.0, Stargazers: 2631, Issues: 40, Issues: 170

tvm_mlir_learn

A collection of compiler learning resources.

spikingjelly

SpikingJelly is an open-source deep learning framework for Spiking Neural Network (SNN) based on PyTorch.

Language: Python, License: NOASSERTION, Stargazers: 1238, Issues: 18, Issues: 401

FateZero

[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"

Language: Jupyter Notebook, License: MIT, Stargazers: 1080, Issues: 14, Issues: 33

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

codereview.gpt

Reviews your Pull/Merge Requests using ChatGPT

Language: JavaScript, License: MIT, Stargazers: 534, Issues: 10, Issues: 23

Point-Bind_Point-LLM

Align 3D Point Cloud with Multi-modalities for Large Language Models

Language: Python, License: MIT, Stargazers: 384, Issues: 15, Issues: 12

MetaMath

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Language: Python, License: Apache-2.0, Stargazers: 355, Issues: 7, Issues: 27

OpenPCSeg

OpenPCSeg: Open Source Point Cloud Segmentation Toolbox and Benchmark

segformer-pytorch

Implementation of Segformer, Attention + MLP neural network for segmentation, in Pytorch

Language: Python, License: MIT, Stargazers: 324, Issues: 9, Issues: 13

SeqGPT

SeqGPT: An Out-of-the-box Large Language Model for Open Domain Sequence Understanding

Language: Python, License: Apache-2.0, Stargazers: 201, Issues: 4, Issues: 14

flash-llm

Flash-LLM: Enabling Cost-Effective and Highly-Efficient Large Generative Model Inference with Unstructured Sparsity

Language: Cuda, License: Apache-2.0, Stargazers: 160, Issues: 5, Issues: 4

Uni3DETR

Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer".

Language: Python, License: Apache-2.0, Stargazers: 67, Issues: 4, Issues: 7

recom

An Optimizing Compiler for Recommendation Model Inference

Language: C++, License: Apache-2.0, Stargazers: 21, Issues: 4, Issues: 1

llm-code-review

A container GitHub Action that reviews a pull request using a Hugging Face LLM.

Language: Python, License: Apache-2.0, Stargazers: 18, Issues: 0, Issues: 0