Mukai Li (kiaia)

Company: HKU, BUAA

Location: Shanghai

Mukai Li's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language: Python | License: MIT | Stargazers: 165958 | Issues: 1552 | Issues: 2548

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 36220 | Issues: 348 | Issues: 1752

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language: Python | License: Apache-2.0 | Stargazers: 34432 | Issues: 341 | Issues: 2684

Rectangle

Move and resize windows on macOS with keyboard shortcuts and snap areas

Language: Swift | License: NOASSERTION | Stargazers: 25446 | Issues: 96 | Issues: 621

MiniGPT-4

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language: Python | License: BSD-3-Clause | Stargazers: 25262 | Issues: 220 | Issues: 456

flash-attention

Fast and memory-efficient exact attention

Language: Python | License: BSD-3-Clause | Stargazers: 13052 | Issues: 114 | Issues: 978

MOSS

An open-source tool-augmented conversational language model from Fudan University

Language: Python | License: Apache-2.0 | Stargazers: 11904 | Issues: 124 | Issues: 353

gorilla

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Language: Python | License: Apache-2.0 | Stargazers: 11101 | Issues: 99 | Issues: 205

qlora

QLoRA: Efficient Finetuning of Quantized LLMs

Language: Jupyter Notebook | License: MIT | Stargazers: 9842 | Issues: 84 | Issues: 247
Language: Python | License: Apache-2.0 | Stargazers: 7053 | Issues: 67 | Issues: 70

consistency_models

Official repo for consistency models.

Language: Python | License: MIT | Stargazers: 6056 | Issues: 60 | Issues: 51

GPU-Puzzles

Solve puzzles. Learn CUDA.

Language: Jupyter Notebook | License: MIT | Stargazers: 5531 | Issues: 29 | Issues: 28

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4597 | Issues: 50 | Issues: 420

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language: Python | License: MIT | Stargazers: 4507 | Issues: 120 | Issues: 54

rebiber

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Language: Python | License: MIT | Stargazers: 2560 | Issues: 15 | Issues: 29

UltraChat

Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)

Language: Python | License: MIT | Stargazers: 2199 | Issues: 39 | Issues: 30
Language: Jupyter Notebook | License: MIT | Stargazers: 1659 | Issues: 42 | Issues: 72

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language: Python | License: Apache-2.0 | Stargazers: 1586 | Issues: 41 | Issues: 21

ceval

Official GitHub repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]

Language: Python | License: MIT | Stargazers: 1581 | Issues: 14 | Issues: 80

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License: MIT | Stargazers: 1540 | Issues: 19 | Issues: 0

FastEdit

🩹Editing large language models within 10 seconds⚡

Language: Python | License: Apache-2.0 | Stargazers: 1261 | Issues: 14 | Issues: 27

lmms-eval

Accelerating the development of large multimodal models (LMMs) with lmms-eval

Language: Python | License: NOASSERTION | Stargazers: 1255 | Issues: 4 | Issues: 120

EasyContext

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Language: Python | License: Apache-2.0 | Stargazers: 581 | Issues: 9 | Issues: 41

ring-flash-attention

Ring attention implementation with flash attention

Video-MME

✨✨Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

Apple-Monitor

An Apple Store inventory monitor implemented in Java. It pushes real-time inventory information via Bark, DingTalk, WeChat, and other channels. Currently supports the China and Japan regions.

Language: Java | License: MIT | Stargazers: 333 | Issues: 6 | Issues: 29

LEval

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Language: Python | License: GPL-3.0 | Stargazers: 330 | Issues: 4 | Issues: 17

diffusion-of-thoughts

Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

ChatBridge

ChatBridge, an approach to learning a unified multimodal model to interpret, correlate, and reason about various modalities without relying on all combinations of paired data.

Language: Python | License: BSD-3-Clause | Stargazers: 45 | Issues: 2 | Issues: 7