XIAOMING WANG's starred repositories

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:35283Issues:343Issues:2792

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonLicense:NOASSERTIONStargazers:14944Issues:262Issues:210

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:10441Issues:162Issues:765

mmcv

OpenMMLab Computer Vision Foundation

Language:PythonLicense:Apache-2.0Stargazers:5880Issues:84Issues:1153

OpenPrompt

An Open-Source Framework for Prompt-Learning.

Language:PythonLicense:Apache-2.0Stargazers:4348Issues:44Issues:257

Conference-Acceptance-Rate

Acceptance rates for the major AI conferences

Language:Jupyter NotebookLicense:MITStargazers:4222Issues:128Issues:28

big-sleep

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun

Language:PythonLicense:MITStargazers:2569Issues:47Issues:87

DeBERTa

The implementation of DeBERTa

Language:PythonLicense:MITStargazers:1986Issues:42Issues:123
Language:PythonLicense:Apache-2.0Stargazers:1473Issues:32Issues:75

LAMA

LAnguage Model Analysis

Language:PythonLicense:NOASSERTIONStargazers:1352Issues:71Issues:48

OpenMMLabCourse

OpenMMLab course index and stuff

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1027Issues:12Issues:7

CLUECorpus2020

Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

PaLM-pytorch

Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways

Language:PythonLicense:MITStargazers:821Issues:16Issues:11

HiVT

[CVPR 2022] HiVT: Hierarchical Vector Transformer for Multi-Agent Motion Prediction

Language:PythonLicense:Apache-2.0Stargazers:618Issues:35Issues:50

autoprompt

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

Language:PythonLicense:Apache-2.0Stargazers:593Issues:11Issues:31

CValues

面向中文大模型价值观的评估与对齐研究

Language:PythonLicense:Apache-2.0Stargazers:472Issues:1Issues:7

llm-sp

Papers and resources related to the security and privacy of LLMs 🤖

Language:PythonLicense:Apache-2.0Stargazers:426Issues:15Issues:8

pytorch-meta-optimizer

A PyTorch implementation of Learning to learn by gradient descent by gradient descent

Language:PythonLicense:MITStargazers:310Issues:16Issues:10

argoverse-forecasting

Official Repository for Argoverse Motion Forecasting Baselines

Language:PythonLicense:BSD-3-Clause-ClearStargazers:250Issues:7Issues:22
Language:PythonLicense:Apache-2.0Stargazers:88Issues:4Issues:2

OpenAI_PTCompletion

A Parallel Completion Python Library that boosts your OpenAI-API query with task queue & multiprocessing.

Language:PythonLicense:MITStargazers:22Issues:1Issues:0

Cue-CoT

Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]

Language:PythonStargazers:21Issues:1Issues:0

MSQA

Microsoft question-answering dataset

Language:PythonLicense:NOASSERTIONStargazers:10Issues:2Issues:1
Language:PythonLicense:MITStargazers:10Issues:1Issues:1

KddRES

This is the first Cantonese Dialogue Dataset

Language:PythonStargazers:9Issues:2Issues:0
Language:PythonLicense:MITStargazers:8Issues:1Issues:0

self_restraint

Clear code lead to clear mind.

Language:Jupyter NotebookLicense:MITStargazers:3Issues:1Issues:0
License:Apache-2.0Stargazers:2Issues:1Issues:0