Zewen Chi's repositories
bitsandbytes-aarch64
aarch64 support for bitsandbytes
accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, and mixed precision
adapter-transformers
Huggingface Transformers + Adapters = ❤️
alpaca-lora
Instruct-tune LLaMA on consumer hardware
awesome-chatgpt-prompts
This repo includes ChatGPT prompt curation to use ChatGPT better.
cloudflare-scrape
A Python module to bypass Cloudflare's anti-bot page.
crosslingual_winograd
"It's All in the Heads" (Findings of ACL 2021), official implementation and data
DeepSpeedExamples
Example models using DeepSpeed
DialoGPT
Large-scale pretraining for dialogue
improved-aesthetic-predictor
CLIP+MLP Aesthetic Score Predictor
llama.cpp
Port of Facebook's LLaMA model in C/C++
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Neural-Collapse
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) over 100+ datasets.
PrefixTuning
Prefix-Tuning: Optimizing Continuous Prompts for Generation
RL4LMs
A modular RL library to fine-tune language models to human preferences
RWKV-LM
RWKV is an RNN with transformer-level LLM performance that can be trained directly like a GPT (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)