Zewen Chi (CZWin32768)

CZWin32768

Geek Repo

Location:Canada

Home Page:zewen-chi.github.io

Github PK Tool:Github PK Tool

Zewen Chi's repositories

XNLG

AAAI-20 paper: Cross-Lingual Natural Language Generation via Pre-Training

Language:PythonLicense:MITStargazers:36Issues:4Issues:4

bitsandbytes-aarch64

aarch64 for bitsandbytes

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

CCLUE

古文语言理解测评基准 Classical Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

unilm

UniLM AI - Unified "Language" Model Pre-training across Tasks, Languages, and Modalities

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

accelerate

🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

adapter-transformers

Huggingface Transformers + Adapters = ❤️

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

alpaca-lora

Instruct-tune LLaMA on consumer hardware

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLLicense:CC0-1.0Stargazers:0Issues:0Issues:0

cloudflare-scrape

A Python module to bypass Cloudflare's anti-bot page.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

crosslingual_winograd

"It's All in the Heads" (Findings of ACL 2021), official implementation and data

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DialoGPT

Large-scale pretraining for dialogue

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:DockerfileStargazers:0Issues:1Issues:0

flores

Facebook Low Resource (FLoRes) MT Benchmark

Language:PythonLicense:CC-BY-SA-4.0Stargazers:0Issues:1Issues:0

improved-aesthetic-predictor

CLIP+MLP Aesthetic Score Predictor

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llama.cpp

Port of Facebook's LLaMA model in C/C++

Language:CLicense:MITStargazers:0Issues:0Issues:0

MAGIC

Language Models Can See: Plugging Visual Controls in Text Generation

Language:PythonStargazers:0Issues:1Issues:0

massive

Tools and Modeling Code for the MASSIVE dataset

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Megatron-DeepSpeed

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

Neural-Collapse

[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features

Language:PythonStargazers:0Issues:1Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PrefixTuning

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Language:PythonStargazers:0Issues:1Issues:0

RL4LMs

A modular RL library to fine-tune language models to human preferences

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

YaLM-100B

Pretrained language model with 100B parameters

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0