Wei Liu's starred repositories
awesome_lists
Awesome Lists for Tenure-Track Assistant Professors and PhD students. (Survival guide for assistant professors and PhD students)
LLMDataHub
A quick guide to trending instruction fine-tuning datasets
LLM-Shearing
[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mu-scaling
Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales
adaptive-span
Transformer training code for sequential tasks
Chinese-LLaMA-Alpaca
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
DeepSpeedExamples
Example models using DeepSpeed
RedPajama-Data
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
PaddleFleetX
PaddlePaddle's large-model development suite, providing end-to-end toolchains for large language models, cross-modal large models, biocomputing large models, and other domains.
PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
ChatGPT4MT
🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation
ErrorAnalysis_Prompt
🎁[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT
ChatGPT-vs.-BERT
🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT
RWKV-LM
RWKV is an RNN with transformer-level LLM performance that can be trained directly like a GPT (parallelizable). It combines the best of RNNs and transformers: great performance, fast inference, low VRAM usage, fast training, "infinite" ctx_len, and free sentence embeddings.
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
BlenderProc
A procedural Blender pipeline for photorealistic training image generation
safe-rules
A detailed C/C++ coding standards guide, authored by the 360 Quality Engineering Department, applicable to desktop, server, and embedded software systems.