Wei Liu (sublimationAC)

sublimationAC

Geek Repo

Company:XDU & USYD

Location:Xi'an

Github PK Tool:Github PK Tool

Wei Liu's starred repositories

awesome_lists

Awesome Lists for Tenure-Track Assistant Professors and PhD students. (助理教授/博士生生存指南)

Language:PythonLicense:MITStargazers:1391Issues:0Issues:0

LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

License:MITStargazers:2248Issues:0Issues:0

composer

Supercharge Your Model Training

Language:PythonLicense:Apache-2.0Stargazers:5074Issues:0Issues:0
Language:PythonStargazers:136Issues:0Issues:0

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:493Issues:0Issues:0

Mu-scaling

Research without Re-search: Maximal Update Parametrization Yields Accurate Loss Prediction across Scales

Language:PythonStargazers:26Issues:0Issues:0

adaptive-span

Transformer training code for sequential tasks

Language:PythonLicense:NOASSERTIONStargazers:608Issues:0Issues:0

tfrecord

Standalone TFRecord reader/writer with PyTorch data loaders

Language:PythonLicense:MITStargazers:843Issues:0Issues:0

GPT2

An implementation of training for GPT2, supports TPUs

Language:PythonLicense:MITStargazers:1419Issues:0Issues:0

Llama-X

Open Academic Research on Improving LLaMA to SOTA LLM

Language:PythonLicense:Apache-2.0Stargazers:1583Issues:0Issues:0

Chinese-LLaMA-Alpaca

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Language:PythonLicense:Apache-2.0Stargazers:17928Issues:0Issues:0
Language:PythonStargazers:278Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

Language:PythonLicense:Apache-2.0Stargazers:5855Issues:0Issues:0

RedPajama-Data

The RedPajama-Data repository contains code for preparing large datasets for training large language models.

Language:PythonLicense:Apache-2.0Stargazers:4457Issues:0Issues:0

PaddleFleetX

飞桨大模型开发套件,提供大语言模型、跨模态大模型、生物计算大模型等领域的全流程开发工具链。

Language:PythonLicense:Apache-2.0Stargazers:432Issues:0Issues:0

PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

Language:PythonLicense:Apache-2.0Stargazers:1545Issues:0Issues:0

ChatGPT4MT

🎁[ChatGPT4MT] Towards Making the Most of ChatGPT for Machine Translation

Language:PythonStargazers:72Issues:0Issues:0

ErrorAnalysis_Prompt

:gift:[ChatGPT4MTevaluation] ErrorAnalysis Prompt for MT Evaluation in ChatGPT

Language:PythonStargazers:86Issues:0Issues:0

ChatGPT-vs.-BERT

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

Language:PythonStargazers:193Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:9020Issues:0Issues:0

server

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Language:PythonLicense:BSD-3-ClauseStargazers:7782Issues:0Issues:0

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6435Issues:0Issues:0

RWKV-LM

RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

Language:PythonLicense:Apache-2.0Stargazers:12002Issues:0Issues:0

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language:PythonLicense:MITStargazers:9813Issues:0Issues:0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:45610Issues:0Issues:0
Language:PythonStargazers:589Issues:0Issues:0

BlenderProc

A procedural Blender pipeline for photorealistic training image generation

Language:PythonLicense:GPL-3.0Stargazers:2646Issues:0Issues:0

safe-rules

详细的C/C++编程规范指南,由360质量工程部编著,适用于桌面、服务端及嵌入式软件系统。

License:Apache-2.0Stargazers:2212Issues:0Issues:0

ChatGPT

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Language:RustStargazers:51734Issues:0Issues:0

ViTAE-VSA

The official repo for [ECCV'22] "VSA: Learning Varied-Size Window Attention in Vision Transformers"

Language:PythonStargazers:154Issues:0Issues:0