Liangyu Chen's repositories
random_hacks
Random hacks that I need to keep handy
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
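A minimal usage sketch of the library's pipeline API (the model fetched on first use is whatever default the library ships; a PyTorch backend is assumed):

    # Minimal sketch of the 🤗 Transformers pipeline API.
    from transformers import pipeline

    # Downloads a default sentiment-analysis model on first use.
    classifier = pipeline("sentiment-analysis")
    print(classifier("Speculative decoding makes inference fast."))
    # e.g. [{'label': 'POSITIVE', 'score': 0.99}]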
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Emu
Emu: An Open Multimodal Generalist
fast-stable-diffusion
fast-stable-diffusion: 25-50% speed increase, memory-efficient, with DreamBooth support
gpt-neox
An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.
litellm
Call all LLM APIs using the OpenAI format. Use Azure, OpenAI, Cohere, Anthropic, Ollama, vLLM, SageMaker, Hugging Face, Replicate (100+ LLMs)
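A minimal sketch of the OpenAI-format call, assuming the relevant provider key (e.g. OPENAI_API_KEY) is set in the environment; the model name here is only an example:

    # Minimal sketch: one OpenAI-format call routed through litellm.
    from litellm import completion

    response = completion(
        model="gpt-3.5-turbo",  # example model; swap in any supported provider/model
        messages=[{"role": "user", "content": "Hello, world"}],
    )
    print(response.choices[0].message.content)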
llama-recipes
Scripts for fine-tuning Meta Llama 3 with composable FSDP & PEFT methods covering single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A, plus a number of candidate inference solutions such as HF TGI and vLLM for local or cloud deployment. Includes demo apps showcasing Meta Llama 3 for WhatsApp & Messenger.
llama3
The main Llama 3 GitHub site; will be moved under Meta-Llama.
LLaVA
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
LLMSpeculativeSampling
Fast inference from large language models via speculative decoding
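For context, a toy, self-contained sketch of the idea in its greedy form; the repository itself implements the probabilistic accept/reject scheme from the speculative sampling papers, and the stand-in "models" below are hypothetical functions, not the repo's interface:

    from typing import Callable, List

    Token = int
    # Toy stand-in: a "model" maps a token prefix to its next greedy token.
    Model = Callable[[List[Token]], Token]

    def speculative_decode(draft: Model, target: Model,
                           prefix: List[Token], k: int, max_new: int) -> List[Token]:
        out = list(prefix)
        while len(out) - len(prefix) < max_new:
            # 1. The cheap draft model proposes k tokens autoregressively.
            proposal, ctx = [], list(out)
            for _ in range(k):
                t = draft(ctx)
                proposal.append(t)
                ctx.append(t)
            # 2. The target model verifies each proposed position. In practice
            #    this is one batched forward pass, which is where the speedup
            #    comes from; here we loop for clarity.
            n_accept = 0
            for i in range(k):
                if target(out + proposal[:i]) == proposal[i]:
                    n_accept += 1
                else:
                    break
            out.extend(proposal[:n_accept])
            # 3. On a mismatch (or after full acceptance) the target contributes
            #    one token of its own, so progress is always made.
            out.append(target(out))
        return out[:len(prefix) + max_new]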
Megatron-DeepSpeed
Ongoing research on training transformer language models at scale, including BERT & GPT-2
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
open_flamingo
An open-source framework for training large multimodal models.
optimum
🚀 Accelerate training and inference of 🤗 Transformers and 🤗 Diffusers with easy-to-use hardware optimization tools
visitor-badge
A badge generator service that counts visitors to your Markdown file.
visual-chatgpt
Visual ChatGPT: talking, drawing, and editing with visual foundation models
yang-song.github.io
Personal website