Lujun Zhao (egrcc)



Company: Alibaba Group

Location: Hangzhou, Zhejiang, China


Lujun Zhao's starred repositories

guidance

A guidance language for controlling large language models.

Language: Jupyter Notebook | License: MIT | Stargazers: 17646 | Issues: 115 | Issues: 452

awesome-english-ebooks

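Free downloads of English-language magazines such as The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats, updated weekly.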

Language: HTML | Stargazers: 16861 | Issues: 445 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 16822 | Issues: 153 | Issues: 1306

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 8888 | Issues: 94 | Issues: 613

lean-side-bussiness

Lean Side Business: how programmers can run a side business gracefully

LMFlow

An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.

Language: Python | License: Apache-2.0 | Stargazers: 8054 | Issues: 72 | Issues: 384

starcoder

Home of StarCoder: fine-tuning & inference!

Language: Python | License: Apache-2.0 | Stargazers: 7140 | Issues: 68 | Issues: 140

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language: Python | License: MIT | Stargazers: 5578 | Issues: 62 | Issues: 139

CogVLM

A state-of-the-art open visual language model | multimodal pretrained model

Language: Python | License: Apache-2.0 | Stargazers: 5231 | Issues: 67 | Issues: 379

camel

🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Language Model Society (NeurIPS'2023) https://www.camel-ai.org

Language: Python | License: Apache-2.0 | Stargazers: 4521 | Issues: 56 | Issues: 216

GodMode

AI Chat Browser: Fast, Full webapp access to ChatGPT / Claude / Bard / Bing / Llama2! I use this 20 times a day.

Language: TypeScript | License: MIT | Stargazers: 4041 | Issues: 36 | Issues: 159

VisualGLM-6B

Chinese and English multimodal conversational language model

Language: Python | License: Apache-2.0 | Stargazers: 3977 | Issues: 41 | Issues: 343

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 3907 | Issues: 45 | Issues: 354

llm-foundry

LLM training code for Databricks foundation models

Language: Python | License: Apache-2.0 | Stargazers: 3743 | Issues: 45 | Issues: 356

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 3205 | Issues: 30 | Issues: 750

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 2920 | Issues: 60 | Issues: 86

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 2727 | Issues: 35 | Issues: 162

exllama

A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

Language: Python | License: MIT | Stargazers: 2624 | Issues: 36 | Issues: 219

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 2479 | Issues: 30 | Issues: 146

mPLUG-Owl

mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model

Language: Python | License: MIT | Stargazers: 1968 | Issues: 27 | Issues: 204

MetaTransformer

Meta-Transformer for Unified Multimodal Learning

Language: Python | License: Apache-2.0 | Stargazers: 1447 | Issues: 22 | Issues: 63

Awesome-GPT-Store

Custom GPT Store - A collection of major GPTs available to the public

awesome-gpts

Collection of all the GPTs created by the community

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: HTML | License: NOASSERTION | Stargazers: 1159 | Issues: 24 | Issues: 67

mPLUG-DocOwl

mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding

Language: Python | License: Apache-2.0 | Stargazers: 987 | Issues: 26 | Issues: 68

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python | License: MIT | Stargazers: 872 | Issues: 9 | Issues: 17

imgpilot

Turn the draft into amazing artwork with the power of Real-Time Latent Consistency Model

Language: TypeScript | License: MIT | Stargazers: 526 | Issues: 6 | Issues: 3

LLaVAR

Code/Data for the paper: "LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding"

Language: Python | License: Apache-2.0 | Stargazers: 237 | Issues: 5 | Issues: 20

chatgpt-web

A ChatGPT demo web page built with Express and Vue3

Language: Vue | License: MIT | Stargazers: 94 | Issues: 0 | Issues: 0