Yongming Rao's starred repositories

open-interpreter

A natural language interface for computers

Language: Python | License: AGPL-3.0 | Stargazers: 42175 | Issues: 317 | Issues: 747

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language: Python | License: Apache-2.0 | Stargazers: 12295 | Issues: 96 | Issues: 1023

WizardLM

LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath

Yi

A series of large language models trained from scratch by developers @01-ai

Language: Python | License: Apache-2.0 | Stargazers: 7412 | Issues: 112 | Issues: 287

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language: Python | License: Apache-2.0 | Stargazers: 7240 | Issues: 113 | Issues: 148

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6333 | Issues: 61 | Issues: 77

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language: Python | License: AGPL-3.0 | Stargazers: 4481 | Issues: 46 | Issues: 162

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4188 | Issues: 47 | Issues: 386

Baichuan2

A series of large language models developed by Baichuan Intelligent Technology

Language: Python | License: Apache-2.0 | Stargazers: 4023 | Issues: 40 | Issues: 386

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 3093 | Issues: 21 | Issues: 392

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language: Python | License: Apache-2.0 | Stargazers: 3081 | Issues: 29 | Issues: 958

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 3028 | Issues: 60 | Issues: 88

consistencydecoder

Consistency Distilled Diff VAE

Language: Python | License: MIT | Stargazers: 2095 | Issues: 23 | Issues: 19

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language: Python | License: BSD-3-Clause | Stargazers: 2069 | Issues: 22 | Issues: 301

InternLM-XComposer

InternLM-XComposer2 is a groundbreaking vision-language large model (VLLM) excelling in free-form text-image composition and comprehension.

viper

Code for the paper "ViperGPT: Visual Inference via Python Execution for Reasoning"

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1631 | Issues: 88 | Issues: 46

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language: Python | License: Apache-2.0 | Stargazers: 1555 | Issues: 29 | Issues: 66

AutoAWQ

AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:

Language: Python | License: MIT | Stargazers: 1378 | Issues: 10 | Issues: 324

SoM

Set-of-Mark Prompting for LMMs

Language: Python | License: MIT | Stargazers: 1006 | Issues: 21 | Issues: 33

Xwin-LM

Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment

MeZO

[NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333

Language: Python | License: MIT | Stargazers: 993 | Issues: 19 | Issues: 31

text2room

Text2Room generates textured 3D meshes from a given text prompt using 2D text-to-image models (ICCV2023).

Language: Python | License: MIT | Stargazers: 983 | Issues: 10 | Issues: 31

awesome-language-agents

List of language agents based on the paper "Cognitive Architectures for Language Agents"

objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Language: Python | License: Apache-2.0 | Stargazers: 623 | Issues: 8 | Issues: 42

LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

Driving-with-LLMs

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Language: Python | License: Apache-2.0 | Stargazers: 358 | Issues: 16 | Issues: 22

InstructDiffusion

PyTorch implementation of InstructDiffusion, a unifying and generic framework for aligning computer vision tasks with human instructions.

Language: Python | License: NOASSERTION | Stargazers: 350 | Issues: 10 | Issues: 18

laion-3d

Collect a large 3D dataset and build models

MM-Vet

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities (ICML 2024)

Language: Python | License: Apache-2.0 | Stargazers: 195 | Issues: 2 | Issues: 5

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python | License: Apache-2.0 | Stargazers: 116 | Issues: 3 | Issues: 5