Kaiyang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:65269Issues:552Issues:689

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:33682Issues:338Issues:1627

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonLicense:Apache-2.0Stargazers:28743Issues:336Issues:266

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:10521Issues:95Issues:321

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:8690Issues:93Issues:599

metaseq

Repo for external large-scale work

Language:PythonLicense:MITStargazers:6382Issues:109Issues:292

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonLicense:Apache-2.0Stargazers:3539Issues:127Issues:390

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonLicense:MITStargazers:3441Issues:100Issues:158

best_AI_papers_2022

A curated list of the latest breakthroughs in AI (in 2022) by release date with a clear video explanation, link to a more in-depth article, and code.

WeightWatcher

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Language:PythonLicense:Apache-2.0Stargazers:1391Issues:32Issues:231

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:675Issues:14Issues:20

omnivore

Omnivore: A Single Model for Many Visual Modalities

Language:PythonLicense:NOASSERTIONStargazers:544Issues:19Issues:31

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language:PythonLicense:MITStargazers:384Issues:6Issues:86

deepkit-ml

The collaborative real-time open-source machine learning devtool and training suite: Experiment execution, tracking, and debugging. With server and project management tools.

Language:TypeScriptLicense:MITStargazers:365Issues:19Issues:23

MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Language:PythonLicense:NOASSERTIONStargazers:262Issues:3Issues:26

diffmimic

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Language:PythonLicense:NOASSERTIONStargazers:257Issues:12Issues:3

ContextDET

Contextual Object Detection with Multimodal Large Language Models

visual_prompt_retrieval

[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"

Language:PythonLicense:CC0-1.0Stargazers:155Issues:4Issues:10
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:146Issues:10Issues:19

MaskPoint

[ECCV 2022] Masked Discrimination for Self-Supervised Learning on Point Clouds

Language:PythonLicense:BSD-3-ClauseStargazers:85Issues:8Issues:8
Language:PythonLicense:MITStargazers:38Issues:1Issues:6

on-device-dg

On-Device Domain Generalization

Language:PythonLicense:NOASSERTIONStargazers:37Issues:2Issues:2

GVRT

[ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization

pi-Tuning

Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.

Language:PythonLicense:NOASSERTIONStargazers:31Issues:5Issues:1

Low-Shot-Robustness

Code for the ICCV 2023 paper "Benchmarking Low-Shot Robustness to Natural Distribution Shifts"

Language:PythonLicense:NOASSERTIONStargazers:12Issues:4Issues:1