KaiyangZhou

followers

following

stars

https://kaiyangzhou.github.io

Organizations

The-AI-Talks

Kaiyang's starred repositories

stable-diffusion

A latent text-to-image diffusion model

Language:Jupyter NotebookNOASSERTION65269 552 689

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonApache-2.033682 338 1627

stanford_alpaca

Code and documentation to train Stanford's Alpaca models, and generate the data.

Language:PythonApache-2.028743 336 266

latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:Jupyter NotebookMIT10521 95 321

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause8690 93 599

metaseq

Repo for external large-scale work

Language:PythonMIT6382 109 292

dm_control

Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonApache-2.03539 127 390

Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Language:PythonMIT3441 100 158

best_AI_papers_2022

A curated list of the latest breakthroughs in AI (in 2022) by release date with a clear video explanation, link to a more in-depth article, and code.

WeightWatcher

The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

Language:PythonApache-2.01391 32 231

CVinW_Readings

A collection of papers on the topic of ``Computer Vision in the Wild (CVinW)''

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookNOASSERTION675 14 20

omnivore

Omnivore: A Single Model for Many Visual Modalities

Language:PythonNOASSERTION544 19 31

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language:PythonMIT384 6 86

deepkit-ml

The collaborative real-time open-source machine learning devtool and training suite: Experiment execution, tracking, and debugging. With server and project management tools.

Language:TypeScriptMIT365 19 23

MultiModal-DeepFake

[TPAMI 2024 & CVPR 2023] PyTorch code for DGM4: Detecting and Grounding Multi-Modal Media Manipulation and beyond

Language:PythonNOASSERTION262 3 26

diffmimic

[ICLR 2023] DiffMimic: Efficient Motion Mimicking with Differentiable Physics https://arxiv.org/abs/2304.03274

Language:PythonNOASSERTION257 12 3

ContextDET

Contextual Object Detection with Multimodal Large Language Models

NOASSERTION155 13 4

visual_prompt_retrieval

[NeurIPS2023] Official implementation and model release of the paper "What Makes Good Examples for Visual In-Context Learning?"

Language:PythonCC0-1.0155 4 10

perception_test

Language:Jupyter NotebookApache-2.0146 10 19

CuPL

Language:Python143 2 7

MaskPoint

[ECCV 2022] Masked Discrimination for Self-Supervised Learning on Point Clouds

Language:PythonBSD-3-Clause85 8 8

UPT

ood_bench

Language:Python45 3 5

OoD-Bench

Language:PythonMIT38 1 6

on-device-dg

On-Device Domain Generalization

Language:PythonNOASSERTION37 2 2

GVRT

[ECCV-2022]Grounding Visual Representations with Texts for Domain Generalization

Language:Python32 1 7

pi-Tuning

Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.

Language:PythonNOASSERTION31 5 1

SEVERE-BENCHMARK

Language:Python22 2 1

Low-Shot-Robustness

Code for the ICCV 2023 paper "Benchmarking Low-Shot Robustness to Natural Distribution Shifts"

Language:PythonNOASSERTION12 4 1