Yang Yang's starred repositories
InternGPT
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
FaceForensics
Github of the FaceForensics dataset
LLM-eval-survey
The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
TimeSformer-pytorch
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
ama_prompting
Ask Me Anything language model prompting
Multi-Modality-Arena
Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!
self-correction-llm-papers
This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.
SelfBlendedImages
[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376
Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
caltech-pedestrian-dataset-to-yolo-format-converter
converts the format of the caltech pedestrian dataset to the format that yolo uses
DDM-Public
code for paper: Decoupled diffusion models: image to zero and zero to noise
Divide-Evaluate-and-Refine
Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback
Lip-Extract
This is Unofficial Repo. Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection (CVPR 2021)