YangYangGirl

Yang Yang's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python · Apache-2.0 · 35190 345 1698

DragGAN

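Unofficial Implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment for trial use; code and models are fully open source; supports Windows, macOS, Linux)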

Language: Python · 4993 66 112

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook · BSD-3-Clause · 4383 34 188

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python · NOASSERTION · 4072 46 380

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python · Apache-2.0 · 3154 43 49

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python · MIT · 2784 37 176

FaceForensics

GitHub repository of the FaceForensics dataset

Language: Python · NOASSERTION · 2282 73 81

LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

1287 10 17

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: Python · NOASSERTION · 1226 25 69

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook · Apache-2.0 · 1218 9 116

TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Language: Python · MIT · 673 17 18

ama_prompting

Ask Me Anything language model prompting

Language: Python · Apache-2.0 · 531 24 5

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

Language: Python · 394 6 18

self-correction-llm-papers

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

Apache-2.0 · 332 11 1

SelfBlendedImages

[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376

Language: Python · NOASSERTION · 182 7 44

AesBench

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Language: Python · Apache-2.0 · 174 4 4

Awesome-Deepfake-Generation-and-Detection

A Survey on Deepfake Generation and Detection

128 70

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python · Apache-2.0 · 115 3 5

FACTOR

Detecting Deepfakes Without Seeing Any

Language: Python · NOASSERTION · 101 2 7

FTCN

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection (ICCV 2021)

Language: Python · 91 0 12

DSG

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

Language: Jupyter Notebook · 59 3 3

antispoofing

Language: Python · MIT · 56 2 6

TALL4Deepfake

Language: Python · MIT · 53 4 12

pacscore

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023

Language: Python · 49 6 6

caltech-pedestrian-dataset-to-yolo-format-converter

Converts the Caltech Pedestrian Dataset to the format that YOLO uses

Language: Python · 38 2 2

DDM-Public

Code for the paper "Decoupled Diffusion Models: Image to Zero and Zero to Noise"

Language: Python · 34 1 3

Divide-Evaluate-and-Refine

Repo for our NeurIPS 2023 paper "Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback"

Language: Jupyter Notebook · MIT · 23 1 4

LPCV_2023_solution

Language: Python · 18 2 1

BoS

[ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments

Language: Python · 16 0 0

Lip-Extract

Unofficial repo for "Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection" (CVPR 2021)

Language: Python · 2 1 1