Yang Yang (YangYangGirl)

Company: Australian National University | Shanghai AI Lab | SenseTime | SmartMore | HUST

Location: Canberra, Australia

Home Page: https://yangyanggirl.github.io/

Twitter: @YangYangSuper

Yang Yang's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35190 | Issues: 345 | Issues: 1698

DragGAN

Unofficial implementation of DragGAN - "Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold" (full-featured DragGAN implementation with an online demo and local deployment; code and models are fully open source; supports Windows, macOS, and Linux)

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4383 | Issues: 34 | Issues: 188

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python | License: NOASSERTION | Stargazers: 4072 | Issues: 46 | Issues: 380

InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language: Python | License: Apache-2.0 | Stargazers: 3154 | Issues: 43 | Issues: 49

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 2784 | Issues: 37 | Issues: 176

FaceForensics

GitHub repository of the FaceForensics dataset

Language: Python | License: NOASSERTION | Stargazers: 2282 | Issues: 73 | Issues: 81

LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

LLaVA-Med

Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.

Language: Python | License: NOASSERTION | Stargazers: 1226 | Issues: 25 | Issues: 69

alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1218 | Issues: 9 | Issues: 116

TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

Language: Python | License: MIT | Stargazers: 673 | Issues: 17 | Issues: 18

ama_prompting

Ask Me Anything language model prompting

Language: Python | License: Apache-2.0 | Stargazers: 531 | Issues: 24 | Issues: 5

Multi-Modality-Arena

Chatbot Arena meets multi-modality! Multi-Modality Arena allows you to benchmark vision-language models side-by-side while providing images as inputs. Supports MiniGPT-4, LLaMA-Adapter V2, LLaVA, BLIP-2, and many more!

self-correction-llm-papers

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

SelfBlendedImages

[CVPR 2022 Oral] Detecting Deepfakes with Self-Blended Images https://arxiv.org/abs/2204.08376

Language: Python | License: NOASSERTION | Stargazers: 182 | Issues: 7 | Issues: 44

AesBench

An expert benchmark aiming to comprehensively evaluate the aesthetic perception capacities of MLLMs.

Language: Python | License: Apache-2.0 | Stargazers: 174 | Issues: 4 | Issues: 4

Awesome-Deepfake-Generation-and-Detection

A Survey on Deepfake Generation and Detection

tifa

TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering

Language: Python | License: Apache-2.0 | Stargazers: 115 | Issues: 3 | Issues: 5

FACTOR

Detecting Deepfakes Without Seeing Any

Language: Python | License: NOASSERTION | Stargazers: 101 | Issues: 2 | Issues: 7

FTCN

[Official] Exploring Temporal Coherence for More General Video Face Forgery Detection (ICCV 2021)

Language: Python | Stargazers: 91 | Issues: 0 | Issues: 12

DSG

Davidsonian Scene Graph (DSG) for Text-to-Image Evaluation (ICLR 2024)

Language: Jupyter Notebook | Stargazers: 59 | Issues: 3 | Issues: 3

pacscore

Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation. CVPR 2023

caltech-pedestrian-dataset-to-yolo-format-converter

Converts the Caltech Pedestrian Dataset to the format that YOLO uses

DDM-Public

Code for the paper "Decoupled Diffusion Models: Image to Zero and Zero to Noise"

Divide-Evaluate-and-Refine

Repo for our NeurIPS 2023 paper on: Divide, Evaluate, and Refine: Evaluating and Improving Text-to-Image Alignment with Iterative VQA Feedback

Language: Jupyter Notebook | License: MIT | Stargazers: 23 | Issues: 1 | Issues: 4

BoS

[ICLR 2024 Spotlight] Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments

Language: Python | Stargazers: 16 | Issues: 0 | Issues: 0

Lip-Extract

This is an unofficial repo for "Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery Detection" (CVPR 2021)