Wang Yabin's starred repositories

tgif-dataset

Text-Guided Inpainting Forgery Dataset

License: CC-BY-SA-4.0 | Stargazers: 1 | Issues: 0

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language: Python | License: Apache-2.0 | Stargazers: 1271 | Issues: 0

lobe-chat

🤯 Lobe Chat - an open-source, modern-design LLM/AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity), multi-modal features (Vision/TTS), and a plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language: TypeScript | License: NOASSERTION | Stargazers: 35491 | Issues: 0

deepfloyd_if_lab

A notebook-based web UI for DeepFloyd IF

Language: Python | License: BSD-3-Clause | Stargazers: 24 | Issues: 0

sd-webui-reactor

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Language: Python | License: AGPL-3.0 | Stargazers: 2329 | Issues: 0

DRCT

The official code of "DRCT: Diffusion Reconstruction Contrastive Training towards Universe Detection of Diffusion Generated Images"

Stargazers: 12 | Issues: 0

SyntheticImagesAnalysis

Synthetic Images Analysis

Language: Python | License: NOASSERTION | Stargazers: 27 | Issues: 0

cog-leditsplusplus

Cog wrapper to serve LEdits++ as an API

Language: Python | License: NOASSERTION | Stargazers: 2 | Issues: 0

UnbiasedGenImage

Corresponding Code to the Paper "Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets"

Language: Python | Stargazers: 8 | Issues: 0

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language: Python | Stargazers: 753 | Issues: 0

scaling_on_scales

When do we not need larger vision models?

Language: Python | License: MIT | Stargazers: 271 | Issues: 0

LASTED

Synthetic Image Detection

Language: Python | License: MIT | Stargazers: 50 | Issues: 0
Language: Python | License: MIT | Stargazers: 373 | Issues: 0

euge-trainer

An SDXL trainer modified from the kohya trainer.

Language: Python | Stargazers: 7 | Issues: 0
Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 2 | Issues: 0

AutoSplice_Dataset

AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics, WMF@CVPR2023

Stargazers: 35 | Issues: 0

server-bot-quick-start

Server bots for Poe

Language: Python | Stargazers: 15 | Issues: 0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3862 | Issues: 0

ect

Consistency Models Made Easy

Language: Python | Stargazers: 172 | Issues: 0

fastapi_poe

A helper library for writing Poe API bots using FastAPI

Language: Python | License: Apache-2.0 | Stargazers: 116 | Issues: 0

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language: Python | License: MIT | Stargazers: 583 | Issues: 0

sdeval

Evaluation for stable diffusion model training

Language: Python | License: Apache-2.0 | Stargazers: 13 | Issues: 0

MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Language: Python | License: MIT | Stargazers: 325 | Issues: 0

Poe-Telegram-Chatbot

A Telegram chatbot built on the official Poe API, mainly calling the GPT-4 and Claude-3-Opus models.

Language: Python | Stargazers: 164 | Issues: 0

Full-Segment-Anything

A PyTorch implementation that adds new features to the Segment-Anything code. The features support batch input on the full-grid prompt (automatic mask generation) with post-processing (removing duplicated or small regions and holes) under flexible input image sizes.

Language: Python | License: MIT | Stargazers: 121 | Issues: 0

clip-diy

Official implementation of the WACV 2024 paper CLIP-DIY

Language: Jupyter Notebook | Stargazers: 22 | Issues: 0

midjourney-api

midjourney in discord api.

Language: Python | Stargazers: 775 | Issues: 0

Image-Aesthetics-and-Quality-Assessment

[ACMMM 2023, Official Code] for paper "EAT: An Enhancer for Aesthetics-Oriented Transformers". Official Weights and Demos provided. Currently one of the strongest open-source aesthetics assessment models.

Language: Python | Stargazers: 82 | Issues: 0

Image-Color-Aesthetics-and-Quality-Assessment

[ICCV 2023, Official Code] for paper "Thinking Image Color Aesthetics Assessment: Models, Datasets and Benchmarks". Official Weights and Demos provided. The first dataset, algorithm, and benchmark for subjective aesthetic assessment of image color.

Language: Python | Stargazers: 119 | Issues: 0