iamwangyabin

Wang Yabin's starred repositories

tgif-dataset

Text-Guided Inpainting Forgery Dataset

CC-BY-SA-4.0100

ControlNetPlus

ControlNet++: All-in-one ControlNet for image generations and editing!

Language:PythonApache-2.0127100

🤯 Lobe Chat - an open-source, modern-design LLMs/AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Bedrock / Azure / Mistral / Perplexity ), Multi-Modals (Vision/TTS) and plugin system. One-click FREE deployment of your private ChatGPT chat application.

Language:TypeScriptNOASSERTION3549100

deepfloyd_if_lab

A notebook-based web UI for DeepFloyd IF

Language:PythonBSD-3-Clause2400

sd-webui-reactor

Fast and Simple Face Swap Extension for StableDiffusion WebUI (A1111 SD WebUI, SD WebUI Forge, SD.Next, Cagliostro)

Language:PythonAGPL-3.0232900

DRCT

The official code of "DRCT: Diffusion Reconstruction Contrastive Training towards Universe Detection of Diffusion Generated Images"

1200

Deepfake-Detection-without-Deepfakes-Generalization-via-Synthetic-Frequency-Patterns-Injection

Language:Python300

SyntheticImagesAnalysis

Synthetic Images Analysis

Language:PythonNOASSERTION2700

cog-leditsplusplus

Cog wrapper to serve LEdits++ as an API

Language:PythonNOASSERTION200

UnbiasedGenImage

Corresponding Code to the Paper "Fake or JPEG? Revealing Common Biases in Generated Image Detection Datasets"

Language:Python800

LLaVA-pp

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Language:Python75300

scaling_on_scales

When do we not need larger vision models?

Language:PythonMIT27100

LASTED

Synthetic Image Detection

Language:PythonMIT5000

ComfyUI_essentials

Language:PythonMIT37300

euge-trainer

A SDXL trainer modified from kohya trainer.

Language:Python700

M3Dsynth

Language:Jupyter NotebookNOASSERTION200

AutoSplice_Dataset

AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics, WMF@CVPR2023

3500

server-bot-quick-start

Server bots for Poe

Language:Python1500

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT386200

ect

Consistency Models Made Easy

Language:Python17200

fastapi_poe

A helper library for writing Poe API bots using FastAPI

Language:PythonApache-2.011600

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonMIT58300

sdeval

Evaluation for stable diffusion model training

Language:PythonApache-2.01300

MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Language:PythonMIT32500

Poe-Telegram-Chatbot

调用Poe官方API实现Telegram对话机器人，主要调用GPT-4和Claude-3-Opus模型。

Language:Python16400

Full-Segment-Anything

This is Pytorch Implementation Code for adding new features in code of Segment-Anything. Here, the features support batch-input on the full-grid prompt (automatic mask generation) with post-processing: removing duplicated or small regions and holes, under flexible input image size

Language:PythonMIT12100