rain305f

followers

following

stars

Peking University

shenzhen

https://rain305f.github.io/

rain's starred repositories

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024770 193 3942

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.04440 62 177

pytorch-fid

Compute FID scores with PyTorch.

Language:PythonApache-2.03295 15 85

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonMIT2237 31 33

Awesome-Federated-Machine-Learning

Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language:PythonApache-2.01255 20 28

conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Language:PythonMIT990 13 43

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonMIT777 15 101

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:Python767 11 22

Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language:PythonAGPL-3.0200 10 21

common_metrics_on_video_quality

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

Language:Python183 1 15

MMA-Diffusion

[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models

Language:PythonNOASSERTION123 5 11

text2image_safety

Language:PythonMIT117 2 6

Awesome-MLLM-Safety

Accepted by IJCAI-24 Survey Track

Language:PythonMIT98 70

Open-Sora-Dataset

Language:Python91 8 6

Duolando

Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"

Language:Python87 6 2

Uniaa

Unified Multi-modal IAA Baseline and Benchmark

curiosity_redteam

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Language:Jupyter NotebookMIT53 5 5

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonApache-2.03200

Awesome-T2I-safety-Papers

List of T2I safety papers, updated daily, welcome to discuss using Discussions

MIT2600

AdaShield

[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."

Language:Python26 1 2

Universal-Prompt-Injection

The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".

Language:Python26 2 2

JailBreakV_28K

JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and further assess the robustness and safety of MLLMs against a variety of jailbreak attacks.

Language:Python22 1 3

unsafe-diffusion

Language:Python21 2 1

ActiveGCD

Code for our CVPR 2024 paper "Active Generalized Category Discovery"

Language:PythonMIT20 30

RIATIG

Language:PythonMIT20 1 3

TextGrad

Language:Python18 1 1

Multimodal-Roadmap-for-freshman

本项目用于Multimodal领域新手的学习路线，包括该领域的经典论文，项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知，能够自己进行的独立研究。

FedoSSL

code for paper "Towards Unbiased Training in Federated Open-world Semi-supervised Learning"

Language:Python1000

Divide-and-Conquer-Attack

Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode

Language:Python800