rain (rain305f)

rain305f

Geek Repo

Company:Peking University

Location:shenzhen

Home Page:https://rain305f.github.io/

Github PK Tool:Github PK Tool

rain's starred repositories

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24770Issues:193Issues:3942

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonLicense:Apache-2.0Stargazers:4440Issues:62Issues:177

pytorch-fid

Compute FID scores with PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:3295Issues:15Issues:85

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2237Issues:31Issues:33

Awesome-Federated-Machine-Learning

Everything about federated learning, including research papers, books, codes, tutorials, videos and beyond

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language:PythonLicense:Apache-2.0Stargazers:1255Issues:20Issues:28

conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

Language:PythonLicense:MITStargazers:990Issues:13Issues:43

FaceFormer

[CVPR 2022] FaceFormer: Speech-Driven 3D Facial Animation with Transformers

Language:PythonLicense:MITStargazers:777Issues:15Issues:101

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Anti-DreamBooth

Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)

Language:PythonLicense:AGPL-3.0Stargazers:200Issues:10Issues:21

common_metrics_on_video_quality

You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.

MMA-Diffusion

[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:123Issues:5Issues:11

Awesome-MLLM-Safety

Accepted by IJCAI-24 Survey Track

Language:PythonLicense:MITStargazers:98Issues:7Issues:0

Duolando

Code for ICLR 2024 paper "Duolando: Follower GPT with Off-Policy Reinforcement Learning for Dance Accompaniment"

Uniaa

Unified Multi-modal IAA Baseline and Benchmark

curiosity_redteam

Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizXgXU)

Language:Jupyter NotebookLicense:MITStargazers:53Issues:5Issues:5

FineR

[ICLR'24] Democratizing Fine-grained Visual Recognition with Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:32Issues:0Issues:0

Awesome-T2I-safety-Papers

List of T2I safety papers, updated daily, welcome to discuss using Discussions

License:MITStargazers:26Issues:0Issues:0

AdaShield

[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shield Prompting."

Universal-Prompt-Injection

The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".

JailBreakV_28K

JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and further assess the robustness and safety of MLLMs against a variety of jailbreak attacks.

ActiveGCD

Code for our CVPR 2024 paper "Active Generalized Category Discovery"

Language:PythonLicense:MITStargazers:20Issues:3Issues:0
Language:PythonLicense:MITStargazers:20Issues:1Issues:3

Multimodal-Roadmap-for-freshman

本项目用于Multimodal领域新手的学习路线,包括该领域的经典论文,项目及课程。旨在希望学习者在一定的时间内达到对这个领域有较为深刻的认知,能够自己进行的独立研究。

FedoSSL

code for paper "Towards Unbiased Training in Federated Open-world Semi-supervised Learning"

Language:PythonStargazers:10Issues:0Issues:0

Divide-and-Conquer-Attack

Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode

Language:PythonStargazers:8Issues:0Issues:0