Ma Jiajian (497662892)

497662892

Geek Repo

Location:Shenzhen, Guangdong Province

Github PK Tool:Github PK Tool

Ma Jiajian's starred repositories

awesome-ai-residency

List of AI Residency Programs

Stargazers:3039Issues:0Issues:0

motion-latent-diffusion

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model

Language:PythonLicense:MITStargazers:555Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1558Issues:0Issues:0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1864Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5809Issues:0Issues:0

Magic-Me

Codes for ID-Specific Video Customized Diffusion

Language:PythonLicense:Apache-2.0Stargazers:444Issues:0Issues:0

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonLicense:Apache-2.0Stargazers:871Issues:0Issues:0

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6573Issues:0Issues:0

edm

Elucidating the Design Space of Diffusion-Based Generative Models (EDM)

Language:PythonLicense:NOASSERTIONStargazers:1232Issues:0Issues:0

FreeInit

[ECCV 2024] FreeInit: Bridging Initialization Gap in Video Diffusion Models

Language:PythonLicense:MITStargazers:462Issues:0Issues:0

text2cinemagraph

Text2Cinemagraph: Text-Guided Synthesis of Eulerian Cinemagraphs [SIGGRAPH ASIA 2023]

Language:PythonLicense:MITStargazers:359Issues:0Issues:0

Video-Motion-Customization

VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models (CVPR 2024)

Language:PythonLicense:Apache-2.0Stargazers:154Issues:0Issues:0

VGen

Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models

Language:PythonStargazers:2818Issues:0Issues:0

PIA

[CVPR 2024] PIA, your Personalized Image Animator. Animate your images by text prompt, combing with Dreambooth, achieving stunning videos. PIA,你的个性化图像动画生成器,利用文本提示将图像变为奇妙的动画

Language:PythonLicense:Apache-2.0Stargazers:842Issues:0Issues:0

LivePhoto

Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control

License:MITStargazers:170Issues:0Issues:0

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:9058Issues:0Issues:0

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3863Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonStargazers:22239Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10650Issues:0Issues:0

facechain

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8791Issues:0Issues:0

I2V-Adapter-repo

I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models

Stargazers:183Issues:0Issues:0

LAMP

Official implement code of LAMP: Learn a Motion Pattern by Few-Shot Tuning a Text-to-Image Diffusion Model (Few-shot-based text-to-video diffusion)

Language:PythonLicense:NOASSERTIONStargazers:245Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:222Issues:0Issues:0

MedSAM

Segment Anything in Medical Images

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2529Issues:0Issues:0

WSI-HGNN

[CVPR'23] Histopathology Whole Slide Image Analysis with Heterogeneous Graph Representation Learning

Language:PythonStargazers:63Issues:0Issues:0

EndoGS

EndoGS: Deformable Endoscopic Tissues Reconstruction with Gaussian Splatting

Language:PythonStargazers:94Issues:0Issues:0

SAM-Med3D

SAM-Med3D: An Efficient General-purpose Promptable Segmentation Model for 3D Volumetric Medical Image

Language:PythonLicense:Apache-2.0Stargazers:433Issues:0Issues:0

dsmil-wsi

DSMIL: Dual-stream multiple instance learning networks for tumor detection in Whole Slide Image

Language:PythonLicense:MITStargazers:340Issues:0Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2998Issues:0Issues:0

XMem

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Language:PythonLicense:MITStargazers:1684Issues:0Issues:0