Yuechen (JulianJuaner)

JulianJuaner

Geek Repo

Company:CUHK, SmartMore

Location:Hong Kong SAR

Home Page:julianjuaner.github.io

Github PK Tool:Github PK Tool

Yuechen's starred repositories

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookLicense:MITStargazers:863Issues:0Issues:0

LLM-Shearing

[ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning

Language:PythonLicense:MITStargazers:509Issues:0Issues:0

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language:PythonLicense:AGPL-3.0Stargazers:1533Issues:0Issues:0

SEED-X

Multimodal Models in Real World

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:348Issues:0Issues:0
Language:PythonStargazers:323Issues:0Issues:0

evolutionary-model-merge

Official repository of Evolutionary Optimization of Model Merging Recipes

Language:PythonLicense:Apache-2.0Stargazers:1126Issues:0Issues:0

MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Language:PythonLicense:GPL-3.0Stargazers:311Issues:0Issues:0

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:694Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3908Issues:0Issues:0
Language:PythonLicense:GPL-3.0Stargazers:322Issues:0Issues:0
Language:PythonStargazers:18Issues:0Issues:0

ComfyUI-Tripo

Custom nodes for using Tripo in ComfyUI.

Language:PythonLicense:MITStargazers:82Issues:0Issues:0
Language:PythonLicense:MITStargazers:4138Issues:0Issues:0

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1520Issues:0Issues:0

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonLicense:Apache-2.0Stargazers:871Issues:0Issues:0

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonLicense:Apache-2.0Stargazers:229Issues:0Issues:0

AnimateDiff

AnimationDiff with train

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:111Issues:0Issues:0

LivePhoto

Official implementations for paper: LivePhoto: Real Image Animation with Text-guided Motion Control

License:MITStargazers:171Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11086Issues:0Issues:0

Visual-CoT

Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Language:PythonLicense:Apache-2.0Stargazers:81Issues:0Issues:0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:3113Issues:0Issues:0

GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Language:PythonStargazers:664Issues:0Issues:0

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Language:PythonStargazers:1141Issues:0Issues:0

clarity-upscaler

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

Language:PythonLicense:AGPL-3.0Stargazers:3364Issues:0Issues:0
Language:PythonStargazers:29Issues:0Issues:0

pexels-crawler

The web crawler for pexels

Language:PythonLicense:MITStargazers:6Issues:0Issues:0

HD-VG-130M

The HD-VG-130M Dataset

Stargazers:101Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49236Issues:0Issues:0

GroupContrast

[CVPR 2024] GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D Understanding

License:MITStargazers:42Issues:0Issues:0

stable-diffusion-webui-wd14-tagger

Labeling extension for Automatic1111's Web UI

Language:PythonStargazers:558Issues:0Issues:0