Yuechen (JulianJuaner)

Company: CUHK, SmartMore

Location: Hong Kong SAR

Home Page: julianjuaner.github.io

Yuechen's starred repositories

multidiffusion-upscaler-for-automatic1111

Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0

Language: Python | License: NOASSERTION | Stargazers: 4651 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5420 | Issues: 0

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV 2024

Language: Python | License: Apache-2.0 | Stargazers: 428 | Issues: 0

LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Language: Python | License: Apache-2.0 | Stargazers: 661 | Issues: 0

StableSR

[IJCV 2024] Exploiting Diffusion Prior for Real-World Image Super-Resolution

Language: Python | License: NOASSERTION | Stargazers: 2032 | Issues: 0

cross-image-attention

Official Implementation for "Cross-Image Attention for Zero-Shot Appearance Transfer"

Language: Python | License: MIT | Stargazers: 293 | Issues: 0

ScaleCrafter

[ICLR 2024 Spotlight] Official implementation of ScaleCrafter for higher-resolution visual generation at inference time.

Language: Python | Stargazers: 472 | Issues: 0

Uni-ControlNet

[NeurIPS 2023] Uni-ControlNet: All-in-One Control to Text-to-Image Diffusion Models

Language: Python | License: MIT | Stargazers: 563 | Issues: 0

MVDream

Multi-view Diffusion for 3D Generation

Language: Python | License: MIT | Stargazers: 743 | Issues: 0

neuraltalk2

Efficient Image Captioning code in Torch, runs on GPU

Language: Jupyter Notebook | Stargazers: 5491 | Issues: 0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python | License: NOASSERTION | Stargazers: 2653 | Issues: 0

zero123plus

Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.

Language: Python | License: Apache-2.0 | Stargazers: 1624 | Issues: 0

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

License: MIT | Stargazers: 5308 | Issues: 0

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language: Python | License: MIT | Stargazers: 6405 | Issues: 0

SEED-Bench

(CVPR 2024) A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Language: Python | License: NOASSERTION | Stargazers: 286 | Issues: 0

InternLM-XComposer

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Language: Python | License: Apache-2.0 | Stargazers: 2349 | Issues: 0

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

Language: Python | License: BSD-3-Clause | Stargazers: 3129 | Issues: 0

dreamgaussian

[ICLR 2024 Oral] Generative Gaussian Splatting for Efficient 3D Content Creation

Language: Python | License: MIT | Stargazers: 3819 | Issues: 0

DreamLLM

[ICLR 2024 Spotlight] DreamLLM: Synergistic Multimodal Comprehension and Creation

Language: Python | License: Apache-2.0 | Stargazers: 361 | Issues: 0

rich-text-to-image

Rich-Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 750 | Issues: 0

SA-1B-Downloader

Simple script to parallelize downloading and extracting files for the SA-1B Dataset.

Language: Python | License: MIT | Stargazers: 24 | Issues: 0

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python | License: MIT | Stargazers: 404 | Issues: 0

DQTrack

Official PyTorch implementation of End-to-end 3D Tracking with Decoupled Queries [ICCV 2023]

Language: Python | License: NOASSERTION | Stargazers: 52 | Issues: 0

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language: Python | License: Apache-2.0 | Stargazers: 2568 | Issues: 0

IETrans-SGG.pytorch

This is the code for the ECCV 2022 (Oral) paper "Fine-Grained Scene Graph Generation with Data Transfer".

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 88 | Issues: 0

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | Stargazers: 6068 | Issues: 0

ARC-AGI

The Abstraction and Reasoning Corpus

Language: JavaScript | License: Apache-2.0 | Stargazers: 3164 | Issues: 0

ComfyUI_examples

Examples of ComfyUI workflows

Language: HTML | License: NOASSERTION | Stargazers: 1439 | Issues: 0

ComfyUI

The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.

Language: Python | License: GPL-3.0 | Stargazers: 44949 | Issues: 0

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 3325 | Issues: 0