dawnyc

followers

following

stars

Nanjing University

Nanjing, China

https://dawnyc.github.io/homepage/

Yidong Cai's starred repositories

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonApache-2.096000

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

CC0-1.01704100

MAE-Lite

Official implement for ICML2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"

Language:PythonApache-2.011000

VastTrack

VastTrack: Vast Category Visual Object Tracking

Language:Python4100

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

Apache-2.0196300

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonApache-2.0376400

wiou

Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism

Language:Python5900

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language:PythonApache-2.01851700

Visual-Tracking-Development

Visual Object Tracking

Language:Python42900

Fooocus

Focus on prompting and generating

Language:PythonGPL-3.03966300

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonApache-2.0803500

Transformer_Tracking

This repository is a paper digest of Transformer-related approaches in visual tracking tasks.

oft

Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".

Language:PythonMIT27500

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonNOASSERTION3564500

UserControllableLT

PyTorch implementation of ``User-Controllable Latent Transformer for StyleGAN Image Layout Editing'' [Computer Graphics Forum (Proc. of Pacific Graphics 2022)]

Language:PythonMIT27600

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonApache-2.0418600