Yidong Cai (dawnyc)

Company: Nanjing University

Location: Nanjing, China

Home Page: https://dawnyc.github.io/homepage/

Yidong Cai's starred repositories

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language: Python | License: Apache-2.0 | Stargazers: 960 | Issues: 0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Models

License: CC0-1.0 | Stargazers: 17041 | Issues: 0

MAE-Lite

Official implementation of the ICML 2023 paper: "A Closer Look at Self-Supervised Lightweight Vision Transformers"

Language: Python | License: Apache-2.0 | Stargazers: 110 | Issues: 0

VastTrack

VastTrack: Vast Category Visual Object Tracking

Language: Python | Stargazers: 41 | Issues: 0

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

License: Apache-2.0 | Stargazers: 1963 | Issues: 0

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language: Python | License: Apache-2.0 | Stargazers: 3764 | Issues: 0

wiou

Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism

Language: Python | Stargazers: 59 | Issues: 0

mlc-llm

Universal LLM Deployment Engine with ML Compilation

Language: Python | License: Apache-2.0 | Stargazers: 18517 | Issues: 0

Visual-Tracking-Development

Visual Object Tracking

Language: Python | Stargazers: 429 | Issues: 0

Fooocus

Focus on prompting and generating

Language: Python | License: GPL-3.0 | Stargazers: 39663 | Issues: 0

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 8035 | Issues: 0

Transformer_Tracking

A paper digest of Transformer-based approaches to visual tracking.

Stargazers: 262 | Issues: 0

oft

Official implementation of "Controlling Text-to-Image Diffusion by Orthogonal Finetuning".

Language: Python | License: MIT | Stargazers: 275 | Issues: 0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language: Python | License: NOASSERTION | Stargazers: 35645 | Issues: 0

UserControllableLT

PyTorch implementation of "User-Controllable Latent Transformer for StyleGAN Image Layout Editing" [Computer Graphics Forum (Proc. of Pacific Graphics 2022)]

Language: Python | License: MIT | Stargazers: 276 | Issues: 0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4186 | Issues: 0