CHENGY12's starred repositories
Tree-Transformer
Implementation of the paper Tree Transformer
Structured-Diffusion-Guidance
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
awesome-english-ebooks
The Economist (with audio), The New Yorker, The Guardian, Wired, The Atlantic, and other English-language magazines, free to download in epub, mobi, and pdf formats, updated weekly
LaVi-Bridge
[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation
i-stylegan
Multi-domain image generation and translation with identifiability guarantees
Chain-of-Spot
Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models
SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models".
unmasked_teacher
[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Revisiting-Contrastive-SSL
Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations. [NeurIPS 2021]