Yunlong (Yolo) Tang's repositories
Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest Papers, Code, and Datasets on Vid-LLMs.
LaunchpadGPT
Repo for ICMC 2023 paper: LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
Awesome-RegionLLMs
Large Language Models for Fine-grained Vision Understanding
PosterLayout-CVPR2023
Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).
MMComposition
Repo for MMComposition Benchmark
video-cover-gen
Undergraduate thesis project: Video Cover Generation
name-my-model
Generate a cool name for the model proposed in your paper!
Awesome-Anything
AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask
computer_vision_course
Materials for the computer vision course
Intelligent-Robots-Lab
Lab materials for the intelligent robotics course
const_layout
Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)
Context-GEBC
Second-place solution to the Generic Event Boundary Captioning task in the LOVEU Challenge (CVPR 2022 workshop)
devicon
Set of icons representing programming languages and design & development tools
Emu
Emu: An Open Multimodal Generalist
github-readme-stats
⚡ Dynamically generated stats for your GitHub READMEs
gpt4free
Decentralising the AI industry, just some language model APIs...
ifseg
IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)
MaskCLIP
Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 2022 Oral)
modelscope
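ModelScope is committed to empowering a wide spectrum of developers to leverage AI models from various domains. (It aims to foster a thriving Model-as-a-Service ecosystem through open community collaboration, open-source AI models, and related innovative technologies.)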
paper-reading
Paragraph-by-paragraph close reading of classic and new deep learning papers
segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
so-vits-svc
SoftVC VITS Singing Voice Conversion
Turtlebot3_PControlFollowWall_Yoyov3
Final course project on intelligent robotics at Southern University of Science and Technology (SUSTech) in spring 2022.
Untrimmed-Video-Feature-Extractor
A simple and effective feature extractor for untrimmed videos