yunlong10

Yunlong (Yolo) Tang's repositories

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 Latest Papers, Code and Datasets on Vid-LLMs.

LaunchpadGPT

Repo for ICMC 2023 paper: LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad

Language: Python | License: MIT | 15 20

Ads-1k

Dataset with 1000+ video advertisements proposed by "Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward" (ACCV 2022)

Language: Python | License: BSD-2-Clause | 9 20

Awesome-RegionLLMs

Large Language Models for Fine-grained Vision Understanding

6 20

PosterLayout-CVPR2023

Official repository for "PosterLayout: A New Benchmark and Approach for Content-aware Visual-Textual Presentation Layout" (CVPR 2023).

Language: Python | 500

MMComposition

Repo for MMComposition Benchmark

4 10

video-cover-gen

Undergraduate thesis project: Video Cover Generation

Language: Jupyter Notebook | 4 10

name-my-model

Generate a cool name for your model proposed in your paper!

Language: Python | 3 10

yunlong10

3 10

Awesome-Anything

AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask

100

computer_vision_course

Materials for a computer vision course

010

Intelligent-Robots-Lab

Lab materials for the intelligent robotics course

000

Awesome-Large-World-Models

010

Awesome-World-Models

010

const_layout

Official implementation of the MM'21 paper "Constrained Graphic Layout Generation via Latent Optimization" (LayoutGAN++, CLG-LO, and Layout evaluation)

Language: Python | License: AGPL-3.0 | 000

Context-GEBC

Second-place solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2022 workshop)

Language: Python | License: MIT | 000

devicon

Set of icons representing programming languages, designing & development tools

Language: Python | License: MIT | 000

Emu

Emu: An Open Multimodal Generalist

Language: Python | 000

github-readme-stats

:zap: Dynamically generated stats for your github readmes

Language: JavaScript | License: MIT | 000

gpt4free

Decentralising the AI industry, just some language model APIs...

Language: Python | License: GPL-3.0 | 000

ifseg

IFSeg: Image-free Semantic Segmentation via Vision-Language Model (CVPR 2023)

Language: Python | 000

MaskCLIP

Official PyTorch implementation of "Extract Free Dense Labels from CLIP" (ECCV 2022 Oral)

Language: Python | License: Apache-2.0 | 000

modelscope

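ModelScope is committed to empowering a wide spectrum of developers to leverage AI models from various domains. (Committed to fostering a thriving Model-as-a-Service ecosystem through open community collaboration and by open-sourcing AI models and related innovative technologies.)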

Language: Python | License: Apache-2.0 | 000

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

License: Apache-2.0 | 000

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | 000

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: BSD-3-Clause | 000

Turtlebot3_PControlFollowWall_Yoyov3

Final course project on intelligent robotics at Southern University of Science and Technology (SUSTech) in spring 2022.

Language: Makefile | 000

Untrimmed-Video-Feature-Extractor

A simple and effective feature extractor for untrimmed videos

Language: Python | License: MIT | 000

yunlong10.github.io

Language: HTML | License: MIT | 010

yunlong10_old.github.io

Language: JavaScript | License: CC0-1.0 | 000