Yusong Hu's starred repositories
home-robot
Mobile manipulation research tools for roboticists
Cascade-CLIP
Official implementation of ICML 2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation
awesome-kan
A comprehensive collection of KAN (Kolmogorov-Arnold Network) resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.
HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
VLM_survey
Collection of AWESOME vision-language models for vision tasks
StoryDiffusion
Create Magic Story!
Awesome-LLM-Robotics
A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites
MoneyPrinterTurbo
Generate high-definition short videos with one click using large AI models.
PhotoMaker
PhotoMaker
CoDA_NeurIPS2023
Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection
MIL-NCE_HowTo100M
PyTorch GPU distributed training code for MIL-NCE HowTo100M
ACROSS-ACL23
Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization
Continual-CLIP
Official repository for "CLIP model is an Efficient Continual Learner".
RIDCP_dehazing
[CVPR 2023] | RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors
CVPR2023-DMVFN
CVPR2023 (highlight) - A Dynamic Multi-Scale Voxel Flow Network for Video Prediction