Yusong Hu (Ethanhuhuhu)

Ethanhuhuhu

Geek Repo

Company:Nankai University

Location:Tianjin China

Github PK Tool:Github PK Tool

Yusong Hu's starred repositories

Stargazers:10Issues:0Issues:0

home-robot

Mobile manipulation research tools for roboticists

Language:PythonLicense:MITStargazers:819Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20701Issues:0Issues:0

Cascade-CLIP

Official implement of ICML2024 Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation

Language:PythonLicense:MITStargazers:27Issues:0Issues:0

LE3D

HDR 3D Scene Editing!

License:NOASSERTIONStargazers:125Issues:0Issues:0

awesome-kan

A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold Network field.

Stargazers:1785Issues:0Issues:0

Libra

Simple PyTorch implementation of "Libra: Building Decoupled Vision System on Large Language Models" (accepted by ICML 2024)

Language:PythonLicense:Apache-2.0Stargazers:35Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language:PythonLicense:Apache-2.0Stargazers:3332Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2791Issues:0Issues:0

CoIN

Instruction Tuning in Continual Learning paradigm

Language:PythonLicense:Apache-2.0Stargazers:18Issues:0Issues:0

VLM_survey

Collection of AWESOME vision-language models for vision tasks

Stargazers:2019Issues:0Issues:0

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5503Issues:0Issues:0

Awesome-LLM-Robotics

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

License:BSD-3-ClauseStargazers:2531Issues:0Issues:0

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4V. 接近GPT-4V表现的可商用开源多模态对话模型

Language:PythonLicense:MITStargazers:4188Issues:0Issues:0

GMM

Generative Multi-modal Models are Good Class Incremental Learners, CVPR 2024 [PyTorch Code]

Language:PythonStargazers:25Issues:0Issues:0

MoneyPrinterTurbo

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Language:PythonLicense:MITStargazers:14999Issues:0Issues:0

LED

[ICCV 2023] Lighting Every Darkness in Two Pairs: A Calibration-Free Pipeline for RAW Denoising && [Arxiv 2023] Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model

Language:PythonLicense:NOASSERTIONStargazers:291Issues:0Issues:0

GET

GET: Unlocking the Multi-modal Potential of CLIP for Generalized Category Discovery

License:MITStargazers:8Issues:0Issues:0

PhotoMaker

PhotoMaker

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:8675Issues:0Issues:0

CoDA_NeurIPS2023

Official code for NeurIPS2023 paper: CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection

Language:Jupyter NotebookLicense:MITStargazers:170Issues:0Issues:0

MQ-Det

Official PyTorch implementation of "Multi-modal Queried Object Detection in the Wild" (accepted by NeurIPS 2023)

Language:PythonLicense:Apache-2.0Stargazers:249Issues:0Issues:0
Language:PythonStargazers:98Issues:0Issues:0

CorrMatch

Official code for "CorrMatch: Label Propagation via Correlation Matching for Semi-Supervised Semantic Segmentation"

Language:PythonStargazers:103Issues:0Issues:0

dytox

Dynamic Token Expansion with Continual Transformers, accepted at CVPR 2022

Language:PythonLicense:Apache-2.0Stargazers:133Issues:0Issues:0

ms-nerf

Multi-Space Neural Radiance Fields(CVPR 2023)

Language:PythonLicense:NOASSERTIONStargazers:165Issues:0Issues:0

MIL-NCE_HowTo100M

PyTorch GPU distributed training code for MIL-NCE HowTo100M

Language:PythonLicense:Apache-2.0Stargazers:212Issues:0Issues:0

ACROSS-ACL23

Official code repo for paper: ACROSS: An Alignment-based Framework for Low-Resource Many-to-One Cross-Lingual Summarization

Stargazers:12Issues:0Issues:0

Continual-CLIP

Official repository for "CLIP model is an Efficient Continual Learner".

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0

RIDCP_dehazing

[CVPR 2023] | RIDCP: Revitalizing Real Image Dehazing via High-Quality Codebook Priors

Language:PythonLicense:NOASSERTIONStargazers:188Issues:0Issues:0

CVPR2023-DMVFN

CVPR2023 (highlight) - A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Language:Jupyter NotebookLicense:MITStargazers:328Issues:0Issues:0