Haoran Duan's repositories
Awesome-Human-Activity-Recognition
An up-to-date & curated list of Awesome IMU-based Human Activity Recognition(Ubiquitous Computing) papers, methods & resources. Please note that most of the collections of researches are mainly based on IMU data.
Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
Awesome-Text-to-Video-Generation
A list for Text-to-Video, Image-to-Video works
3DTopia
Text-to-3D Generation within 5 Minutes
all-seeing
[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"
ASPIRe
[CVPR 2024] HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding
Awesome-CVPR2024-Low-Level-Vision
A Collection of Papers and Codes in CVPR2023/2022 about low level vision
Awesome-Generative-Image-Composition
A curated list of papers, code, and resources pertaining to generative image composition.
chatgpt-on-wechat
基于大模型搭建的微信聊天机器人,同时支持微信、企业微信、公众号、飞书、钉钉接入,可选择GPT3.5/GPT4.0/Claude/文心一言/讯飞星火/通义千问/Gemini/GLM-4/LinkAI,能处理文本、语音和图片,访问操作系统和互联网,支持基于自有知识库进行定制企业智能客服。
Deformable-3D-Gaussians
[CVPR 2024] Official implementation of "Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction"
FeatUp
Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024
generative-ai-for-beginners
18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
generative-models
Generative Models by Stability AI
GPT4Point
[CVPR 2024] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.
LMDrive
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
MobiLlama
MobiLlama : Small Language Model tailored for edge devices
MonoGS
[CVPR'24] Gaussian Splatting SLAM
Mora
Mora: More like Sora for Generalist Video Generation
Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters
nsfc
nsfc - 国家自然科学基金项目LaTeX模版(面青地)
OOTDiffusion
Official implementation of OOTDiffusion
Pointcept
Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24), PPT (CVPR'24), MSC (CVPR'23), PTv2 (NeurIPS'22)
self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
V3D
V3D: Video Diffusion Models are Effective 3D Generators
ViDAR
[CVPR 2024 Highlight] Visual Point Cloud Forecasting
ViT-Lens
[CVPR 2024] ViT-Lens: Towards Omni-modal Representations
VMamba
VMamba: Visual State Space Models,code is based on mamba
World-Models-Autonomous-Driving-Latest-Survey
A curated list of world models for autonomous driving. Keep updated.
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection