Haoran Duan's repositories

Awesome-Human-Activity-Recognition

An up-to-date, curated list of awesome IMU-based Human Activity Recognition (Ubiquitous Computing) papers, methods, and resources. Note that most of the collected research is based on IMU data.

License: MIT, Stargazers: 196, Issues: 14, Issues: 0

Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

License: MIT, Stargazers: 114, Issues: 6, Issues: 0

Awesome-Text-to-Video-Generation

A list of Text-to-Video and Image-to-Video works.

Stargazers: 0, Issues: 0, Issues: 0

3DTopia

Text-to-3D Generation within 5 Minutes

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

all-seeing

[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"

Language: Python, Stargazers: 0, Issues: 0, Issues: 0

ASPIRe

[CVPR 2024] HIG: Hierarchical Interlacement Graph Approach to Scene Graph Generation in Video Understanding

Stargazers: 0, Issues: 0, Issues: 0

Awesome-CVPR2024-Low-Level-Vision

A collection of papers and code from CVPR2023/2022 on low-level vision

Stargazers: 0, Issues: 0, Issues: 0

Awesome-Generative-Image-Composition

A curated list of papers, code, and resources pertaining to generative image composition.

Stargazers: 0, Issues: 0, Issues: 0

chatgpt-on-wechat

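A WeChat chatbot built on large models, with support for WeChat, WeCom (Enterprise WeChat), WeChat Official Accounts, Feishu, and DingTalk. It can use GPT3.5/GPT4.0/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/LinkAI, can process text, voice, and images, can access the operating system and the internet, and supports building a customized enterprise customer-service assistant on top of your own knowledge base.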

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

Deformable-3D-Gaussians

[CVPR 2024] Official implementation of "Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction"

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

FeatUp

Official code for "FeatUp: A Model-Agnostic Framework for Features at Any Resolution" (ICLR 2024)

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

generative-models

Generative Models by Stability AI

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

GPT4Point

[CVPR 2024] GPT4Point: A Unified Framework for Point-Language Understanding and Generation.

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

MobiLlama

MobiLlama: Small Language Model tailored for edge devices

Language: Python, License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

MonoGS

[CVPR'24] Gaussian Splatting SLAM

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

Mora

Mora: More like Sora for Generalist Video Generation

Stargazers: 0, Issues: 0, Issues: 0

Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

Stargazers: 0, Issues: 0, Issues: 0

Neural-Network-Diffusion

We introduce a novel approach for parameter generation, named neural network diffusion (p-diff, where p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters.

Stargazers: 0, Issues: 0, Issues: 0

nsfc

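nsfc - LaTeX template for National Natural Science Foundation of China (NSFC) project proposals (General, Young Scientists, and Regional programs)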

Stargazers: 0, Issues: 0, Issues: 0

OOTDiffusion

Official implementation of OOTDiffusion

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

Pointcept

Pointcept: a codebase for point cloud perception research. Latest works: PTv3 (CVPR'24), PPT (CVPR'24), MSC (CVPR'23), PTv2 (NeurIPS'22)

Language: Python, License: MIT, Stargazers: 0, Issues: 0, Issues: 0

self-rag

The original implementation of "SELF-RAG: Learning to Retrieve, Generate and Critique through Self-Reflection" by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

License: MIT, Stargazers: 0, Issues: 0, Issues: 0

V3D

V3D: Video Diffusion Models are Effective 3D Generators

Stargazers: 0, Issues: 0, Issues: 0

ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

License: Apache-2.0, Stargazers: 0, Issues: 0, Issues: 0

ViT-Lens

[CVPR 2024] ViT-Lens: Towards Omni-modal Representations

License: NOASSERTION, Stargazers: 0, Issues: 0, Issues: 0

VMamba

VMamba: Visual State Space Models; the code is based on Mamba

Stargazers: 0, Issues: 0, Issues: 0

World-Models-Autonomous-Driving-Latest-Survey

A curated list of world models for autonomous driving, kept up to date.

Stargazers: 0, Issues: 0, Issues: 0

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language: Jupyter Notebook, License: GPL-3.0, Stargazers: 0, Issues: 0, Issues: 0