haoranD

Haoran Duan's repositories

Awesome-Human-Activity-Recognition

An up-to-date & curated list of Awesome IMU-based Human Activity Recognition(Ubiquitous Computing) papers, methods & resources. Please note that most of the collections of researches are mainly based on IMU data.

MIT244 150

Awesome-Embodied-AI

A curated list of awesome papers on Embodied AI and related research/industry-driven resources.

MIT242 90

LoG

Level of Gaussians

100

Awesome-Text-to-Video-Generation

A list for Text-to-Video, Image-to-Video works

000

Dreamer-XL

000

3DTopia

Text-to-3D Generation within 5 Minutes

Language:PythonApache-2.0000

all-seeing

[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"

Language:Python000

Awesome-CVPR2024-Low-Level-Vision

A Collection of Papers and Codes in CVPR2023/2022 about low level vision

000

awesome-described-object-detection

A curated list of papers and resources related to Described Object Detection, Open-Vocabulary/Open-World Object Detection and Referring Expression Comprehension. Updated frequently and pull requests welcomed.

000

Awesome-Evaluation-of-Visual-Generation

A list of works on evaluation of visual generation models, including evaluation metrics, models, and systems

000

Awesome-Generative-Image-Composition

A curated list of papers, code, and resources pertaining to generative image composition.

000

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language:Jupyter NotebookNOASSERTION000

EasyVolcap

[SIGGRAPH Asia 2023 (Technical Communications)] EasyVolcap: Accelerating Neural Volumetric Video Research

Language:PythonNOASSERTION000

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookMIT000

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookMIT000

gpt-researcher

GPT based autonomous agent that does online comprehensive research on any given topic

MIT000

llama3-from-scratch

llama3 implementation one matrix multiplication at a time

Language:Jupyter NotebookMIT000

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language:Jupyter NotebookApache-2.0000

LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Language:Jupyter NotebookApache-2.0000

Mamba_State_Space_Model_Paper_List

[Mamba-Survey-2024] Paper list for State-Space-Model/Mamba and it's Applications

MIT000

MonoGS

[CVPR'24] Gaussian Splatting SLAM

NOASSERTION000

Mora

Mora: More like Sora for Generalist Video Generation

Language:Jupyter Notebook000

Neural-Network-Diffusion

We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters

Language:Python000

pykan

Kolmogorov Arnold Networks

Language:Jupyter NotebookMIT000

self-rag

This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.

MIT000

V3D

V3D: Video Diffusion Models are Effective 3D Generators

000

ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Language:PythonApache-2.0000

VMamba

VMamba: Visual State Space Models，code is based on mamba

Language:Python000

World-Models-Autonomous-Driving-Latest-Survey

A curated list of world models for autonomous driving. Keep updated.

000

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:Jupyter NotebookGPL-3.0000