Haoran Duan's repositories

Screws

SCREWS: A Modular Framework for Reasoning with Revisions

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

Adala

Adala: Autonomous DAta (Labeling) Agent framework

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

License:MITStargazers:0Issues:0Issues:0

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Stargazers:0Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model

Stargazers:0Issues:0Issues:0

Dataset-Diffusion

Dataset Diffusion: Diffusion-based Synthetic Data Generation for Pixel-Level Semantic Segmentation (NeurIPS2023)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

deep-chat

Fully customizable AI chat component for your website

Language:TypeScriptLicense:MITStargazers:0Issues:0Issues:0

DeepSpeedExamples

Example models using DeepSpeed

License:Apache-2.0Stargazers:0Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

License:NOASSERTIONStargazers:0Issues:0Issues:0

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

License:Apache-2.0Stargazers:0Issues:0Issues:0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"

License:MITStargazers:0Issues:0Issues:0

fast-DiT

Fast Diffusion Models with Transformers

License:NOASSERTIONStargazers:0Issues:0Issues:0

FoodSAM

FoodSAM: Any Food Segmentation

License:Apache-2.0Stargazers:0Issues:0Issues:0

GenSim

GenSim: Generating Robotic Simulation Tasks via Large Language Models

License:MITStargazers:0Issues:0Issues:0

groundingLMM

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Stargazers:0Issues:0Issues:0

idify

Make ID photo right in the browser.

License:GPL-3.0Stargazers:0Issues:0Issues:0

MasQCLIP

(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation

License:NOASSERTIONStargazers:0Issues:0Issues:0

MosaicFusion

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

openpilot

openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

Stargazers:0Issues:0Issues:0

panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

rich-text-to-image

Rich-Text-to-Image Generation

License:MITStargazers:0Issues:0Issues:0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

License:Apache-2.0Stargazers:0Issues:0Issues:0

TimeLlama

The official repo of TimeLlama, an instruction-finetuned Llama2 series that improve complex temporal reasoning ability.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Stargazers:0Issues:0Issues:0

vditor

♏ 一款浏览器端的 Markdown 编辑器,支持所见即所得(富文本)、即时渲染(类似 Typora)和分屏预览模式。An In-browser Markdown editor, support WYSIWYG (Rich Text), Instant Rendering (Typora-like) and Split View modes.

License:MITStargazers:0Issues:0Issues:0

waymax

A JAX-based simulator for autonomous driving research.

License:NOASSERTIONStargazers:0Issues:0Issues:0

WebODM

User-friendly, commercial-grade software for processing aerial imagery. 🛩

License:AGPL-3.0Stargazers:0Issues:0Issues:0