Haoran Duan's repositories
Adala
Adala: Autonomous DAta (Labeling) Agent framework
ArXivChatGuru
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
ChatDev
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
city-dreamer
The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (arXiv 2309.00610)
CogVLM
a state-of-the-art-level open visual language model
deep-chat
Fully customizable AI chat component for your website
DeepSpeedExamples
Example models using DeepSpeed
efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
fast-DiT
Fast Diffusion Models with Transformers
FoodSAM
FoodSAM: Any Food Segmentation
Generalization-in-OOD-Detection
Realisitic Out-of-Distribution (OOD) Detection
Generative-AI
Multimodal Image Synthesis and Editing: The Generative AI Era [TPAMI 2023]
GenSim
GenSim: Generating Robotic Simulation Tasks via Large Language Models
idify
Make ID photo right in the browser.
MasQCLIP
(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation
MosaicFusion
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation
open-interpreter
OpenAI's Code Interpreter in your terminal, running locally.
openpilot
openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.
rich-text-to-image
Rich-Text-to-Image Generation
RM-PRT
Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.
skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
vditor
♏ 一款浏览器端的 Markdown 编辑器,支持所见即所得(富文本)、即时渲染(类似 Typora)和分屏预览模式。An In-browser Markdown editor, support WYSIWYG (Rich Text), Instant Rendering (Typora-like) and Split View modes.
waymax
A JAX-based simulator for autonomous driving research.
WebODM
User-friendly, commercial-grade software for processing aerial imagery. 🛩