Haoran Duan's repositories

Screws

SCREWS: A Modular Framework for Reasoning with Revisions

Language: Python | License: MIT | Stargazers: 1 | Issues: 0

Adala

Adala: Autonomous DAta (Labeling) Agent framework

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0

ArXivChatGuru

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

License: MIT | Stargazers: 0 | Issues: 0

ChatDev

Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)

Stargazers: 0 | Issues: 0

city-dreamer

The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (arXiv 2309.00610)

Stargazers: 0 | Issues: 0

CogVLM

A state-of-the-art open visual language model

Stargazers: 0 | Issues: 0

deep-chat

Fully customizable AI chat component for your website

Language: TypeScript | License: MIT | Stargazers: 0 | Issues: 0

DeepSpeedExamples

Example models using DeepSpeed

License: Apache-2.0 | Stargazers: 0 | Issues: 0

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"

License: MIT | Stargazers: 0 | Issues: 0

fast-DiT

Fast Diffusion Models with Transformers

License: NOASSERTION | Stargazers: 0 | Issues: 0

FoodSAM

FoodSAM: Any Food Segmentation

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Generalization-in-OOD-Detection

Realistic Out-of-Distribution (OOD) Detection

Language: Python | Stargazers: 0 | Issues: 0

Generative-AI

Multimodal Image Synthesis and Editing: The Generative AI Era [TPAMI 2023]

Stargazers: 0 | Issues: 0

GenSim

GenSim: Generating Robotic Simulation Tasks via Large Language Models

License: MIT | Stargazers: 0 | Issues: 0

idify

Make ID photos right in the browser.

License: GPL-3.0 | Stargazers: 0 | Issues: 0

MasQCLIP

(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation

License: NOASSERTION | Stargazers: 0 | Issues: 0

MosaicFusion

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

License: NOASSERTION | Stargazers: 0 | Issues: 0

open-interpreter

OpenAI's Code Interpreter in your terminal, running locally.

License: MIT | Stargazers: 0 | Issues: 0

openpilot

openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0

rich-text-to-image

Rich-Text-to-Image Generation

License: MIT | Stargazers: 0 | Issues: 0

RM-PRT

Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks

License: Apache-2.0 | Stargazers: 0 | Issues: 0

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

skypilot

SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

License: Apache-2.0 | Stargazers: 0 | Issues: 0

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

Stargazers: 0 | Issues: 0

vditor

♏ An in-browser Markdown editor that supports WYSIWYG (rich text), instant rendering (Typora-like), and split-view modes.

License: MIT | Stargazers: 0 | Issues: 0

waymax

A JAX-based simulator for autonomous driving research.

License: NOASSERTION | Stargazers: 0 | Issues: 0

WebODM

User-friendly, commercial-grade software for processing aerial imagery. 🛩

License: AGPL-3.0 | Stargazers: 0 | Issues: 0