cheng zhang's repositories
Instant-angelo
Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!
A-Scan2BIM
Official implementation of the paper A-Scan2BIM: Assistive Scan to Building Information Modeling
acme
A library of reinforcement learning components and agents
bao
Chat bot with LLM and fact references, backed by retrieval-augmented generation (RAG).
BarrierNet
Safe robot learning
code-act
Official Repo for paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.
DeepSeek-VL
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
embedchain
The Open Source RAG framework
gim
GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)
IntrinsicImageDiffusion
Intrinsic Image Diffusion for Single-view Material Estimation
LaVague
Large Action Model framework to automate browser interaction
liquid-s4
Liquid Structural State-Space Models
LlamaGym
Fine-tune LLM agents with online reinforcement learning
lobe-chat
🤖 Lobe Chat - an open-source, high-performance chatbot framework that supports speech synthesis, multimodality, and an extensible Function Call plugin system. Supports one-click free deployment of your private ChatGPT/LLM web application.
MODRLC
The Advanced Controls Test Bed (ACTB) is a virtual buildings test bed that interfaces external controllers to high-fidelity Spawn of EnergyPlus models.
MVDiffusion
MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion, NeurIPS 2023 (spotlight)
neuromancer
PyTorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.
OpenVoice
Instant voice cloning by MyShell.
PatchFusion
An End-to-End Tile-Based Framework for High-Resolution Monocular Metric Depth Estimation
s4
Structured state space sequence models
Scaffold-GS
[CVPR 2024] Scaffold-GS: Structured 3D Gaussians for View-Adaptive Rendering
Semantic-UI
Semantic is a UI component framework based around useful principles from natural language.
sinergym
Gym environment for building simulation and control using reinforcement learning
skypilot
SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.
SuGaR
Official PyTorch implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
TDengine
TDengine is an open source, high-performance, cloud native time-series database optimized for Internet of Things (IoT), Connected Cars, Industrial IoT and DevOps.
vstar
PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"