zhujiagang's starred repositories

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Language:PythonStargazers:44Issues:0Issues:0

GeoDiffusion

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)

Language:PythonLicense:MITStargazers:37Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1302Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:10231Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:16717Issues:0Issues:0

MobileAgent

Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception

Language:PythonLicense:MITStargazers:1842Issues:0Issues:0

openpilot

openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Language:PythonLicense:MITStargazers:47768Issues:0Issues:0
Stargazers:1614Issues:0Issues:0

EdgeGPT

Reverse engineered API of Microsoft's Bing Chat AI

Language:PythonLicense:UnlicenseStargazers:8104Issues:0Issues:0

bot-on-anything

Connect AI models (like ChatGPT-3.5/4.0, Baidu Yiyan, New Bing, Bard) to apps (like Wechat, public account, DingTalk, Telegram, QQ). 将 ChatGPT、必应、文心一言、谷歌Bard 等对话模型连接各类应用,如微信、公众号、QQ、Telegram、Gmail、Slack、Web、企业微信、飞书、钉钉等。

Language:PythonLicense:MITStargazers:3702Issues:0Issues:0

llm-guard

The Security Toolkit for LLM Interactions

Language:PythonLicense:MITStargazers:859Issues:0Issues:0

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:583Issues:0Issues:0

SelfOcc

[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

Language:PythonLicense:Apache-2.0Stargazers:227Issues:0Issues:0

OccWorld

3D World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:269Issues:0Issues:0

UniPC

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:275Issues:0Issues:0

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language:PythonLicense:Apache-2.0Stargazers:2027Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:8868Issues:0Issues:0

MagicDrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Language:PythonLicense:AGPL-3.0Stargazers:358Issues:0Issues:0

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonLicense:NOASSERTIONStargazers:2673Issues:0Issues:0

DCNv2_latest

DCNv2 supports decent pytorch such as torch 1.5+ (now 1.8+)

Language:C++License:BSD-3-ClauseStargazers:597Issues:0Issues:0

lanenet-lane-detection

Unofficial implemention of lanenet model for real time lane detection

Language:PythonLicense:Apache-2.0Stargazers:2269Issues:0Issues:0

mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Language:PythonLicense:Apache-2.0Stargazers:1810Issues:0Issues:0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language:Jupyter NotebookLicense:MITStargazers:5397Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:849Issues:0Issues:0

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language:PythonLicense:Apache-2.0Stargazers:816Issues:0Issues:0

ADAPT

This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.

Language:PythonLicense:MITStargazers:367Issues:0Issues:0

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language:Jupyter NotebookLicense:MITStargazers:883Issues:0Issues:0

Driving-with-LLMs

PyTorch implementation for the paper "Driving with LLMs: Fusing Object-Level Vector Modality for Explainable Autonomous Driving"

Language:PythonLicense:Apache-2.0Stargazers:310Issues:0Issues:0

consistency_models

Official repo for consistency models.

Language:PythonLicense:MITStargazers:5944Issues:0Issues:0