zhujiagang's starred repositories

openpilot

openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system in 275+ supported cars.

Language:PythonLicense:MITStargazers:49256Issues:1298Issues:2744

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21636Issues:182Issues:478

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11264Issues:159Issues:295

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:10287Issues:104Issues:348

EdgeGPT

Reverse engineered API of Microsoft's Bing Chat AI

Language:PythonLicense:UnlicenseStargazers:8085Issues:92Issues:364

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language:Jupyter NotebookLicense:MITStargazers:5707Issues:76Issues:219

bot-on-anything

Connect AI models (like ChatGPT-3.5/4.0, Baidu Yiyan, New Bing, Bard) to apps (like Wechat, public account, DingTalk, Telegram, QQ). 将 ChatGPT、必应、文心一言、谷歌Bard 等对话模型连接各类应用,如微信、公众号、QQ、Telegram、Gmail、Slack、Web、企业微信、飞书、钉钉等。

Language:PythonLicense:MITStargazers:3893Issues:37Issues:381

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:3284Issues:40Issues:166

ijepa

Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive architecture."

Language:PythonLicense:NOASSERTIONStargazers:2789Issues:59Issues:58

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language:PythonLicense:MITStargazers:2684Issues:45Issues:51

lanenet-lane-detection

Unofficial implemention of lanenet model for real time lane detection

Language:PythonLicense:Apache-2.0Stargazers:2337Issues:54Issues:560

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language:PythonLicense:Apache-2.0Stargazers:2243Issues:40Issues:607

mmgeneration

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Language:PythonLicense:Apache-2.0Stargazers:1884Issues:27Issues:123
Language:PythonLicense:Apache-2.0Stargazers:1742Issues:121Issues:22

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1640Issues:23Issues:105

llm-guard

The Security Toolkit for LLM Interactions

Language:PythonLicense:MITStargazers:1131Issues:18Issues:59

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:940Issues:69Issues:22

PTI

Official Implementation for "Pivotal Tuning for Latent-based editing of Real Images" (ACM TOG 2022) https://arxiv.org/abs/2106.05744

Language:Jupyter NotebookLicense:MITStargazers:896Issues:24Issues:58

MiniGPT-5

Official implementation of paper "MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens"

Language:PythonLicense:Apache-2.0Stargazers:842Issues:12Issues:42

HiDiffusion

[ECCV 2024] HiDiffusion: Increases the resolution and speed of your diffusion model by only adding a single line of code!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:737Issues:6Issues:31

LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Language:PythonLicense:Apache-2.0Stargazers:690Issues:13Issues:103

DCNv2_latest

DCNv2 supports decent pytorch such as torch 1.5+ (now 1.8+)

Language:C++License:BSD-3-ClauseStargazers:625Issues:8Issues:71

MagicDrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”

Language:PythonLicense:AGPL-3.0Stargazers:553Issues:16Issues:83

ADAPT

This repository is an official implementation of ADAPT: Action-aware Driving Caption Transformer, accepted by ICRA 2023.

Language:PythonLicense:MITStargazers:386Issues:8Issues:21

OccWorld

[ECCV 2024] 3D World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:330Issues:9Issues:27

UniPC

[NeurIPS 2023] UniPC: A Unified Predictor-Corrector Framework for Fast Sampling of Diffusion Models

Language:Jupyter NotebookLicense:MITStargazers:291Issues:8Issues:12

SelfOcc

[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

Language:PythonLicense:Apache-2.0Stargazers:278Issues:14Issues:24

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

GeoDiffusion

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)

Language:PythonLicense:MITStargazers:58Issues:4Issues:20