allenpeng (AllenPeng0209)

AllenPeng0209

Geek Repo

Location:Taiwan

Github PK Tool:Github PK Tool

allenpeng's starred repositories

Deep-Live-Cam

real time face swap and one-click video deepfake with only a single image

Language:PythonLicense:AGPL-3.0Stargazers:38305Issues:231Issues:487

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:35059Issues:343Issues:2757

LLM101n

LLM101n: Let's build a Storyteller

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:14783Issues:129Issues:138

flash-attention

Fast and memory-efficient exact attention

Language:PythonLicense:BSD-3-ClauseStargazers:13726Issues:114Issues:1062

nerfstudio

A collaboration friendly studio for NeRFs

Language:PythonLicense:Apache-2.0Stargazers:9376Issues:117Issues:1630

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonLicense:MITStargazers:2961Issues:36Issues:102

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonLicense:Apache-2.0Stargazers:1983Issues:6Issues:243

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonLicense:Apache-2.0Stargazers:1880Issues:27Issues:121

nano-llama31

nanoGPT style version of Llama 3.1

openvla

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Language:PythonLicense:MITStargazers:1125Issues:18Issues:119

ovsam

[ECCV 2024] The official code of paper "Open-Vocabulary SAM".

Language:PythonLicense:NOASSERTIONStargazers:921Issues:13Issues:44

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language:PythonLicense:MITStargazers:875Issues:23Issues:47

GaussianPro

[ICML2024] Official code for GaussianPro: 3D Gaussian Splatting with Progressive Propagation

Language:PythonLicense:MITStargazers:650Issues:26Issues:63

DriveAGI

[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System

Language:PythonLicense:Apache-2.0Stargazers:559Issues:28Issues:7

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language:PythonLicense:NOASSERTIONStargazers:528Issues:6Issues:23

ComfyFlowApp

From comfyui workflow to web app, in seconds

Language:PythonLicense:GPL-3.0Stargazers:516Issues:15Issues:29

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:415Issues:12Issues:28

S3Gaussian

Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving

Language:PythonLicense:NOASSERTIONStargazers:404Issues:12Issues:24

l4casadi

Use PyTorch Models with CasADi for data-driven optimization or learning-based optimal control. Supports Acados.

Language:PythonLicense:MITStargazers:353Issues:8Issues:48

street-gaussians-ns

Unofficial implementation of "Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting", ECCV2024.

Language:PythonLicense:Apache-2.0Stargazers:309Issues:12Issues:53

autoregressive-diffusion-pytorch

Implementation of Autoregressive Diffusion in Pytorch

Language:PythonLicense:MITStargazers:261Issues:12Issues:4

fast-gaussian-rasterization

A geometry-shader-based, global CUDA sorted high-performance 3D Gaussian Splatting rasterizer. Can achieve a 5-10x speedup in rendering compared to the vanialla diff-gaussian-rasterization.

Language:PythonLicense:MITStargazers:258Issues:11Issues:6

TokenPacker

The code for "TokenPacker: Efficient Visual Projector for Multimodal LLM".

titok-pytorch

Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"

Language:PythonLicense:MITStargazers:160Issues:9Issues:3

OccSora

OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:137Issues:7Issues:13

CVT-Occ

CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

ComfyUI-Llama-3-2

Using Llama-3.2 in ComfyUI

Language:PythonLicense:GPL-3.0Stargazers:15Issues:2Issues:0

S3Gaussian

Project Page for S3Gaussian

Language:PythonLicense:NOASSERTIONStargazers:4Issues:0Issues:0