Beast code in Giters

Zeyuan Chen's starred repositories

EasySpider

A visual no-code/code-free web crawler/spider易采集：一个可视化浏览器自动化测试/数据采集/爬虫软件，可以无代码图形化的设计和执行爬虫任务。别名：ServiceWrapper面向Web应用的智能化服务封装系统。

Language:JavaScriptNOASSERTION34409 223 495

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION26206 217 237

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonApache-2.021669 182 478

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonMIT11276 160 296

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language:C++Apache-2.08231 87 1801

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language:PythonMIT4364 76 329

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonMIT3660 47 175

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language:PythonMIT2877 36 99

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language:Jupyter NotebookApache-2.02092 25 68

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonMIT2030 31 84

MeshAnything

From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Language:PythonNOASSERTION1978 30 26

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.01803 26 118

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonApache-2.01413 23 60

Bunny

A family of lightweight multimodal models.

Language:PythonApache-2.0883 19 114

GaussianObject

GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting (SIGGRAPH Asia 2024, TOG)

Language:Jupyter Notebook801 22 47

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

731 24 9

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonApache-2.0716 11 38

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonMIT537 28 35

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Language:Python494 11 44

ScoreHMR

ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)

Language:PythonMIT380 13 29

SEED-X

Multimodal Models in Real World

Language:Jupyter NotebookNOASSERTION374 19 24

FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Apache-2.0357 32 7

NeuScraper

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Language:PythonMIT208 10 9

multi-hmr

Pytorch demo code and models for Multi-HMR

Language:PythonNOASSERTION185 9 34

VisFusion

[CVPR 2023] Code for "VisFusion: Visibility-aware Online 3D Scene Reconstruction from Videos"

Language:PythonApache-2.0182 3 6

VidProM

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

95 2 1

HMT-pytorch

Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"

Language:PythonApache-2.057 2 3

VQA-With-Multimodal-Transformers

Exploring multimodal fusion-type transformer models for visual question answering (on DAQUAR dataset)

Language:Jupyter NotebookApache-2.033 3 1

Inter4K

Official repository for downloading and using Inter4K video interpolation dataset

Language:PythonNOASSERTION25 2 4

mistral-7b-tensorrt-llm-truss

Language:Python500