Zeyuan Chen (zeyuanchen23)

Company: Salesforce Research

Zeyuan Chen's starred repositories

EasySpider

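A visual no-code/code-free web crawler/spider. EasySpider (易采集): a visual tool for browser automation testing, data collection, and web crawling that lets you design and run crawler tasks graphically, without writing code. Also known as ServiceWrapper, an intelligent service-wrapping system for web applications.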

Language: JavaScript | License: NOASSERTION | Stargazers: 29157 | Issues: 200 | Issues: 407

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 22902 | Issues: 187 | Issues: 189

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python | License: Apache-2.0 | Stargazers: 20288 | Issues: 176 | Issues: 353

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python | License: Apache-2.0 | Stargazers: 10883 | Issues: 162 | Issues: 194

TensorRT-LLM

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

Language: C++ | License: Apache-2.0 | Stargazers: 7402 | Issues: 84 | Issues: 1546

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language: Python | License: MIT | Stargazers: 4187 | Issues: 76 | Issues: 319

open_flamingo

An open-source framework for training large multimodal models.

Language: Python | License: MIT | Stargazers: 3560 | Issues: 47 | Issues: 172

lida

Automatic Generation of Visualizations and Infographics using Large Language Models

Language: Jupyter Notebook | License: MIT | Stargazers: 2573 | Issues: 37 | Issues: 91

EfficientSAM

EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 1971 | Issues: 23 | Issues: 64

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | License: Apache-2.0 | Stargazers: 1290 | Issues: 23 | Issues: 54

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language: Python | License: Apache-2.0 | Stargazers: 882 | Issues: 19 | Issues: 68

Bunny

A family of lightweight multimodal models.

Language: Python | License: Apache-2.0 | Stargazers: 783 | Issues: 22 | Issues: 92

GaussianObject

Code for "GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting"

MagicDance

[ICML 2024] MagicPose (also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Language: Python | License: NOASSERTION | Stargazers: 613 | Issues: 32 | Issues: 34

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | Stargazers: 596 | Issues: 11 | Issues: 28

magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Language: Python | License: MIT | Stargazers: 480 | Issues: 30 | Issues: 33

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

ScoreHMR

ScoreHMR: Score-Guided Diffusion for 3D Human Recovery (CVPR 2024)

Language: Python | License: MIT | Stargazers: 367 | Issues: 13 | Issues: 24

FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

SEED-X

Multimodal Models in Real World

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 296 | Issues: 18 | Issues: 17

NeuScraper

[ACL 2024] This is the code repo for our ACL’24 paper "Cleaner Pretraining Corpus Curation with Neural Web Scraping".

Language: Python | License: MIT | Stargazers: 196 | Issues: 10 | Issues: 5

ChartVLM

Official Repository of ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning

Language: Python | License: CC-BY-4.0 | Stargazers: 188 | Issues: 12 | Issues: 13

VisFusion

[CVPR 2023] Code for "VisFusion: Visibility-aware Online 3D Scene Reconstruction from Videos"

Language: Python | License: Apache-2.0 | Stargazers: 177 | Issues: 3 | Issues: 6

multi-hmr

PyTorch demo code and models for Multi-HMR

Language: Python | License: NOASSERTION | Stargazers: 146 | Issues: 6 | Issues: 21

VidProM

VidProM: A Million-scale Real Prompt-Gallery Dataset for Text-to-Video Diffusion Models

single-video-curation-svd

Educational repository for applying the main video data curation techniques presented in the Stable Video Diffusion paper.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 79 | Issues: 3 | Issues: 1

HMT-pytorch

Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"

Language: Python | License: Apache-2.0 | Stargazers: 50 | Issues: 0 | Issues: 0

VQA-With-Multimodal-Transformers

Exploring multimodal fusion-type transformer models for visual question answering (on DAQUAR dataset)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 31 | Issues: 3 | Issues: 1

Inter4K

Official repository for downloading and using Inter4K video interpolation dataset

Language: Python | License: NOASSERTION | Stargazers: 23 | Issues: 2 | Issues: 4
Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0