Ziyi Wu (Wuziyi616)

Wuziyi616

Geek Repo

Company:@uoft-isl

Location:University of Toronto, Canada

Home Page:https://wuziyi616.github.io/

Twitter:@Dazitu_616

Github PK Tool:Github PK Tool


Organizations
pairlab
VectorInstitute

Ziyi Wu's starred repositories

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:27458Issues:224Issues:4602

flux

Official inference repo for FLUX.1 models

Language:PythonLicense:Apache-2.0Stargazers:14159Issues:125Issues:130

clip-as-service

🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP

Language:PythonLicense:NOASSERTIONStargazers:12389Issues:220Issues:607

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:11031Issues:64Issues:256

progress

Linux tool to show progress for cp, mv, dd, ... (formerly known as cv)

Language:CLicense:GPL-3.0Stargazers:8532Issues:140Issues:111

AI-Scientist

The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑‍🔬

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7621Issues:86Issues:97

torchtitan

A native PyTorch Library for large model training

Language:PythonLicense:BSD-3-ClauseStargazers:2217Issues:37Issues:135

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:2035Issues:31Issues:84

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1645Issues:23Issues:106

glomap

GLOMAP - Global Structured-from-Motion Revisited

Language:C++License:BSD-3-ClauseStargazers:1323Issues:22Issues:73

yarn

YaRN: Efficient Context Window Extension of Large Language Models

Language:PythonLicense:MITStargazers:1315Issues:14Issues:56

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:1212Issues:21Issues:54

MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language:PythonLicense:NOASSERTIONStargazers:512Issues:6Issues:20

attention_with_linear_biases

Code for the ALiBi method for transformer language models (ICLR 2022)

Language:PythonLicense:MITStargazers:501Issues:12Issues:19

ReconX

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

TransNetV2

TransNet V2: Shot Boundary Detection Neural Network

Language:PythonLicense:MITStargazers:445Issues:9Issues:47

vfusion3d

[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:390Issues:13Issues:9

VisionLLaMA

VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks

MiraData

Official repo for paper "MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions"

Language:PythonLicense:GPL-3.0Stargazers:351Issues:14Issues:15

rerope

Rectified Rotary Position Embeddings

unmasked_teacher

[ICCV2023 Oral] Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Language:PythonLicense:MITStargazers:284Issues:13Issues:47

DOVER

[ICCV 2023, Official Code] for paper "Exploring Video Quality Assessment on User Generated Contents from Aesthetic and Technical Perspectives". Official Weights and Demos provided.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:264Issues:4Issues:33

Netflix-Prime-Auto-Skip

Automatically skip Ads, Intros, Recaps, Credits, etc. on Netflix, Prime video, Disney+ (Hotstar, STAR+), Crunchyroll and HBO max

Language:HTMLLicense:GPL-3.0Stargazers:238Issues:7Issues:83

PDVC

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

Language:PythonLicense:MITStargazers:202Issues:7Issues:59

VidChapters

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Language:Jupyter NotebookLicense:MITStargazers:173Issues:3Issues:21

rope-vit

[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"

Language:PythonLicense:NOASSERTIONStargazers:160Issues:10Issues:9

flashattention2-custom-mask

Triton implementation of FlashAttention2 that adds Custom Masks.

Language:PythonLicense:Apache-2.0Stargazers:62Issues:4Issues:4

VideoScore

official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]

Language:PythonLicense:MITStargazers:38Issues:2Issues:3