Jian (valencebond)

Company: CASIA

Location: Beijing

Jian's starred repositories

RecFormer

Replication of the paper "Text Is All You Need: Learning Language Representations for Sequential Recommendation" at KDD'23.

Language: Python, Stargazers: 70, Issues: 0

OpenP5

OpenP5: An Open-Source Platform for Developing, Training, and Evaluating LLM-based Recommender Systems

Language: Python, License: Apache-2.0, Stargazers: 196, Issues: 0

Open-MAGVIT2

Open-MAGVIT2: Democratizing Autoregressive Visual Generation

Language: Python, License: Apache-2.0, Stargazers: 346, Issues: 0

Kolors

Kolors Team

Language: Python, License: Apache-2.0, Stargazers: 2781, Issues: 0

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language: Python, License: Apache-2.0, Stargazers: 8105, Issues: 0

awesome-video-generation

A collection of awesome video generation studies.

Language: TeX, License: MIT, Stargazers: 186, Issues: 0

DiG

DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention

Language: Python, License: MIT, Stargazers: 100, Issues: 0

magvit2-pytorch

Implementation of MagViT2 Tokenizer in PyTorch

Language: Python, License: MIT, Stargazers: 501, Issues: 0

Language: Python, License: NOASSERTION, Stargazers: 94, Issues: 0

HunyuanDiT

Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language: Python, License: NOASSERTION, Stargazers: 2943, Issues: 0

pytube

A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.

Language: Python, License: Unlicense, Stargazers: 11734, Issues: 0

recognize-anything

Open-source and strong foundation image recognition models.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 2634, Issues: 0

Language: Python, Stargazers: 1382, Issues: 0

PLLaVA

Official repository for the paper PLLaVA

Language: Python, Stargazers: 501, Issues: 0

MultimodalRecSys

A curated list of awesome resources about multimodal recommender systems.

License: GPL-3.0, Stargazers: 259, Issues: 0

LaVIT

LaVIT: Empower the Large Language Model to Understand and Generate Visual Content

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 462, Issues: 0

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language: Python, License: Apache-2.0, Stargazers: 3110, Issues: 0

Awesome-CVPR2024-ECCV2024-AIGC

A Collection of Papers and Codes for CVPR2024/ECCV2024 AIGC

Stargazers: 368, Issues: 0

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language: Python, License: BSD-3-Clause, Stargazers: 487, Issues: 0

nvitop

An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.

Language: Python, License: Apache-2.0, Stargazers: 4359, Issues: 0

k-diffusion

Karras et al. (2022) diffusion models for PyTorch

Language: Python, License: MIT, Stargazers: 2209, Issues: 0

fvd-comparison

Comparison between Frechet Video Distance implementation from StyleGAN-V and the original paper

Language: Python, Stargazers: 70, Issues: 0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language: Python, License: Apache-2.0, Stargazers: 20963, Issues: 0

OOTDiffusion

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Language: Python, License: NOASSERTION, Stargazers: 5207, Issues: 0

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

Language: Python, Stargazers: 461, Issues: 0

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.

Language: Python, License: MIT, Stargazers: 11039, Issues: 0

minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Language: Python, License: Apache-2.0, Stargazers: 1134, Issues: 0

LVDM

LVDM: Latent Video Diffusion Models for High-Fidelity Long Video Generation

Language: Python, License: MIT, Stargazers: 428, Issues: 0

LaVie

LaVie: High-Quality Video Generation with Cascaded Latent Diffusion Models

Language: Python, License: Apache-2.0, Stargazers: 793, Issues: 0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python, License: AGPL-3.0, Stargazers: 2591, Issues: 0