canqin001's starred repositories

Language:PythonStargazers:1Issues:0Issues:0

switch-cuda

A simple bash script for switching between installed versions of CUDA.

Language:ShellLicense:MITStargazers:567Issues:0Issues:0

Video-Dataset-Loading-Pytorch

Generic PyTorch dataset implementation to load and augment VIDEOS for deep learning training loops.

Language:PythonLicense:BSD-2-ClauseStargazers:440Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:62532Issues:0Issues:0

Online-RLHF

A recipe for online RLHF.

Language:PythonStargazers:342Issues:0Issues:0

scaling_on_scales

When do we not need larger vision models?

Language:PythonLicense:MITStargazers:273Issues:0Issues:0

MiniGPT4-video

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Language:PythonLicense:BSD-3-ClauseStargazers:486Issues:0Issues:0

DoRA

[ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation

Language:PythonLicense:NOASSERTIONStargazers:481Issues:0Issues:0

SoM-LLaVA

[COLM-2024] List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs

Language:PythonStargazers:103Issues:0Issues:0

emoji-cheat-sheet

A markdown version emoji cheat sheet

Language:TypeScriptLicense:MITStargazers:12146Issues:0Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:24510Issues:0Issues:0

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:175Issues:0Issues:0

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language:PythonLicense:NOASSERTIONStargazers:2638Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2582Issues:0Issues:0

SQ-LLaVA

Visual self-questioning for large vision-language assistant.

Language:PythonLicense:MITStargazers:17Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20902Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:134Issues:0Issues:0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:2225Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1536Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5772Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18366Issues:0Issues:0

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1088Issues:0Issues:0

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:6002Issues:0Issues:0

VBench

[CVPR2024 Highlight] VBench - We Evaluate Video Generation

Language:PythonLicense:Apache-2.0Stargazers:415Issues:0Issues:0

AIGCBench

Official repo for AIGCBench: Comprehensive Evaluation of Image-to-Video Content Generated by AI

Language:PythonLicense:Apache-2.0Stargazers:26Issues:0Issues:0

decord

An efficient video loader for deep learning with smart shuffling that's super easy to digest

Language:C++License:Apache-2.0Stargazers:1745Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34022Issues:0Issues:0

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

Stargazers:10864Issues:0Issues:0
Language:PythonStargazers:256Issues:0Issues:0
Stargazers:733Issues:0Issues:0