caozhenxiang-kouji

caozhenxiang-kouji

Geek Repo

Github PK Tool:Github PK Tool

caozhenxiang-kouji's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25182Issues:222Issues:452

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24241Issues:192Issues:3822

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:20877Issues:180Issues:403

Awesome-Chinese-LLM

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language:HTMLLicense:MITStargazers:10503Issues:266Issues:45

FinRL

FinRL: Financial Reinforcement Learning. 🔥

Language:Jupyter NotebookLicense:MITStargazers:9462Issues:199Issues:707

nerfstudio

A collaboration friendly studio for NeRFs

Language:PythonLicense:Apache-2.0Stargazers:8981Issues:113Issues:1555

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8446Issues:60Issues:1436

PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.

Language:PythonLicense:Apache-2.0Stargazers:8445Issues:90Issues:2078

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:7773Issues:53Issues:2343

awesome-NeRF

A curated list of awesome neural radiance fields papers

Language:TeXLicense:MITStargazers:6366Issues:236Issues:19

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5697Issues:66Issues:406

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4539Issues:46Issues:121

VisualGLM-6B

Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型

Language:PythonLicense:Apache-2.0Stargazers:4046Issues:40Issues:347

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonLicense:NOASSERTIONStargazers:3932Issues:65Issues:70

torch-ngp

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Language:PythonLicense:MITStargazers:2053Issues:42Issues:196

TradeMaster

TradeMaster is an open-source platform for quantitative trading empowered by reinforcement learning :fire: :zap: :rainbow:

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1260Issues:36Issues:66

Awesome-Talking-Face

📖 A curated list of resources dedicated to talking face.

AvatarCLIP

[SIGGRAPH 2022 Journal Track] AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars

Language:PythonLicense:NOASSERTIONStargazers:1055Issues:20Issues:20

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:865Issues:30Issues:94

taichi-nerfs

Implementations of NeRF variants based on Taichi + PyTorch

Language:PythonLicense:Apache-2.0Stargazers:717Issues:14Issues:31

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:651Issues:12Issues:99

nerfshop

NeRFshop: Interactive Editing of Neural Radiance Fields

Language:CudaLicense:NOASSERTIONStargazers:442Issues:17Issues:24

sft_datasets

开源SFT数据集整理,随时补充

3d-cinemagraphy

[CVPR 2023] 3D Cinemagraphy from a Single Image

Language:PythonLicense:Apache-2.0Stargazers:254Issues:29Issues:6

Awesome_Multimodel_LLM

Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context learning, visual reasoning, foundational models, and more. Stay updated with the latest advancement.

mobile-deeplab-v3-plus

Deeplab-V3+ model with MobilenetV2/MobilenetV3 on TensorFlow for mobile deployment.

Language:PythonLicense:MITStargazers:160Issues:4Issues:17

samurai

SAMURAI: Shape And Material from Unconstrained Real-world Arbitrary Image collections - NeurIPS2022

Language:PythonLicense:Apache-2.0Stargazers:97Issues:8Issues:9