yuqiangCAS

yuqiangCAS

Geek Repo

Github PK Tool:Github PK Tool

yuqiangCAS's starred repositories

MGM

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Language:PythonLicense:Apache-2.0Stargazers:2834Issues:0Issues:0

Mora

Mora: More like Sora for Generalist Video Generation

Language:Jupyter NotebookStargazers:1219Issues:0Issues:0

open-webui

User-friendly WebUI for LLMs (Formerly Ollama WebUI)

Language:SvelteLicense:MITStargazers:15617Issues:0Issues:0

ollama

Get up and running with Llama 3, Mistral, Gemma, and other large language models.

Language:GoLicense:MITStargazers:59492Issues:0Issues:0

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:4990Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1265Issues:0Issues:0
Language:PythonStargazers:351Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:15158Issues:0Issues:0

minisora

MiniSora: A community aims to explore the implementation path and future development direction of Sora.

Language:PythonLicense:Apache-2.0Stargazers:1008Issues:0Issues:0

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language:PythonLicense:Apache-2.0Stargazers:1325Issues:0Issues:0

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:10080Issues:0Issues:0

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

Language:PythonLicense:Apache-2.0Stargazers:18396Issues:0Issues:0

torch-ngp

A pytorch CUDA extension implementation of instant-ngp (sdf and nerf), with a GUI.

Language:PythonLicense:MITStargazers:2004Issues:0Issues:0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonLicense:MITStargazers:2333Issues:0Issues:0

OpenAgents

OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonLicense:Apache-2.0Stargazers:3399Issues:0Issues:0

OpenFace

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

Language:MATLABLicense:NOASSERTIONStargazers:6592Issues:0Issues:0

sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Language:PythonStargazers:2747Issues:0Issues:0

QuickLook

Bring macOS “Quick Look” feature to Windows

Language:C#License:GPL-3.0Stargazers:16205Issues:0Issues:0

Rerender_A_Video

[SIGGRAPH Asia 2023] Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2886Issues:0Issues:0

Silent-Face-Anti-Spoofing

静默活体检测(Silent-Face-Anti-Spoofing)

Language:PythonLicense:Apache-2.0Stargazers:1229Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:19624Issues:0Issues:0

Awesome-Deblurring

A curated list of resources for Image and Video Deblurring

Stargazers:2253Issues:0Issues:0

Fantasia3D

(ICCV2023) official repository for "Fantasia3D: Disentangling Geometry and Appearance for High-quality Text-to-3D Content Creation"

Language:PythonLicense:Apache-2.0Stargazers:656Issues:0Issues:0
Language:PythonStargazers:1670Issues:0Issues:0

Paddle

PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)

Language:C++License:Apache-2.0Stargazers:21608Issues:0Issues:0

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonStargazers:21254Issues:0Issues:0

YUView

The Free and Open Source Cross Platform YUV Viewer with an advanced analytics toolset

Language:C++License:NOASSERTIONStargazers:1730Issues:0Issues:0

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonLicense:NOASSERTIONStargazers:34933Issues:0Issues:0

MNN

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba

Language:C++Stargazers:8294Issues:0Issues:0

roop

one-click face swap

Language:PythonLicense:GPL-3.0Stargazers:24853Issues:0Issues:0