cseti007

cseti007

Geek Repo

Github PK Tool:Github PK Tool

cseti007's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:119Issues:0Issues:0

ComfyUI_ProPainter_Nodes

🖌️ ComfyUI implementation of ProPainter framework for video inpainting.

Language:PythonLicense:GPL-3.0Stargazers:203Issues:0Issues:0

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:66356Issues:0Issues:0

SenseVoice

Multilingual Voice Understanding Model

Language:PythonLicense:NOASSERTIONStargazers:2213Issues:0Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:5600Issues:0Issues:0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonLicense:MPL-2.0Stargazers:32884Issues:0Issues:0

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonLicense:Apache-2.0Stargazers:4084Issues:0Issues:0

MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Language:PythonLicense:AGPL-3.0Stargazers:9348Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:8378Issues:0Issues:0

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5688Issues:0Issues:0

ComfyUI-3D-Pack

An extensive node suite that enables ComfyUI to process 3D inputs (Mesh & UV Texture, etc) using cutting edge algorithms (3DGS, NeRF, etc.)

Language:PythonLicense:MITStargazers:2032Issues:0Issues:0

SEINE

[ICLR 2024] SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonLicense:Apache-2.0Stargazers:874Issues:0Issues:0

krita-ai-diffusion

Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.

Language:PythonLicense:GPL-3.0Stargazers:6139Issues:0Issues:0

stable-diffusion-webui-extensions

Extension index for stable-diffusion-webui

Stargazers:472Issues:0Issues:0

ECCV2022-RIFE

ECCV2022 - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Language:PythonLicense:MITStargazers:4275Issues:0Issues:0

frame-interpolation

FILM: Frame Interpolation for Large Motion, In ECCV 2022.

Language:PythonLicense:Apache-2.0Stargazers:2794Issues:0Issues:0

sd-webui-lcm

Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI

Language:PythonLicense:MITStargazers:612Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18855Issues:0Issues:0

Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Language:PythonLicense:MITStargazers:1881Issues:0Issues:0

TensorRT

NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.

Language:C++License:Apache-2.0Stargazers:10469Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:10119Issues:0Issues:0

fast-stable-diffusion

fast-stable-diffusion + DreamBooth

Language:PythonLicense:MITStargazers:7457Issues:0Issues:0

sd-webui-deforum

Deforum extension for AUTOMATIC1111's Stable Diffusion webui

Language:PythonLicense:NOASSERTIONStargazers:2650Issues:0Issues:0